Weaviate + LangChain + Next.js Integration: Technical Reference
Architecture Overview
Component Flow:
User Query → Next.js API Routes → LangChain Processing → Weaviate Vector Store → AI Generation → Response
Core Technologies:
- Weaviate v3: Vector database with gRPC transport for 30% performance improvement (when working)
- LangChain.js: AI workflow orchestration with memory leak issues
- Next.js: Frontend framework serving as the most reliable component
Critical Configuration Requirements
Production-Ready Settings
Weaviate Client Configuration:
```typescript
// Singleton: cache one client (and its connection pool) per process.
// Note: connectToWeaviateCloud takes the cluster URL as its first argument
// and returns a Promise, so it must be awaited.
import weaviate, { type WeaviateClient } from 'weaviate-client';

let clientPromise: Promise<WeaviateClient> | undefined;

export function getWeaviateClient(): Promise<WeaviateClient> {
  clientPromise ??= weaviate.connectToWeaviateCloud(process.env.WEAVIATE_URL!, {
    authCredentials: new weaviate.ApiKey(process.env.WEAVIATE_API_KEY!),
    headers: {
      'X-OpenAI-Api-Key': process.env.OPENAI_API_KEY!,
    },
  });
  return clientPromise;
}
```
Schema Requirements:
- Use the `text-embedding-3-small` model with 1536 dimensions (industry standard)
- Implement proper tokenization with the `word` setting
- Include vectorizers with source properties for optimal embeddings
HNSW Vector Index Settings:
- Default settings assume 100 documents, not 100,000+
- Use 1024 dimensions instead of 1536 for 3x performance improvement with minimal quality loss
- Configure connection pooling (will still leak memory)
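The tuning above can be sketched as a concrete starting point. This assumes the weaviate-client v3 `configure` helpers (shown in comments); the parameter values are assumptions to benchmark against ~1M documents, not gospel:

```typescript
// Sketch: tuned HNSW parameters for ~1M documents, not the 100-doc defaults.
// Treat these numbers as starting points to benchmark, not final values.
const hnswTuning = {
  efConstruction: 256, // higher = better recall at build time, slower ingest
  maxConnections: 32,  // graph degree; raise for large datasets, costs RAM
  ef: 128,             // query-time beam width; tune against your latency budget
};

// With a connected v3 client this would be applied roughly like:
//
//   await client.collections.create({
//     name: 'Documents',
//     vectorizers: weaviate.configure.vectorizer.text2VecOpenAI({
//       model: 'text-embedding-3-small',
//       dimensions: 1024, // 1024 instead of 1536, per the note above
//     }),
//     vectorIndex: weaviate.configure.vectorIndex.hnsw(hnswTuning),
//   });
```

Benchmark recall and p95 latency after any change to these; they trade off against each other and against RAM.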
Failure Modes and Critical Warnings
Connection Issues
Timeout Patterns:
- EU morning hours (7-9 AM UTC): Weaviate Cloud overwhelmed
- Connection timeouts during peak usage
- gRPC connection failures with cryptic error messages
Memory Leaks:
- LangChain.js vector store singleton consumes increasing RAM
- Node processes become zombies after ~100 requests
- Restart required every few hours in production
Performance Degradation
Breaking Points:
- UI becomes unusable at 1000+ spans in distributed transactions
- Search latency >2 seconds causes user abandonment
- System crawls at 1M+ documents with default settings
- 10M+ documents require serious tuning and higher costs
Resource Requirements:
- Next.js app: 1-2GB RAM (not 512MB as documented)
- Weaviate connections: 200-400MB with connection pooling
- LangChain overhead: 300-500MB due to memory leaks
- Buffer for growth: 1-2GB minimum
Implementation Best Practices
Security Architecture
API Key Management:
```typescript
// Server-side only: API routes are mandatory
import { NextRequest, NextResponse } from 'next/server';

export async function POST(req: NextRequest) {
  // All vector operations must run server-side;
  // client-side vector queries expose API keys
  return NextResponse.json({ ok: true }); // placeholder: call Weaviate here
}
```
Critical Security Rules:
- Never expose vector operations to client-side code
- API keys will end up in browser source within hours if not properly isolated
- Use environment variables with production-specific validation
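The environment-variable validation can be as small as a fail-fast helper that runs at startup instead of at the first query. A minimal sketch; the variable names match the ones used elsewhere in this document:

```typescript
// Sketch: crash at boot if a required secret is missing,
// instead of returning cryptic errors on the first vector query.
function requireEnv(name: string): string {
  const value = process.env[name];
  if (!value || value.length === 0) {
    throw new Error(`Missing required environment variable: ${name}`);
  }
  return value;
}

// Validate everything server-side, once, before accepting traffic:
// const weaviateUrl = requireEnv('WEAVIATE_URL');
// const weaviateKey = requireEnv('WEAVIATE_API_KEY');
// const openaiKey = requireEnv('OPENAI_API_KEY');
```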
Document Processing
Chunking Strategy:
- Large documents cost $2-5 to vectorize (50-page PDF)
- Batch processing in 50-100 document chunks (not thousands)
- Background jobs mandatory for large datasets
- Budget $200-500 monthly for embedding costs
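The 50-100 document batching is a one-liner worth getting right: one bad batch should fail alone instead of taking a run of thousands down with it. A minimal sketch (`ingestBatch` is a hypothetical function standing in for your Weaviate insert call):

```typescript
// Sketch: split documents into batches of 50-100 before vectorizing/ingesting.
function toBatches<T>(items: T[], batchSize = 75): T[][] {
  if (batchSize <= 0) throw new Error('batchSize must be positive');
  const batches: T[][] = [];
  for (let i = 0; i < items.length; i += batchSize) {
    batches.push(items.slice(i, i + batchSize));
  }
  return batches;
}

// Usage (hypothetical ingest function):
// for (const batch of toBatches(documents, 75)) {
//   await ingestBatch(batch); // run from a background job, not a request handler
// }
```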
Metadata Structure:
```typescript
metadata: {
  category: "docs",       // strings work reliably
  date: "2025-09-06",     // dates as strings, not Date objects
  score: 5,               // numbers are safe
  tags: ["api", "auth"]   // arrays work until they don't
}
```
Search Implementation
Search Strategy Trade-offs:
- Pure vector search: Fast but returns weird results for exact terms
- Keyword search: Works but defeats vector purpose
- Hybrid search: Adds 200ms latency, tune alpha parameter extensively
- Filtered search: Breaks silently when metadata schema changes
Performance Optimization:
- Limit results to 20 maximum (users don't read beyond 10)
- Cache everything with Redis (+$50/month AWS cost)
- Avoid hybrid search unless explicitly required by users
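If you do need hybrid search, enforce the limits above in code rather than trusting callers. A minimal sketch that clamps the knobs; the `query.hybrid` usage in the comment assumes the v3 collection API, and the collection/query names are placeholders:

```typescript
// Sketch: clamp hybrid-search parameters to the limits recommended above.
// alpha = 0 is pure keyword, alpha = 1 is pure vector; tune it, don't guess it.
function hybridOptions(alpha: number, limit: number) {
  return {
    alpha: Math.min(1, Math.max(0, alpha)),
    limit: Math.min(20, Math.max(1, limit)), // never return more than 20 results
  };
}

// With a v3 collection handle, roughly:
// const docs = client.collections.get('Documents');
// const result = await docs.query.hybrid('reset api key', hybridOptions(0.6, 20));
```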
Cost Analysis
Monthly Operational Costs
Minimum Production Budget:
- Weaviate Cloud: $200-500/month (scales poorly)
- OpenAI embeddings: $50-200/month (depends on document volume)
- LangSmith tracing: $30/month (for failure monitoring)
- Vercel functions: Variable (timeout costs multiply)
- Redis caching: $50/month (additional failure point)
Hidden Costs:
- 40% development time for debugging random failures
- Weekend debugging sessions for connection timeouts
- Memory leak investigation and process restarts
Development Environment Setup
Node.js Version Requirements
Critical Version Issues:
- Node 18.2.0-18.4.0: Contains App Router bugs causing random crashes
- Use Node 18.17.0+ for stability
- Windows PATH limit issues during npm installation
Dependency Management
```shell
npm install @langchain/weaviate @langchain/core @langchain/openai weaviate-client uuid dotenv
npm install -D @types/uuid
```
Installation Failure Scenarios:
- Peer dependency conflicts with newer Next.js versions
- Windows PATH limitations during installation
- Incompatible package versions requiring exact pinning
Local Development Issues
Docker Environment:
```shell
# The v3 client talks gRPC as well as HTTP, so expose port 50051 too
docker run -p 8080:8080 -p 50051:50051 semitechnologies/weaviate:latest
```
Common Problems:
- Works for one week, then port conflicts emerge
- Use separate collections for testing to avoid production data loss
- Memory requirements exceed typical development machine capacity
Production Deployment Considerations
Monitoring Requirements
Critical Metrics:
- Error rate >5%: System failure imminent
- Search latency >2 seconds: User abandonment
- Memory usage >80%: Process restart required
- Monthly AWS bill tracking: Embedding costs spiral unexpectedly
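The ">80% memory" threshold from the list above can be a runnable check rather than a dashboard you look at after the crash. A sketch, assuming a 2GB container limit (set it to your actual limit):

```typescript
// Sketch: restart before the leak kills the process.
// The 2GB budget is an assumption; match it to your container's limit.
const MEMORY_LIMIT_BYTES = 2 * 1024 * 1024 * 1024;

function shouldRestart(rssBytes: number, limitBytes = MEMORY_LIMIT_BYTES): boolean {
  return rssBytes / limitBytes > 0.8;
}

// Poll periodically and exit non-zero so the supervisor restarts the process:
// setInterval(() => {
//   if (shouldRestart(process.memoryUsage().rss)) process.exit(1);
// }, 30_000);
```

This only works if something (pm2, Kubernetes, systemd) is actually configured to restart the process on exit.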
Error Handling Patterns
Connection Recovery:
```typescript
// Test the connection before running operations
await client.collections.listAll();
```
Debugging Strategy:
- LangSmith tracing for real-time failure observation
- Generic error messages provide no actionable information
- Connection timeout debugging requires systematic elimination
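Since many of these gRPC failures are transient, the connection test (and any flaky Weaviate call) is worth wrapping in retries with exponential backoff. A minimal, generic sketch:

```typescript
// Sketch: retry a flaky async call with exponential backoff.
async function withRetry<T>(
  fn: () => Promise<T>,
  attempts = 3,
  baseDelayMs = 200,
): Promise<T> {
  let lastError: unknown;
  for (let i = 0; i < attempts; i++) {
    try {
      return await fn();
    } catch (err) {
      lastError = err;
      // wait 200ms, 400ms, 800ms, ... between attempts
      await new Promise((resolve) => setTimeout(resolve, baseDelayMs * 2 ** i));
    }
  }
  throw lastError;
}

// Usage: await withRetry(() => client.collections.listAll());
```

Keep the attempt count low on user-facing paths; three retries at peak latency already pushes past the 2-second abandonment threshold.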
Scaling Limitations
Performance Degradation:
- 1M documents: Performance starts declining
- 10M+ documents: Requires HNSW parameter expertise
- Hybrid search: +200ms per query (noticeable to users)
- Multi-tenancy: Undocumented tenant limits cause silent failures
Framework-Specific Implementation
Next.js Architecture Decisions
Recommended Patterns:
- Use API routes for all vector operations
- Avoid Server Components for vector queries (debugging impossible)
- Client-side hydration issues with large datasets
- Edge functions unusable due to memory limitations
Deployment Platforms:
- Vercel: Edge functions timeout with vector queries
- Railway: Simple for prototypes, questionable for production
- AWS App Runner: Enterprise complexity, justified only by enterprise problems
LangChain Integration Issues
Memory Management:
- Vector store singleton pattern leaks memory
- Long-running processes consume increasing RAM
- Restart processes every few hours in production
- Complex chains fail with cryptic error messages
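The "restart every few hours" workaround can at least be automated by a process manager's memory cap. A sketch assuming pm2 as the supervisor (swap in your own if you use something else):

```javascript
// ecosystem.config.js (pm2) - assumes a pm2 deployment; automates the
// "restart before the leak kills you" workaround described above.
module.exports = {
  apps: [
    {
      name: 'rag-app',
      script: 'npm',
      args: 'start',
      max_memory_restart: '1500M', // restart before the leak reaches the container limit
      cron_restart: '0 */4 * * *', // belt and suspenders: restart every 4 hours
    },
  ],
};
```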
Document Processing:
- PDF processing fails with weird encoding
- Tool integration introduces new failure points
- Chain orchestration complexity scales poorly
Troubleshooting Guide
Common Error Patterns
ECONNREFUSED 127.0.0.1:8080:
- Environment variables incorrect
- Weaviate Cloud service down
- API key expired (no notification)
- Rate limits hit (silent failures)
TypeScript Compilation Errors:
- Generated types incorrect for complex schemas
- Create custom interfaces for production use
- Official examples skip error-prone scenarios
Performance Issues:
- Default HNSW parameters assume toy datasets
- Embedding dimensions too large (use 1024 vs 1536)
- Network latency between regions significant
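On the dimensions point: the text-embedding-3 family supports shortened vectors, either via the API's `dimensions` request parameter or by truncating client-side, in which case you must re-normalize to unit length or similarity scores drift. A minimal sketch of the client-side version:

```typescript
// Sketch: shorten a text-embedding-3-* vector and re-normalize it.
// (OpenAI's embeddings API also accepts a `dimensions` parameter that
// does this server-side, which is simpler if you control the request.)
function truncateEmbedding(vector: number[], dimensions: number): number[] {
  const cut = vector.slice(0, dimensions);
  const norm = Math.sqrt(cut.reduce((sum, x) => sum + x * x, 0));
  if (norm === 0) return cut;
  return cut.map((x) => x / norm);
}
```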
Nuclear Fixes
Development Environment:
```shell
docker system prune -a && docker-compose up
```
Success rate: 60% of the time, it works every time.
Production Environment:
- Restart Next.js process (memory leaks accumulate)
- Clear connection pools
- Verify environment variable propagation
Alternative Approaches
Technology Substitutions
Vector Database Alternatives:
- Pinecone: More expensive, better performance
- ChromaDB: Free, significantly worse results
- pgvector: Requires PostgreSQL expertise
Embedding Models:
- OpenAI: Reliable, reasonable rate limits
- Cohere: Higher cost, variable results
- Local models: Free, terrible performance
- Azure OpenAI: Same as OpenAI with more bureaucracy
Architecture Patterns
Simplified Stack:
- Remove LangChain for direct API calls (less abstraction, fewer failures)
- Use REST instead of gRPC (slower but more predictable)
- Implement custom error handling vs framework abstractions
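The "REST instead of gRPC" option means plain GraphQL against Weaviate's `/v1/graphql` endpoint. A sketch; the class name, returned property, and query shape are placeholders to adapt to your schema:

```typescript
// Sketch: build a nearVector GraphQL query for Weaviate's REST endpoint.
// 'title' and the class name are placeholders for your own schema.
function nearVectorQuery(className: string, vector: number[], limit: number): string {
  return `{
    Get {
      ${className}(nearVector: { vector: [${vector.join(', ')}] }, limit: ${limit}) {
        title
        _additional { distance }
      }
    }
  }`;
}

// Usage (assumes the env vars used earlier in this document):
// const res = await fetch(`${process.env.WEAVIATE_URL}/v1/graphql`, {
//   method: 'POST',
//   headers: {
//     'Content-Type': 'application/json',
//     Authorization: `Bearer ${process.env.WEAVIATE_API_KEY}`,
//   },
//   body: JSON.stringify({ query: nearVectorQuery('Documents', vec, 10) }),
// });
```

Slower than gRPC, but every failure is an HTTP status code you can actually read.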
Success Criteria
Minimum Viable Production
Technical Requirements:
- Error rate <5%
- Search latency <2 seconds
- Memory usage monitoring
- Automated restart capabilities
- Proper backup and recovery
Operational Requirements:
- 40% development time budget for debugging
- Weekend availability for production issues
- Monitoring and alerting for all failure modes
- Rollback plan for updates
Performance Benchmarks
Expected Improvements:
- gRPC: 20-30% improvement in real-world conditions (not 60% as marketed)
- Vector search: Sub-second response for <100K documents
- Hybrid search: +200ms latency penalty
- Memory usage: Increases linearly with request volume
Failure Thresholds:
- 1000+ spans: UI debugging becomes impossible
- 1M+ documents: Performance degradation begins
- 10M+ documents: Requires expertise and budget increases
- Connection timeout: System becomes unreliable
This technical reference provides the operational intelligence needed for successful implementation while acknowledging the real-world challenges and failure modes that official documentation omits.
Useful Links for Further Investigation
Resources That Actually Help (Sometimes)
Link | Description |
---|---|
Weaviate v3 TypeScript Client | Provides good examples for the Weaviate v3 TypeScript client, though it struggles with complex scenarios and has incorrect types for nested schemas. |
Weaviate Academy - Next.js Tutorial | A tutorial demonstrating Weaviate integration with Next.js, which functions well in a controlled environment but often fails when used with real-world data. |
Hybrid Search Docs | A comprehensive guide to implementing hybrid search in Weaviate, which can significantly increase search latency and is only recommended if explicit keyword matching is a user requirement. |
Vector Configuration | A useful setup guide for configuring vector settings in Weaviate, but it notably omits crucial "gotchas" or potential pitfalls that developers might encounter during implementation. |
Multi-tenancy Guide | Explores enterprise patterns for multi-tenancy within Weaviate, which are functional until developers encounter undocumented limitations that can hinder scalability and performance. |
WeaviateStore API | A complete API reference for LangChain's WeaviateStore, providing detailed information on its functionalities, though its examples often lack comprehensive error handling demonstrations. |
LangChain Weaviate Guide | The official documentation for integrating LangChain with Weaviate, offering a basic overview but providing limited insights into critical production-level considerations and best practices. |
LangChain.js Concepts | Offers a good conceptual overview of LangChain.js, explaining its core principles and components, but notably omits any discussion regarding potential memory leak issues. |
Document Processing | Covers the basic ingestion of documents using LangChain's document loaders, but fails to address important aspects like embedding costs or potential failures during the processing pipeline. |
Next.js App Router | A guide to the modern architecture of the Next.js App Router, providing surprisingly well-written and comprehensive documentation for developers. |
Server Actions | Documentation on Next.js Server Actions, with a strong recommendation against using them for vector operations due to the extreme difficulty in debugging issues. |
API Routes | A solid guide for creating API endpoints in Next.js, explicitly recommended as the appropriate location for housing and managing your vector operations. |
Weaviate Next.js Template | A basic starter template for integrating Weaviate with Next.js, which functions adequately for small datasets (around 100 documents) but fails to perform at larger scales. |
LangChain Next.js Starter | Provides integration examples for LangChain with Next.js, but notably lacks robust error handling mechanisms and is known to include memory leaks, making it unsuitable for production. |
Vercel Template | A "production-ready" template from Vercel for LangChain, which often times out when processing real-world queries, making it primarily suitable for demonstrations rather than actual deployment. |
Weaviate TypeScript Recipes | A collection of common Weaviate patterns implemented in TypeScript, where some examples are outdated while others remain functional and useful for developers. |
Weaviate Code Examples | The official collection of Weaviate code examples, offering a more comprehensive range of demonstrations compared to the simpler starter templates available. |
Hybrid Search Demo | Interactive examples demonstrating hybrid search, which effectively illustrate why this advanced search technique is often an unnecessary and overly complex solution for most use cases. |
Weaviate Introduction Tutorial | A beginner-friendly video tutorial that provides a comprehensive introduction to Weaviate, covering its initial setup and fundamental concepts for new users. |
Weaviate Official Channel | The official YouTube channel for Weaviate, featuring product demos and marketing content; it's advisable to check the comments section for more realistic user experiences and feedback. |
Enterprise Workflows Guide | A comprehensive guide published in July 2025 focusing on enterprise workflows with LangChain and Weaviate, which is heavily marketing-oriented and provides limited insight into actual production pain points. |
gRPC Performance Benchmarks | Presents actual performance data and benchmarks for gRPC improvements in Weaviate, but the reported results often do not accurately reflect real-world operational conditions. |
Custom Chatbots Tutorial | A practical tutorial on building custom chatbots using Next.js, LangChain, OpenAI, and Supabase, notable for being one of the rare resources that openly discusses actual development problems. |
LangSmith | A monitoring and debugging tool for LangChain, costing $30/month, which allows real-time observation of system failures and is particularly useful for managing complex AI chains. |
Weaviate Cloud Console | The cloud dashboard for Weaviate, which generally functions well for monitoring, but its utility diminishes significantly when urgent debugging of critical issues is required. |
OpenAI Usage Dashboard | A dashboard for tracking OpenAI API usage, particularly useful for monitoring embedding costs, which often accumulate much faster than developers initially anticipate. |
Weaviate Docker Setup | Instructions for setting up a local Weaviate environment using Docker Compose, which typically works smoothly for about a week before encountering persistent port conflicts. |
Weaviate Docker Compose Templates | Provides pre-configured Docker Compose templates for Weaviate, strongly advising users to select one and adhere to it to avoid endless debugging of configuration issues. |
Weaviate CLI Tools | A collection of command-line utilities for Weaviate, which are useful for various tasks but introduce another dependency that can potentially cause installation or operational issues. |
Weaviate Slack Community | An active Slack community for Weaviate users, offering a platform for quick discussions and generally providing faster responses compared to traditional forums. |
LangChain Discord | An active Discord community for LangChain, suitable for asking quick questions and getting immediate feedback, but generally ineffective for resolving complex debugging challenges. |
Next.js GitHub Discussions | The official GitHub Discussions forum for Next.js, providing a reliable platform for seeking help and finding solutions to framework-related issues. |
Weaviate Professional Services | Offers enterprise consulting services for Weaviate, which are expensive but provide access to experts who are knowledgeable about undocumented issues and complex implementation challenges. |
LangChain Enterprise Support | Commercial enterprise support for LangChain, which is a viable option for organizations with substantial budgets seeking dedicated assistance and advanced problem-solving. |
Weaviate Cloud Services | Managed hosting services for Weaviate, with a minimum cost of $200-500 per month, known to experience downtime during European morning hours. |
Vercel | A popular platform for Next.js deployment, where edge functions frequently timeout when handling vector queries, though it performs excellently for other types of applications. |
Railway | A simple deployment platform suitable for prototypes and quick projects, but its reliability and suitability for robust production environments are often questionable. |
AWS App Runner | A container deployment service from AWS, offering enterprise-grade complexity that is typically justified only when addressing equally complex enterprise-level problems and requirements. |
Kubernetes Helm Charts | Helm charts for deploying Weaviate on Kubernetes, considered "production-ready" for those who are prepared to spend late nights debugging complex YAML configurations. |
Kubernetes Deployment Guide | A guide to deploying Weaviate using Kubernetes container patterns, which functions effectively until the system encounters scaling challenges under increased load. |
Performance Tuning | An optimization guide for Weaviate performance, which implicitly assumes that users possess an infinite amount of time and patience to implement and fine-tune its recommendations. |
OpenAI Integration | Documentation for integrating OpenAI's GPT models with Weaviate, which offers reliable functionality but comes with associated costs for API usage. |
Cohere Integration | Integration guide for Cohere, marketed as "enterprise-grade" and accompanied by corresponding high costs, with actual performance and results often varying significantly. |
Hugging Face Integration | Documentation for integrating Hugging Face open-source models with Weaviate, which offers a free solution but typically delivers significantly poorer performance compared to commercial alternatives. |
Local Models | Guide to using self-hosted local models with Weaviate, which incurs zero direct cost but often results in a noticeable compromise in the quality and accuracy of the model's output. |
Multi-modal Search | Explores multi-modal search capabilities in Weaviate, allowing for text, image, and audio queries, which makes for impressive demos but can become a significant production challenge. |
GraphQL API | Documentation for Weaviate's GraphQL API, enabling advanced querying, but introducing an additional layer of complexity that can make debugging more challenging. |
Backup & Recovery | A guide to Weaviate's backup and recovery features, essential for data protection and absolutely necessary for mitigating data loss when unforeseen issues inevitably arise. |
Stack Overflow | A widely used platform for developers to find real-world problems and practical solutions related to Weaviate, often providing more candid and effective answers than official documentation. |
GitHub Issues | The official GitHub repository's issue tracker for Weaviate, serving as a crucial resource for reporting bugs, tracking known issues, and discovering community-contributed workarounds. |
Weaviate Community Forum | An official community forum for Weaviate users, providing a platform for honest discussions and sharing practical insights about what truly works in real-world scenarios. |