
LightRAG

LightRAG: Lean RAG with Benchmarks

·884 words·5 mins
LightRAG is a minimal RAG toolkit that strips away heavy abstractions. Here’s a complete build with code, performance numbers versus a LangChain baseline, and when LightRAG is the right choice.

TL;DR

- LightRAG is a minimal RAG stack: FAISS + embeddings + prompt composition, ~120 lines.
- ~20% faster p50 latency vs LangChain on small corpora (≤ 500 chunks) due to fewer abstractions.
- Best for: serverless/edge deployments, small teams, single-purpose Q&A.
- Use LangChain instead when you need agents, tracing, callbacks, or multi-step workflows.
- Don’t skip data quality: clean text, handle missing values, validate numeric tables before indexing.

Why LightRAG #

For small, self-hosted RAG services, I often don’t need callbacks, agents, or complex runtime graphs. I need:
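The "FAISS + embeddings + prompt composition" shape of that stack can be sketched in a few lines. This is a toy illustration only: a brute-force cosine search stands in for FAISS, and a bag-of-words counter stands in for a real embedding model; the `TinyRAG` class and its methods are made-up names, not LightRAG's actual API.

```python
# Minimal sketch of the index -> retrieve -> compose-prompt loop.
# Brute-force cosine search stands in for FAISS; a toy bag-of-words
# "embedding" stands in for a real embedding model.
import math
from collections import Counter


def embed(text: str) -> Counter:
    # Toy embedding: lowercase bag-of-words counts (not a real model).
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


class TinyRAG:
    def __init__(self, chunks: list[str]):
        self.chunks = chunks
        self.vectors = [embed(c) for c in chunks]  # "index" step

    def retrieve(self, query: str, k: int = 2) -> list[str]:
        q = embed(query)
        ranked = sorted(range(len(self.chunks)),
                        key=lambda i: cosine(q, self.vectors[i]),
                        reverse=True)
        return [self.chunks[i] for i in ranked[:k]]

    def compose_prompt(self, query: str, k: int = 2) -> str:
        context = "\n".join(self.retrieve(query, k))
        return f"Answer using only this context:\n{context}\n\nQuestion: {query}"


rag = TinyRAG([
    "LightRAG keeps retrieval explicit and fast.",
    "FAISS is a vector similarity search library.",
    "BM25 ranks documents by keyword overlap.",
])
print(rag.retrieve("fast explicit retrieval", k=1))
# → ['LightRAG keeps retrieval explicit and fast.']
```

The point is that nothing here is hidden behind a framework: every step is a plain function you can profile and swap out.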

LightRAG as a LangChain Retriever

·594 words·3 mins
Want LightRAG’s lean retrieval with LangChain’s chain ecosystem? Here’s how to wrap LightRAG as a LangChain-compatible retriever, keeping retrieval explicit and fast while using LangChain for everything downstream.

TL;DR

- Implement BaseRetriever._get_relevant_documents to make any retriever LangChain-compatible.
- LightRAG’s FAISS retrieval slots straight into LangChain chains, LCEL, and agents.
- Use this pattern when incrementally migrating an existing LangChain pipeline to leaner retrieval.
- For full LangChain pipelines without constraints, the standard LangChain retriever is fine.

Why Combine LightRAG with LangChain #

LightRAG gives you minimal, fast retrieval. LangChain gives you chains, agents, and tooling. Sometimes you want both:
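The adapter pattern above can be sketched without installing anything. To keep the example self-contained, a stub `Document` dataclass stands in for `langchain_core.documents.Document`, and `LightRAGRetriever` is a plain class rather than a real `BaseRetriever` subclass; in an actual project you would subclass `langchain_core.retrievers.BaseRetriever` and override `_get_relevant_documents`. The `fake_search` function is a hypothetical stand-in for LightRAG's FAISS lookup.

```python
# Sketch of wrapping a lean search function behind LangChain's retriever
# contract. Stubs replace langchain_core's Document and BaseRetriever so
# the example runs standalone.
from dataclasses import dataclass, field


@dataclass
class Document:                      # stand-in for langchain_core's Document
    page_content: str
    metadata: dict = field(default_factory=dict)


class LightRAGRetriever:
    """Adapts a plain (query, k) -> [(text, score)] search function."""

    def __init__(self, search_fn, k: int = 3):
        self.search_fn = search_fn   # e.g. a FAISS top-k search
        self.k = k

    def _get_relevant_documents(self, query: str) -> list[Document]:
        # The method LangChain's BaseRetriever expects you to implement.
        hits = self.search_fn(query, self.k)
        return [Document(page_content=text, metadata={"score": score})
                for text, score in hits]

    def invoke(self, query: str) -> list[Document]:
        return self._get_relevant_documents(query)


# Hypothetical search function standing in for LightRAG's FAISS lookup.
def fake_search(query: str, k: int):
    corpus = [("LightRAG keeps retrieval minimal.", 0.92),
              ("LangChain provides chains and agents.", 0.81)]
    return corpus[:k]


retriever = LightRAGRetriever(fake_search, k=1)
docs = retriever.invoke("what is LightRAG?")
print(docs[0].page_content)
# → LightRAG keeps retrieval minimal.
```

Because the wrapper returns `Document` objects, everything downstream (prompt templates, chains, agents) sees an ordinary retriever and never needs to know what produced the hits.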

BM25 Hybrid Search with LightRAG

·643 words·4 mins
Vector search misses keyword-heavy queries. BM25 misses semantic similarity. Combine both with hybrid search for better retrieval recall.

TL;DR

- Vector search (FAISS): great for semantic/paraphrase queries, bad for exact codes or IDs.
- BM25: great for keyword/exact matches, bad for synonyms and paraphrases.
- Hybrid with RRF: combines both rank lists; no score normalization needed.
- Start with vector_weight=0.5. Lower it if users search exact product codes frequently.

Why Hybrid Search #

Pure vector search struggles with:
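Reciprocal Rank Fusion works on rank positions alone, which is why no score normalization is needed. A minimal sketch, with a `vector_weight` knob as described above; the doc IDs, ranked lists, and the `rrf_fuse` helper are illustrative, and `c=60` is the constant commonly used in RRF.

```python
# Reciprocal Rank Fusion: each list contributes weight / (c + rank + 1)
# per document, so only rank positions matter, never raw scores.
def rrf_fuse(vector_ranked, bm25_ranked, vector_weight=0.5, c=60):
    """Fuse two ranked lists of doc ids; best-fused doc comes first."""
    scores = {}
    for rank, doc in enumerate(vector_ranked):
        scores[doc] = scores.get(doc, 0.0) + vector_weight / (c + rank + 1)
    for rank, doc in enumerate(bm25_ranked):
        scores[doc] = scores.get(doc, 0.0) + (1 - vector_weight) / (c + rank + 1)
    return sorted(scores, key=scores.get, reverse=True)


vector_ranked = ["d2", "d1", "d3"]   # semantic matches
bm25_ranked = ["d4", "d2", "d1"]     # exact keyword matches (e.g. product codes)
print(rrf_fuse(vector_ranked, bm25_ranked))
# → ['d2', 'd1', 'd4', 'd3']
```

Note how d2, which appears near the top of both lists, beats d4, which only BM25 found; lowering `vector_weight` toward 0 would push the keyword-only hit d4 up, which is the tuning advice above for code-heavy queries.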