Skip to main content
  1. Categories/

LLM Engineering

RAG

·9 words·1 min
This is a collection of blog posts about RAG.

RAG for Knowledge-Intensive Tasks

·791 words·4 mins
Picture this: You’re asking an AI about cancer treatments. It sounds super confident and gives you detailed answers. But here’s the problem — it just made up a medical study that doesn’t exist.

Document Summarization: Eval First

·408 words·2 mins
Document summarization is a critical NLP task that helps users quickly grasp key information from long documents. But how do you know if your model is actually working? This guide shows a workflow that starts with evaluation and acceptance criteria before touching models.

RAG with LangChain: Architecture, Code, and Metrics

·1240 words·6 mins
RAG is a design pattern, not a product. LangChain supports it out of the box. This guide shows a production-ready RAG setup in LangChain with architecture, retrieval choices, runnable code, evaluation metrics, and trade-offs from my client projects.

LightRAG: Lean RAG with Benchmarks

·800 words·4 mins
LightRAG is a minimal RAG toolkit that strips away heavy abstractions. Here’s a complete build with code, performance numbers versus a LangChain baseline, and when LightRAG is the right choice.