Reranking for Better RAG Retrieval29 January 2025·513 words·3 minsLLM EngineeringBi-encoder retrieval is fast but imprecise. Cross-encoder reranking improves top-k precision at the cost of some latency. Here’s when and how to add it.