01Integrates Anthropic’s 2024 Contextual Retrieval techniques to reduce retrieval failures.
02Standardizes evaluation using RAGAS, TruLens, and metrics such as nDCG and groundedness.
03Optimizes retrieval using hybrid fusion (BM25 + vector) and cross-encoder reranking.
04Provides decision trees for selecting vector indices like HNSW and IVF based on dataset scale.
05Implements advanced chunking strategies including page-level and semantic methods for higher accuracy.
0622 GitHub stars