01 Production benchmarking comparison against Ailog's RAG API
02 Comprehensive retrieval metrics including Recall@K, Precision@K, and MRR
03 Detailed latency analysis for retrieval, generation, and P95 thresholds
04 1 GitHub star
05 Automated test dataset generation from existing indexed documents
06 LLM-as-a-judge generation metrics for faithfulness, relevance, and coherence
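
The retrieval and latency metrics named in the list above have standard definitions, and a minimal sketch of them may help make the terms concrete. The function names here (`recall_at_k`, `precision_at_k`, `mean_reciprocal_rank`, `p95`) are hypothetical and are not taken from this tool or from Ailog's API. The sketch assumes each query yields a ranked list of unique document IDs plus a set of known-relevant IDs, and it uses the nearest-rank method for P95, which is only one of several percentile conventions.

```python
from typing import Iterable, Sequence, Set, Tuple


def recall_at_k(retrieved: Sequence[str], relevant: Set[str], k: int) -> float:
    """Fraction of the relevant documents that appear in the top-k results."""
    if not relevant:
        return 0.0
    return len(set(retrieved[:k]) & relevant) / len(relevant)


def precision_at_k(retrieved: Sequence[str], relevant: Set[str], k: int) -> float:
    """Fraction of the k result slots filled by relevant documents.

    Assumes the IDs in `retrieved` are unique.
    """
    if k <= 0:
        return 0.0
    hits = sum(1 for doc_id in retrieved[:k] if doc_id in relevant)
    return hits / k


def reciprocal_rank(retrieved: Sequence[str], relevant: Set[str]) -> float:
    """1 / rank of the first relevant result, or 0.0 if none was retrieved."""
    for rank, doc_id in enumerate(retrieved, start=1):
        if doc_id in relevant:
            return 1.0 / rank
    return 0.0


def mean_reciprocal_rank(runs: Iterable[Tuple[Sequence[str], Set[str]]]) -> float:
    """Average reciprocal rank over (retrieved, relevant) pairs, one per query."""
    scores = [reciprocal_rank(retrieved, relevant) for retrieved, relevant in runs]
    return sum(scores) / len(scores) if scores else 0.0


def p95(latencies_ms: Sequence[float]) -> float:
    """Nearest-rank 95th percentile of a non-empty list of latencies."""
    if not latencies_ms:
        raise ValueError("latencies_ms must not be empty")
    ordered = sorted(latencies_ms)
    # Integer form of ceil(0.95 * n), which avoids floating-point rounding.
    rank = (95 * len(ordered) + 99) // 100
    return ordered[rank - 1]


if __name__ == "__main__":
    # Illustrative data only; not results from any real benchmark.
    retrieved = ["d3", "d1", "d7", "d2"]
    relevant = {"d1", "d2"}
    print(recall_at_k(retrieved, relevant, k=3))
    print(precision_at_k(retrieved, relevant, k=3))
    print(mean_reciprocal_rank([(retrieved, relevant), (["a", "b"], {"a"})]))
    print(p95([120, 80, 200, 95, 110]))
```

With small samples, nearest-rank P95 often equals the maximum observed latency, so a threshold check based on it is only meaningful once enough requests have been recorded.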