Measures quality metrics like faithfulness, relevance, context precision, and answer correctness for LLM and RAG applications. Think Ragas or DeepEval for Ruby.
Johannes Dwi Cahyo
March 10, 2026 8:26am
MIT