Computes quality metrics such as faithfulness, relevance, context precision, and answer correctness for LLM and RAG applications. Think Ragas or DeepEval, but for Ruby.
Johannes Dwi Cahyo
March 8, 2026 3:20am
MIT