Measures quality metrics such as faithfulness, relevance, context precision, and answer correctness for LLM and RAG applications. Think Ragas or DeepEval, but for Ruby.
Johannes Dwi Cahyo
March 11, 2026 11:16am
MIT