lex-eval 0.2.2
Provides LLM-as-judge and code-based evaluators for scoring LLM outputs, with built-in templates for hallucination, relevance, and toxicity detection.
Provides LLM-as-judge and code-based evaluators for scoring LLM outputs, with built-in templates for hallucination, relevance, and toxicity detection.