ask-eval 0.1.0
Test LLM outputs with Minitest-native assertions. LLM-as-judge for faithfulness, hallucination, bias, toxicity. Deterministic assertions (contains, regex, JSON). CI-native: GitHub annotations, JUnit output, cost tracking.
Test LLM outputs with Minitest-native assertions. LLM-as-judge for faithfulness, hallucination, bias, toxicity. Deterministic assertions (contains, regex, JSON). CI-native: GitHub annotations, JUnit output, cost tracking.