qualspec 0.0.1
Define qualitative evaluation criteria and let an LLM judge if responses pass. Perfect for testing AI agents, comparing models, and evaluating subjective qualities.
Gemfile:
=
instalar:
=
dependencias de Runtime (1):
faraday
~> 2.0