A tool for comprehensively evaluating and comparing responses from multiple large language models to provide actionable insights for improvement.
-
Updated
Oct 25, 2024
A tool for comprehensively evaluating and comparing responses from multiple large language models to provide actionable insights for improvement.
Add a description, image, and links to the modelevaluator topic page so that developers can more easily learn about it.
To associate your repository with the modelevaluator topic, visit your repo's landing page and select "manage topics."