smileval

funny evaluator for llms, possibly crossplatform.

usage

A test config is included as an example.

python evaluator.py --config test_config.yml

An evaluation matrix is done per namespace which can also be changed in the config. Each test should have it's unique name which will be a column header which should be changed in config. Don't like yaml files? We use jsonargparse so you can pass a bunch of cli args as well.

generate csv to import into spreadsheet

Run

python generate-report.py experiments/yournamespace

and check your namespace folder after

Name		Name	Last commit message	Last commit date
Latest commit History 44 Commits
smileval		smileval
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
TODO.md		TODO.md
bootstrap.py		bootstrap.py
evaluator.py		evaluator.py
generate_report.py		generate_report.py
requirements.txt		requirements.txt
test.csv		test.csv
test.py		test.py
test_config.py		test_config.py
test_config.yml		test_config.yml
test_config_gemini.yml		test_config_gemini.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

smileval

usage

generate csv to import into spreadsheet

About

Releases

Packages

Languages

License

ctf-gg/smileval

Folders and files

Latest commit

History

Repository files navigation

smileval

usage

generate csv to import into spreadsheet

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages