llama-stack-mirror/llama_stack/providers/tests/eval
Xi Yan b4416b72fd
Folder restructure for evals/datasets/scoring (#419)
* rename evals related stuff

* fix datasetio

* fix scoring test

* localfs -> LocalFS

* refactor scoring

* refactor scoring

* remove 8b_correctness scoring_fn from tests

* tests w/ eval params

* scoring fn braintrust fixture

* import
2024-11-11 17:35:40 -05:00
__init__.py [Evals API][4/n] evals with generation meta-reference impl (#303) 2024-10-25 13:12:39 -07:00
conftest.py [Evals API][11/n] huggingface dataset provider + mmlu scoring fn (#392) 2024-11-11 14:49:50 -05:00
constants.py Folder restructure for evals/datasets/scoring (#419) 2024-11-11 17:35:40 -05:00
fixtures.py [Evals API][10/n] API updates for EvalTaskDef + new test migration (#379) 2024-11-07 21:24:12 -08:00
test_eval.py Folder restructure for evals/datasets/scoring (#419) 2024-11-11 17:35:40 -05:00