llama-stack-mirror/llama_stack
Xi Yan 6192bf43a4
[Evals API][10/n] API updates for EvalTaskDef + new test migration (#379)
* wip

* scoring fn api

* eval api

* eval task

* evaluate api update

* pre commit

* unwrap context -> config

* config field doc

* typo

* naming fix

* separate benchmark / app eval

* api name

* rename

* wip tests

* wip

* datasetio test

* delete unused

* fixture

* scoring resolve

* fix scoring register

* scoring test pass

* score batch

* scoring fix

* fix eval

* test eval works

* remove type ignore

* api refactor

* add default task_eval_id for routing

* add eval_id for jobs

* remove type ignore

* only keep 1 run_eval

* fix optional

* register task required

* register task required

* delete old tests

* delete old tests

* fixture return impl
2024-11-07 21:24:12 -08:00
..
apis [Evals API][10/n] API updates for EvalTaskDef + new test migration (#379) 2024-11-07 21:24:12 -08:00
cli Kill llama stack configure (#371) 2024-11-06 13:32:10 -08:00
distribution [Evals API][10/n] API updates for EvalTaskDef + new test migration (#379) 2024-11-07 21:24:12 -08:00
providers [Evals API][10/n] API updates for EvalTaskDef + new test migration (#379) 2024-11-07 21:24:12 -08:00
scripts Add a test for CLI, but not fully done so disabled 2024-09-19 13:27:07 -07:00
templates Remove the safety adapter for Together; we can just use "meta-reference" (#387) 2024-11-06 17:36:57 -08:00
__init__.py API Updates (#73) 2024-09-17 19:51:35 -07:00