llama-stack

forked from phoenix-oss/llama-stack-mirror


Xi Yan 2b7d70ba86 [Evals API][11/n] huggingface dataset provider + mmlu scoring fn (#392) * wip * scoring fn api * eval api * eval task * evaluate api update * pre commit * unwrap context -> config * config field doc * typo * naming fix * separate benchmark / app eval * api name * rename * wip tests * wip * datasetio test * delete unused * fixture * scoring resolve * fix scoring register * scoring test pass * score batch * scoring fix * fix eval * test eval works * huggingface provider * datasetdef files * mmlu scoring fn * test wip * remove type ignore * api refactor * add default task_eval_id for routing * add eval_id for jobs * remove type ignore * huggingface provider * wip huggingface register * only keep 1 run_eval * fix optional * register task required * register task required * delete old tests * fix * mmlu loose * refactor * msg * fix tests * move benchmark task def to file * msg * gen openapi * openapi gen * move dataset to hf llamastack repo * remove todo * refactor * add register model to unit test * rename * register to client * delete preregistered dataset/eval task * comments * huggingface -> remote adapter * openapi gen	2024-11-11 14:49:50 -05:00
..
agents	add dynamic clients for all APIs (#348)	2024-10-31 14:46:25 -07:00
batch_inference	Remove "routing_table" and "routing_key" concepts for the user (#201)	2024-10-10 10:24:13 -07:00
common	[Evals API][4/n] evals with generation meta-reference impl (#303)	2024-10-25 13:12:39 -07:00
datasetio	[Evals API][3/n] scoring_functions / scoring meta-reference implementations (#296)	2024-10-24 14:52:30 -07:00
datasets	persist registered objects with distribution (#354)	2024-11-04 17:25:06 -08:00
eval	[Evals API][11/n] huggingface dataset provider + mmlu scoring fn (#392)	2024-11-11 14:49:50 -05:00
eval_tasks	[Evals API][10/n] API updates for EvalTaskDef + new test migration (#379)	2024-11-07 21:24:12 -08:00
inference	migrate model to Resource and new registration signature (#410)	2024-11-08 16:12:57 -08:00
inspect	Remove "routing_table" and "routing_key" concepts for the user (#201)	2024-10-10 10:24:13 -07:00
memory	Remove "routing_table" and "routing_key" concepts for the user (#201)	2024-10-10 10:24:13 -07:00
memory_banks	[bugfix] fix case for agent when memory bank registered without specifying provider_id (#264)	2024-10-17 17:28:17 -07:00
models	migrate model to Resource and new registration signature (#410)	2024-11-08 16:12:57 -08:00
post_training	[Evals API][4/n] evals with generation meta-reference impl (#303)	2024-10-25 13:12:39 -07:00
safety	Resource oriented design for shields (#399)	2024-11-08 12:16:11 -08:00
scoring	[Evals API][10/n] API updates for EvalTaskDef + new test migration (#379)	2024-11-07 21:24:12 -08:00
scoring_functions	[Evals API][10/n] API updates for EvalTaskDef + new test migration (#379)	2024-11-07 21:24:12 -08:00
shields	Resource oriented design for shields (#399)	2024-11-08 12:16:11 -08:00
synthetic_data_generation	[Evals API][4/n] evals with generation meta-reference impl (#303)	2024-10-25 13:12:39 -07:00
telemetry	Remove "routing_table" and "routing_key" concepts for the user (#201)	2024-10-10 10:24:13 -07:00
__init__.py	API Updates (#73)	2024-09-17 19:51:35 -07:00
resource.py	Resource oriented design for shields (#399)	2024-11-08 12:16:11 -08:00