llama-stack

forked from phoenix-oss/llama-stack-mirror

Xi Yan c1d18283d2 feat(eval api): (2.2/n) delete eval / scoring / scoring_fn apis (#1700)	2025-03-19 11:04:23 -07:00

# What does this PR do?
- To make it easier, delete existing `eval/scoring/scoring_function` apis. There will be a bunch of broken impls here. The sequence is:
  1. migrate benchmark graders
  2. clean up existing scoring functions
- Add a skeleton evaluation impl to make tests pass.

## Test Plan
tested in following PRs
agents	feat(agent): support multiple tool groups (#1556)	2025-03-17 22:13:09 -07:00
datasetio	precommit	2025-03-17 17:08:21 -07:00
eval	feat(eval api): (2.2/n) delete eval / scoring / scoring_fn apis (#1700)	2025-03-19 11:04:23 -07:00
evaluation/meta_reference	feat(eval api): (2.2/n) delete eval / scoring / scoring_fn apis (#1700)	2025-03-19 11:04:23 -07:00
inference	fix: avoid tensor memory error (#1688)	2025-03-18 16:17:29 -07:00
ios/inference	chore: removed executorch submodule (#1265)	2025-02-25 21:57:21 -08:00
post_training	chore: fix mypy violations in post_training modules (#1548)	2025-03-18 14:58:16 -07:00
safety	feat(agent): support multiple tool groups (#1556)	2025-03-17 22:13:09 -07:00
scoring	feat(dataset api): (1.6/n) fix all iterrows callsites (#1660)	2025-03-15 17:24:16 -07:00
telemetry	refactor: move all datetime.now() calls to UTC (#1589)	2025-03-13 15:34:53 -07:00
tool_runtime	chore: Make code interpreter async (#1654)	2025-03-18 14:13:46 -07:00
vector_io	feat: Qdrant inline provider (#1273)	2025-03-18 14:04:21 -07:00
__init__.py	`impls` -> `inline`, `adapters` -> `remote` (#381)	2024-11-06 14:54:05 -08:00