llama-stack

History

Xi Yan c1d18283d2 feat(eval api): (2.2/n) delete eval / scoring / scoring_fn apis (#1700 ) # What does this PR do? - To make it easier, delete existing `eval/scoring/scoring_function` apis. There will be a bunch of broken impls here. The sequence is: 1. migrate benchmark graders 2. clean up existing scoring functions - Add a skeleton evaluation impl to make tests pass. ## Test Plan tested in following PRs [//]: # (## Documentation)		2025-03-19 11:04:23 -07:00
..
bedrock	Fix precommit check after moving to ruff (#927 )	2025-02-02 06:46:45 -08:00
common	feat(eval api): (2.2/n) delete eval / scoring / scoring_fn apis (#1700 )	2025-03-19 11:04:23 -07:00
datasetio	feat(dataset api): (1.5/n) fix dataset registeration (#1659 )	2025-03-15 16:48:09 -07:00
inference	fix: agents with non-llama model (#1550 )	2025-03-17 22:11:06 -07:00
kvstore	chore: made inbuilt tools blocking calls into async non blocking calls (#1509 )	2025-03-09 16:59:24 -07:00
memory	fix(deps): move chardet and pypdf imports inline where used (#1434 )	2025-03-06 17:09:14 -08:00
scoring	feat: [new open benchmark] Math 500 (#1538 )	2025-03-10 20:38:28 -07:00
telemetry	refactor: move all datetime.now() calls to UTC (#1589 )	2025-03-13 15:34:53 -07:00
__init__.py	API Updates (#73 )	2024-09-17 19:51:35 -07:00