llama-stack

forked from phoenix-oss/llama-stack-mirror

History

ehhuang bb2690f176 feat: remove special handling of builtin::rag tool (#1015 ) Summary: Lets the model decide which tool it needs to call to respond to a query. Test Plan: ``` LLAMA_STACK_CONFIG=fireworks pytest -s -v tests/client-sdk/ --safety-shield meta-llama/Llama-Guard-3-8B ``` Also evaluated on a small benchmark with 20 questions from HotpotQA. With this PR and some prompting, the performance is 77% recall compared to 50% currently. --- [//]: # (BEGIN SAPLING FOOTER) Stack created with [Sapling](https://sapling-scm.com). Best reviewed with [ReviewStack](https://reviewstack.dev/meta-llama/llama-stack/pull/1015). * #1268 * #1239 * __->__ #1015		2025-02-26 13:04:52 -08:00
..
agents	feat: remove special handling of builtin::rag tool (#1015 )	2025-02-26 13:04:52 -08:00
datasetio	build: format codebase imports using ruff linter (#1028 )	2025-02-13 10:06:21 -08:00
eval	chore!: deprecate eval/tasks (#1186 )	2025-02-20 14:06:21 -08:00
inference	fix: resolve type hint issues and import dependencies (#1176 )	2025-02-25 11:06:47 -08:00
ios/inference	chore: removed executorch submodule (#1265 )	2025-02-25 21:57:21 -08:00
post_training	feat: [post training] support save hf safetensor format checkpoint (#845 )	2025-02-25 23:29:08 -08:00
safety	chore: move all Llama Stack types from llama-models to llama-stack (#1098 )	2025-02-14 09:10:59 -08:00
scoring	feat: add aggregation_functions to llm_as_judge_405b_simpleqa (#1164 )	2025-02-19 19:42:04 -08:00
telemetry	build: format codebase imports using ruff linter (#1028 )	2025-02-13 10:06:21 -08:00
tool_runtime	feat: remove special handling of builtin::rag tool (#1015 )	2025-02-26 13:04:52 -08:00
vector_io	Fix sqlite_vec config defaults	2025-02-20 17:50:33 -08:00
__init__.py	`impls` -> `inline`, `adapters` -> `remote` (#381 )	2024-11-06 14:54:05 -08:00