llama-stack/llama_stack/providers

Latest commit bb2690f176 by ehhuang (2025-02-26 13:04:52 -08:00):
feat: remove special handling of builtin::rag tool (#1015)
Summary:

Lets the model decide which tool it needs to call to respond to a query.
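In practice this surfaces in client code by attaching builtin::rag to an agent as an ordinary toolgroup, so the model chooses per turn whether to invoke it or answer directly. A minimal sketch, assuming the llama-stack-client Python SDK agent API from around this release; the base URL, model id, and vector DB id are illustrative and not taken from this PR:

```python
# Hypothetical sketch: builtin::rag is registered as a regular toolgroup and the
# model decides whether to call it for a given query. Endpoint, model id, and
# vector_db_ids below are placeholders, not values from this PR.
from llama_stack_client import LlamaStackClient
from llama_stack_client.lib.agents.agent import Agent
from llama_stack_client.lib.agents.event_logger import EventLogger
from llama_stack_client.types.agent_create_params import AgentConfig

client = LlamaStackClient(base_url="http://localhost:8321")

agent_config = AgentConfig(
    model="meta-llama/Llama-3.3-70B-Instruct",
    instructions="Use the knowledge search tool when the question needs document context.",
    toolgroups=[
        {
            "name": "builtin::rag",
            "args": {"vector_db_ids": ["my_docs"]},  # placeholder vector DB id
        }
    ],
    enable_session_persistence=False,
)

agent = Agent(client, agent_config)
session_id = agent.create_session("rag-demo")

# The model, not the framework, decides whether to invoke the RAG tool here.
response = agent.create_turn(
    messages=[{"role": "user", "content": "Which of the two films was released first?"}],
    session_id=session_id,
)
for log in EventLogger().log(response):
    log.print()
```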

Test Plan:
```
LLAMA_STACK_CONFIG=fireworks pytest -s -v tests/client-sdk/ --safety-shield meta-llama/Llama-Guard-3-8B
```

Also evaluated on a small benchmark of 20 questions from HotpotQA.
With this PR and some prompting, recall improves from 50% to 77%.

---
[//]: # (BEGIN SAPLING FOOTER)
Stack created with [Sapling](https://sapling-scm.com). Best reviewed
with
[ReviewStack](https://reviewstack.dev/meta-llama/llama-stack/pull/1015).
* #1268
* #1239
* __->__ #1015
| Name | Last commit message | Last commit date |
|---|---|---|
| inline | feat: remove special handling of builtin::rag tool (#1015) | 2025-02-26 13:04:52 -08:00 |
| registry | feat: add (openai, anthropic, gemini) providers via litellm (#1267) | 2025-02-25 22:07:33 -08:00 |
| remote | fix: Get builtin tool calling working in remote-vllm (#1236) | 2025-02-26 15:25:47 -05:00 |
| tests | feat: add (openai, anthropic, gemini) providers via litellm (#1267) | 2025-02-25 22:07:33 -08:00 |
| utils | fix: make vision and embedding tests pass with openai, anthropic and gemini | 2025-02-26 11:24:01 -08:00 |
| __init__.py | API Updates (#73) | 2024-09-17 19:51:35 -07:00 |
| datatypes.py | chore: move all Llama Stack types from llama-models to llama-stack (#1098) | 2025-02-14 09:10:59 -08:00 |