llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-06-28 02:53:30 +00:00

History

ehhuang bb2690f176 feat: remove special handling of builtin::rag tool (#1015 ) Summary: Lets the model decide which tool it needs to call to respond to a query. Test Plan: ``` LLAMA_STACK_CONFIG=fireworks pytest -s -v tests/client-sdk/ --safety-shield meta-llama/Llama-Guard-3-8B ``` Also evaluated on a small benchmark with 20 questions from HotpotQA. With this PR and some prompting, the performance is 77% recall compared to 50% currently. --- [//]: # (BEGIN SAPLING FOOTER) Stack created with [Sapling](https://sapling-scm.com). Best reviewed with [ReviewStack](https://reviewstack.dev/meta-llama/llama-stack/pull/1015). * #1268 * #1239 * __->__ #1015		2025-02-26 13:04:52 -08:00
..
routers	feat: remove special handling of builtin::rag tool (#1015 )	2025-02-26 13:04:52 -08:00
server	refactor(server): replace print statements with logger (#1250 )	2025-02-25 21:31:37 -08:00
store	build: format codebase imports using ruff linter (#1028 )	2025-02-13 10:06:21 -08:00
ui	fix!: update eval-tasks -> benchmarks (#1032 )	2025-02-13 16:40:58 -08:00
utils	refactor: combine start scripts for each env (#1139 )	2025-02-24 16:53:31 -08:00
__init__.py	API Updates (#73 )	2024-09-17 19:51:35 -07:00
build.py	feat: add --run to llama stack build (#1156 )	2025-02-23 22:06:09 -05:00
build_conda_env.sh	feat: Add a new template for `dell` (#978 )	2025-02-06 14:14:39 -08:00
build_container.sh	fix: avoid failure when no special pip deps and better exit (#1228 )	2025-02-24 13:18:52 -05:00
build_venv.sh	fix: build_venv expects an extra argument (#1233 )	2025-02-25 11:08:50 -08:00
client.py	chore: move all Llama Stack types from llama-models to llama-stack (#1098 )	2025-02-14 09:10:59 -08:00
common.sh	fix: Fixing some small issues with the build scripts (#1132 )	2025-02-19 22:20:49 -08:00
configure.py	build: format codebase imports using ruff linter (#1028 )	2025-02-13 10:06:21 -08:00
configure_container.sh	More generic image type for OCI-compliant container technologies (#802 )	2025-01-17 16:37:42 -08:00
datatypes.py	fix!: update eval-tasks -> benchmarks (#1032 )	2025-02-13 16:40:58 -08:00
distribution.py	fix!: update eval-tasks -> benchmarks (#1032 )	2025-02-13 16:40:58 -08:00
inspect.py	fix: improve signal handling and update dependencies (#1044 )	2025-02-13 08:07:59 -08:00
library_client.py	fix: update virtualenv building so llamastack- prefix is not added, make notebook experience easier (#1225 )	2025-02-23 16:57:11 -08:00
request_headers.py	Add X-LlamaStack-Client-Version, rename ProviderData -> Provider-Data (#735 )	2025-01-09 11:51:36 -08:00
resolver.py	fix!: update eval-tasks -> benchmarks (#1032 )	2025-02-13 16:40:58 -08:00
stack.py	feat: adding endpoints for files and uploads (#1070 )	2025-02-20 13:09:00 -08:00
start_stack.sh	refactor: combine start scripts for each env (#1139 )	2025-02-24 16:53:31 -08:00
start_venv.sh	fix: update virtualenv building so llamastack- prefix is not added, make notebook experience easier (#1225 )	2025-02-23 16:57:11 -08:00