llama-stack-mirror/llama_stack/distribution
ehhuang bb2690f176
feat: remove special handling of builtin::rag tool (#1015)
Summary:

Lets the model decide which tool it needs to call to respond to a query.

Test Plan:
```
LLAMA_STACK_CONFIG=fireworks pytest -s -v tests/client-sdk/ --safety-shield meta-llama/Llama-Guard-3-8B
```

Also evaluated on a small benchmark with 20 questions from HotpotQA.
With this PR and some prompting, the performance is 77% recall compared
to 50% currently.

---
[//]: # (BEGIN SAPLING FOOTER)
Stack created with [Sapling](https://sapling-scm.com). Best reviewed
with
[ReviewStack](https://reviewstack.dev/meta-llama/llama-stack/pull/1015).
* #1268
* #1239
* __->__ #1015
2025-02-26 13:04:52 -08:00
..
routers feat: remove special handling of builtin::rag tool (#1015) 2025-02-26 13:04:52 -08:00
server refactor(server): replace print statements with logger (#1250) 2025-02-25 21:31:37 -08:00
store build: format codebase imports using ruff linter (#1028) 2025-02-13 10:06:21 -08:00
ui fix!: update eval-tasks -> benchmarks (#1032) 2025-02-13 16:40:58 -08:00
utils refactor: combine start scripts for each env (#1139) 2025-02-24 16:53:31 -08:00
__init__.py API Updates (#73) 2024-09-17 19:51:35 -07:00
build.py feat: add --run to llama stack build (#1156) 2025-02-23 22:06:09 -05:00
build_conda_env.sh feat: Add a new template for dell (#978) 2025-02-06 14:14:39 -08:00
build_container.sh fix: avoid failure when no special pip deps and better exit (#1228) 2025-02-24 13:18:52 -05:00
build_venv.sh fix: build_venv expects an extra argument (#1233) 2025-02-25 11:08:50 -08:00
client.py chore: move all Llama Stack types from llama-models to llama-stack (#1098) 2025-02-14 09:10:59 -08:00
common.sh fix: Fixing some small issues with the build scripts (#1132) 2025-02-19 22:20:49 -08:00
configure.py build: format codebase imports using ruff linter (#1028) 2025-02-13 10:06:21 -08:00
configure_container.sh More generic image type for OCI-compliant container technologies (#802) 2025-01-17 16:37:42 -08:00
datatypes.py fix!: update eval-tasks -> benchmarks (#1032) 2025-02-13 16:40:58 -08:00
distribution.py fix!: update eval-tasks -> benchmarks (#1032) 2025-02-13 16:40:58 -08:00
inspect.py fix: improve signal handling and update dependencies (#1044) 2025-02-13 08:07:59 -08:00
library_client.py fix: update virtualenv building so llamastack- prefix is not added, make notebook experience easier (#1225) 2025-02-23 16:57:11 -08:00
request_headers.py Add X-LlamaStack-Client-Version, rename ProviderData -> Provider-Data (#735) 2025-01-09 11:51:36 -08:00
resolver.py fix!: update eval-tasks -> benchmarks (#1032) 2025-02-13 16:40:58 -08:00
stack.py feat: adding endpoints for files and uploads (#1070) 2025-02-20 13:09:00 -08:00
start_stack.sh refactor: combine start scripts for each env (#1139) 2025-02-24 16:53:31 -08:00
start_venv.sh fix: update virtualenv building so llamastack- prefix is not added, make notebook experience easier (#1225) 2025-02-23 16:57:11 -08:00