# What does this PR do?

This PR makes it possible to switch between agentic and non-agentic RAG when running the respective Playground page. When non-agentic RAG is selected, user queries are answered by directly querying the vector DB, augmenting the prompt, and sending the extended prompt to the model via the Inference API (see the sketch after the test plan).

## Test Plan

- Launch the Playground and go to the RAG page.
- Select the vector DB ID.
- Adjust other configuration parameters if necessary.
- Set the radio button to Agent-based RAG.
- Send a message to the chat; the query is answered by an agent using the knowledge search tool, as indicated by the output.
- Click the 'Clear Chat' button to make it possible to switch modes.
- Send a message to the chat again; this time, the query is answered by the model directly, as can be deduced from the reply.
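For reference, the non-agentic path boils down to retrieve, augment, infer. Below is a minimal sketch of that flow against a locally running stack; the `base_url`, the `non_agentic_rag` helper, the prompt template, and the exact client call signatures are illustrative assumptions, not the Playground implementation itself.

```python
# Sketch of non-agentic RAG: query the vector DB directly, augment the
# prompt with the retrieved chunks, and call the Inference API.
# Assumes a Llama Stack server on localhost; IDs and field names may
# differ from the actual Playground code.
from llama_stack_client import LlamaStackClient

client = LlamaStackClient(base_url="http://localhost:8321")


def non_agentic_rag(user_query: str, vector_db_id: str, model_id: str) -> str:
    # 1. Retrieve: query the vector DB directly for relevant chunks.
    result = client.vector_io.query(vector_db_id=vector_db_id, query=user_query)
    context = "\n".join(str(chunk.content) for chunk in result.chunks)

    # 2. Augment: extend the prompt with the retrieved context.
    extended_prompt = (
        "Answer the question using only the context below.\n"
        f"Context:\n{context}\n\n"
        f"Question: {user_query}"
    )

    # 3. Infer: send the extended prompt to the model via the Inference API,
    #    with no agent or tool-calling involved.
    response = client.inference.chat_completion(
        model_id=model_id,
        messages=[{"role": "user", "content": extended_prompt}],
    )
    return response.completion_message.content
```

The agentic mode, by contrast, hands the query to an agent configured with the knowledge search tool, so retrieval happens as a tool call inside the agent turn rather than as an explicit pre-processing step.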