Mirror of https://github.com/meta-llama/llama-stack.git, synced 2025-06-29 19:34:19 +00:00.
# What does this PR do?

This PR makes it possible to switch between agentic and non-agentic RAG on the respective Playground page. When non-agentic RAG is selected, user queries are answered by directly querying the vector DB, augmenting the prompt, and sending the extended prompt to the model via the Inference API.

## Test Plan

- Launch the Playground and go to the RAG page.
- Select the vector DB ID.
- Adjust other configuration parameters if necessary.
- Set the radio button to Agent-based RAG.
- Send a message to the chat.
- The query will be answered by an agent using the knowledge search tool, as indicated by the output.
- Click the 'Clear Chat' button to make it possible to switch modes.
- Send a message to the chat again.
- This time, the query will be answered by the model directly, as can be deduced from the reply.
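The non-agentic path described above (retrieve from the vector DB, augment the prompt, call inference) can be sketched as follows. This is a minimal illustration, not the actual Llama Stack client API: `retrieve`, `augment_prompt`, and the `infer` callable are hypothetical stand-ins, and the keyword-overlap scoring merely stands in for a real vector similarity search.

```python
def retrieve(query: str, documents: list[str], top_k: int = 2) -> list[str]:
    # Stand-in for a vector DB query: rank documents by naive
    # keyword overlap with the query (a real system would use embeddings).
    q_terms = set(query.lower().split())
    scored = sorted(documents, key=lambda d: -len(q_terms & set(d.lower().split())))
    return scored[:top_k]


def augment_prompt(query: str, chunks: list[str]) -> str:
    # Prepend the retrieved chunks as context for the model.
    context = "\n".join(f"- {c}" for c in chunks)
    return f"Use the context below to answer.\nContext:\n{context}\nQuestion: {query}"


def answer_non_agentic(query: str, documents: list[str], infer) -> str:
    # Non-agentic RAG: retrieve, augment, then a single inference call;
    # `infer` stands in for the call to the Inference API.
    chunks = retrieve(query, documents)
    prompt = augment_prompt(query, chunks)
    return infer(prompt)
```

In the agentic mode, by contrast, an agent decides when to invoke the knowledge search tool itself, so retrieval is not hard-wired before the model call.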