forked from phoenix-oss/llama-stack-mirror

Michael Clifford c6e93e32f6 feat: Updated playground rag to use session id for persistent conversation (#1870)	2025-04-08 09:46:13 +02:00

What does this PR do?

This PR updates the playground RAG example (llama_stack/distribution/ui/page/playground/rag.py) so that the agent can use its built-in conversation history. It uses Streamlit's `cache_resource` functionality to prevent the agent from re-initializing after every interaction, and stores the agent's session_id in `session_state`. This lets the agent in the RAG example behave more like it does when used through the python-client directly.

Closes #1869

Test Plan

Without these changes, if you ask "What is 2 + 2?" followed by "What did I just ask?", the agent gives an obviously incorrect answer. With these changes, the same series of questions gets the correct answer.

Signed-off-by: Michael Clifford <mcliffor@redhat.com>
modules	build: format codebase imports using ruff linter (#1028)	2025-02-13 10:06:21 -08:00
page	feat: Updated playground rag to use session id for persistent conversation (#1870)	2025-04-08 09:46:13 +02:00
__init__.py	move playground ui to llama-stack repo (#536)	2024-11-26 22:04:21 -08:00
app.py	Fix precommit check after moving to ruff (#927)	2025-02-02 06:46:45 -08:00
Containerfile	feat: Created Playground Containerfile and Image Workflow (#1256)	2025-03-18 09:26:49 -07:00
README.md	feat: Created Playground Containerfile and Image Workflow (#1256)	2025-03-18 09:26:49 -07:00
requirements.txt	[llama stack ui] add native eval & inspect distro & playground pages (#541)	2024-12-04 09:47:09 -08:00

README.md

(Experimental) Llama Stack UI

Docker Setup

⚠️ This is a work in progress.

Developer Setup

Start up the Llama Stack API server. More details here.

llama stack build --template together --image-type conda

llama stack run together

(Optional) Register datasets and eval tasks as resources if you want to run pre-configured evaluation flows (e.g. the Evaluations (Generation + Scoring) page).

llama-stack-client datasets register \
--dataset-id "mmlu" \
--provider-id "huggingface" \
--url "https://huggingface.co/datasets/llamastack/evals" \
--metadata '{"path": "llamastack/evals", "name": "evals__mmlu__details", "split": "train"}' \
--schema '{"input_query": {"type": "string"}, "expected_answer": {"type": "string"}, "chat_completion_input": {"type": "string"}}'

llama-stack-client benchmarks register \
--eval-task-id meta-reference-mmlu \
--provider-id meta-reference \
--dataset-id mmlu \
--scoring-functions basic::regex_parser_multiple_choice_answer
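
The `--metadata` and `--schema` arguments of the `datasets register` command above are hand-written JSON, where a misplaced brace is easy to miss. The sketch below builds both values as Python dictionaries and prints them as shell-quoted JSON. It assumes the intended schema has three top-level string columns (`input_query`, `expected_answer`, `chat_completion_input`); the helper name `cli_json_arg` is hypothetical and not part of llama-stack-client.

```python
import json
import shlex

# Assumption: the dataset is meant to expose three top-level string columns.
schema = {
    "input_query": {"type": "string"},
    "expected_answer": {"type": "string"},
    "chat_completion_input": {"type": "string"},
}

# Metadata values copied from the datasets register command above.
metadata = {
    "path": "llamastack/evals",
    "name": "evals__mmlu__details",
    "split": "train",
}


def cli_json_arg(value):
    """Serialize a value to JSON and quote it for a POSIX shell (hypothetical helper)."""
    return shlex.quote(json.dumps(value))


if __name__ == "__main__":
    # Paste the printed values after the matching flags.
    print("--metadata", cli_json_arg(metadata))
    print("--schema", cli_json_arg(schema))
```

Generating the arguments this way guarantees well-formed JSON and makes the nesting of each column visible before the command is run.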

Start Streamlit UI

cd llama_stack/distribution/ui
pip install -r requirements.txt
streamlit run app.py

Environment Variables

Environment Variable	Description	Default Value
LLAMA_STACK_ENDPOINT	The endpoint for the Llama Stack	http://localhost:8321
FIREWORKS_API_KEY	API key for Fireworks provider	(empty string)
TOGETHER_API_KEY	API key for Together provider	(empty string)
SAMBANOVA_API_KEY	API key for SambaNova provider	(empty string)
OPENAI_API_KEY	API key for OpenAI provider	(empty string)
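
As a rough illustration of how the variables in the table could be consumed, the sketch below reads each one and falls back to the listed default. The function name `load_ui_settings` and the `DEFAULTS` mapping are hypothetical and are not taken from the UI code; only the variable names and default values come from the table.

```python
import os

# Defaults from the table above; API keys default to an empty string.
DEFAULTS = {
    "LLAMA_STACK_ENDPOINT": "http://localhost:8321",
    "FIREWORKS_API_KEY": "",
    "TOGETHER_API_KEY": "",
    "SAMBANOVA_API_KEY": "",
    "OPENAI_API_KEY": "",
}


def load_ui_settings(environ=None):
    """Return each setting from the environment, or its documented default (hypothetical helper)."""
    env = os.environ if environ is None else environ
    return {name: env.get(name, default) for name, default in DEFAULTS.items()}


if __name__ == "__main__":
    settings = load_ui_settings()
    print("endpoint:", settings["LLAMA_STACK_ENDPOINT"])
    # Report which provider keys are set without printing the secrets themselves.
    for name in DEFAULTS:
        if name.endswith("_API_KEY"):
            print(name, "set" if settings[name] else "not set")
```

Running this before `streamlit run app.py` is a quick way to confirm which endpoint and provider keys the current shell would pass to the UI.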