forked from phoenix-oss/llama-stack-mirror

History

snova-edwardm 22dc684da6 Sambanova inference provider (#555 ) # What does this PR do? This PR adds SambaNova as one of the Provider - Add SambaNova as a provider ## Test Plan Test the functional command ``` pytest -s -v --providers inference=sambanova llama_stack/providers/tests/inference/test_embeddings.py llama_stack/providers/tests/inference/test_prompt_adapter.py llama_stack/providers/tests/inference/test_text_inference.py llama_stack/providers/tests/inference/test_vision_inference.py --env SAMBANOVA_API_KEY=<sambanova-api-key> ``` Test the distribution template: ``` # Docker LLAMA_STACK_PORT=5001 docker run -it -p $LLAMA_STACK_PORT:$LLAMA_STACK_PORT \ llamastack/distribution-sambanova \ --port $LLAMA_STACK_PORT \ --env SAMBANOVA_API_KEY=$SAMBANOVA_API_KEY # Conda llama stack build --template sambanova --image-type conda llama stack run ./run.yaml \ --port $LLAMA_STACK_PORT \ --env SAMBANOVA_API_KEY=$SAMBANOVA_API_KEY ``` ## Source [SambaNova API Documentation](https://cloud.sambanova.ai/apis) ## Before submitting - [ ] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case). - [Y] Ran pre-commit to handle lint / formatting issues. - [Y] Read the [contributor guideline](https://github.com/meta-llama/llama-stack/blob/main/CONTRIBUTING.md), Pull Request section? - [Y] Updated relevant documentation. - [Y ] Wrote necessary unit or integration tests. --------- Co-authored-by: Ashwin Bharambe <ashwin.bharambe@gmail.com>		2025-01-23 12:20:28 -08:00
..
modules	Sambanova inference provider (#555 )	2025-01-23 12:20:28 -08:00
page	Rename builtin::memory -> builtin::rag	2025-01-22 20:22:51 -08:00
__init__.py	move playground ui to llama-stack repo (#536 )	2024-11-26 22:04:21 -08:00
app.py	[llama stack ui] add native eval & inspect distro & playground pages (#541 )	2024-12-04 09:47:09 -08:00
README.md	Add eval/scoring/datasetio API providers to distribution templates & UI developer guide (#564 )	2024-12-05 16:29:32 -08:00
requirements.txt	[llama stack ui] add native eval & inspect distro & playground pages (#541 )	2024-12-04 09:47:09 -08:00

README.md

(Experimental) LLama Stack UI

Docker Setup

⚠️ This is a work in progress.

Developer Setup

Start up Llama Stack API server. More details here.

llama stack build --template together --image-type conda

llama stack run together

(Optional) Register datasets and eval tasks as resources. If you want to run pre-configured evaluation flows (e.g. Evaluations (Generation + Scoring) Page).

$ llama-stack-client datasets register \
--dataset-id "mmlu" \
--provider-id "huggingface" \
--url "https://huggingface.co/datasets/llamastack/evals" \
--metadata '{"path": "llamastack/evals", "name": "evals__mmlu__details", "split": "train"}' \
--schema '{"input_query": {"type": "string"}, "expected_answer": {"type": "string", "chat_completion_input": {"type": "string"}}}'

$ llama-stack-client eval_tasks register \
--eval-task-id meta-reference-mmlu \
--provider-id meta-reference \
--dataset-id mmlu \
--scoring-functions basic::regex_parser_multiple_choice_answer

Start Streamlit UI

cd llama_stack/distribution/ui
pip install -r requirements.txt
streamlit run app.py