llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-04 18:13:44 +00:00


snova-edwardm 22dc684da6 Sambanova inference provider (#555)	2025-01-23 12:20:28 -08:00

# What does this PR do?

This PR adds SambaNova as one of the providers.

- Add SambaNova as a provider

## Test Plan

Test the functional command

```
pytest -s -v --providers inference=sambanova llama_stack/providers/tests/inference/test_embeddings.py llama_stack/providers/tests/inference/test_prompt_adapter.py llama_stack/providers/tests/inference/test_text_inference.py llama_stack/providers/tests/inference/test_vision_inference.py --env SAMBANOVA_API_KEY=<sambanova-api-key>
```

Test the distribution template:

```
# Docker
LLAMA_STACK_PORT=5001
docker run -it -p $LLAMA_STACK_PORT:$LLAMA_STACK_PORT \
  llamastack/distribution-sambanova \
  --port $LLAMA_STACK_PORT \
  --env SAMBANOVA_API_KEY=$SAMBANOVA_API_KEY

# Conda
llama stack build --template sambanova --image-type conda
llama stack run ./run.yaml \
  --port $LLAMA_STACK_PORT \
  --env SAMBANOVA_API_KEY=$SAMBANOVA_API_KEY
```

## Source

[SambaNova API Documentation](https://cloud.sambanova.ai/apis)

## Before submitting

- [ ] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
- [Y] Ran pre-commit to handle lint / formatting issues.
- [Y] Read the [contributor guideline](https://github.com/meta-llama/llama-stack/blob/main/CONTRIBUTING.md), Pull Request section?
- [Y] Updated relevant documentation.
- [Y] Wrote necessary unit or integration tests.

---------

Co-authored-by: Ashwin Bharambe <ashwin.bharambe@gmail.com>
routers	[memory refactor][6/n] Update naming and routes (#839)	2025-01-22 10:39:13 -08:00
server	[memory refactor][3/n] Introduce RAGToolRuntime as a specialized sub-protocol (#832)	2025-01-22 10:04:16 -08:00
store	Update OpenAPI generator to output discriminator (#848)	2025-01-22 22:15:23 -08:00
ui	Sambanova inference provider (#555)	2025-01-23 12:20:28 -08:00
utils	[CICD] Github workflow for publishing Docker images (#764)	2025-01-15 09:01:33 -08:00
__init__.py	API Updates (#73)	2024-09-17 19:51:35 -07:00
build.py	Fix llama stack build docker creation to have correct entrypoint	2025-01-22 16:53:54 -08:00
build_conda_env.sh	Make llama stack build not create a new conda by default (#788)	2025-01-16 13:44:53 -08:00
build_container.sh	Fix llama stack build docker creation to have correct entrypoint	2025-01-22 16:53:54 -08:00
build_venv.sh	Miscellaneous fixes around telemetry, library client and run yaml autogen	2024-12-08 20:40:22 -08:00
client.py	use API version in "remote" stack client	2024-11-19 15:59:47 -08:00
common.sh	API Updates (#73)	2024-09-17 19:51:35 -07:00
configure.py	[remove import ] clean up import 's (#689)	2024-12-27 15:45:44 -08:00
configure_container.sh	More generic image type for OCI-compliant container technologies (#802)	2025-01-17 16:37:42 -08:00
datatypes.py	[memory refactor][1/n] Rename Memory -> VectorIO, MemoryBanks -> VectorDBs (#828)	2025-01-22 09:59:30 -08:00
distribution.py	[memory refactor][1/n] Rename Memory -> VectorIO, MemoryBanks -> VectorDBs (#828)	2025-01-22 09:59:30 -08:00
inspect.py	REST API fixes (#789)	2025-01-16 13:47:08 -08:00
library_client.py	Fix telemetry (#787)	2025-01-16 10:36:13 -08:00
request_headers.py	Add X-LlamaStack-Client-Version, rename ProviderData -> Provider-Data (#735)	2025-01-09 11:51:36 -08:00
resolver.py	[memory refactor][3/n] Introduce RAGToolRuntime as a specialized sub-protocol (#832)	2025-01-22 10:04:16 -08:00
stack.py	[memory refactor][3/n] Introduce RAGToolRuntime as a specialized sub-protocol (#832)	2025-01-22 10:04:16 -08:00
start_conda_env.sh	Make llama stack build not create a new conda by default (#788)	2025-01-16 13:44:53 -08:00
start_container.sh	More generic image type for OCI-compliant container technologies (#802)	2025-01-17 16:37:42 -08:00