Groq has never supported raw completions anyhow, so this makes it easier to switch it to LiteLLM. All our test suite passes. I also updated all the openai-compat providers so they work with API keys passed from request headers via `provider_data` (a client-side sketch follows the test plan).

## Test Plan

```bash
LLAMA_STACK_CONFIG=groq \
  pytest -s -v tests/client-sdk/inference/test_text_inference.py \
  --inference-model=groq/llama-3.3-70b-versatile --vision-inference-model=""
```

Also tested the (openai, anthropic, gemini) providers. No regressions.
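For reference, here is a minimal sketch of what header-based key passing looks like from the client side. It assumes the `llama-stack-client` Python SDK and the stack's `X-LlamaStack-Provider-Data` header convention; the exact key field name (`groq_api_key`) and the placeholder key value are assumptions for illustration, not taken from this PR.

```python
# Sketch: pass a provider API key per request via provider_data headers,
# instead of baking it into the server's run config.
import json

from llama_stack_client import LlamaStackClient

client = LlamaStackClient(
    base_url="http://localhost:8321",  # default llama-stack server address
    default_headers={
        # JSON-encoded provider credentials; the server-side openai-compat
        # providers read the key from here ("groq_api_key" is assumed).
        "X-LlamaStack-Provider-Data": json.dumps({"groq_api_key": "gsk-..."}),
    },
)

response = client.inference.chat_completion(
    model_id="groq/llama-3.3-70b-versatile",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.completion_message.content)
```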