llama-stack

forked from phoenix-oss/llama-stack-mirror

Ashwin Bharambe 928a39d17b feat(providers): Groq now uses LiteLLM openai-compat (#1303)	2025-02-27 13:16:50 -08:00

Groq has never supported raw completions anyhow. So this makes it easier to switch it to LiteLLM. All our test suite passes.

I also updated all the openai-compat providers so they work with api keys passed from headers. `provider_data`

## Test Plan

```bash
LLAMA_STACK_CONFIG=groq \
   pytest -s -v tests/client-sdk/inference/test_text_inference.py \
   --inference-model=groq/llama-3.3-70b-versatile --vision-inference-model=""
```

Also tested (openai, anthropic, gemini) providers. No regressions.

bedrock	ModelAlias -> ProviderModelEntry	2025-02-20 14:02:36 -08:00
cerebras	chore: move embedding deps to RAG tool where they are needed (#1210)	2025-02-21 11:33:41 -08:00
ci-tests	feat: add (openai, anthropic, gemini) providers via litellm (#1267)	2025-02-25 22:07:33 -08:00
dell	chore: move embedding deps to RAG tool where they are needed (#1210)	2025-02-21 11:33:41 -08:00
dev	feat(providers): Groq now uses LiteLLM openai-compat (#1303)	2025-02-27 13:16:50 -08:00
experimental-post-training	feat: [post training] support save hf safetensor format checkpoint (#845)	2025-02-25 23:29:08 -08:00
fireworks	test: add a ci-tests distro template for running e2e tests (#1237)	2025-02-24 14:43:21 -08:00
groq	feat(providers): Groq now uses LiteLLM openai-compat (#1303)	2025-02-27 13:16:50 -08:00
hf-endpoint	fix!: update eval-tasks -> benchmarks (#1032)	2025-02-13 16:40:58 -08:00
hf-serverless	chore: move embedding deps to RAG tool where they are needed (#1210)	2025-02-21 11:33:41 -08:00
meta-reference-gpu	fix!: update eval-tasks -> benchmarks (#1032)	2025-02-13 16:40:58 -08:00
meta-reference-quantized-gpu	fix!: update eval-tasks -> benchmarks (#1032)	2025-02-13 16:40:58 -08:00
nvidia	feat: add nemo retriever text embedding models to nvidia inference provider (#1218)	2025-02-26 21:18:34 -08:00
ollama	feat: Add model context protocol tools with ollama provider (#1283)	2025-02-26 15:38:18 -08:00
passthrough	feat: inference passthrough provider (#1166)	2025-02-19 21:47:00 -08:00
remote-vllm	chore: move embedding deps to RAG tool where they are needed (#1210)	2025-02-21 11:33:41 -08:00
sambanova	ModelAlias -> ProviderModelEntry	2025-02-20 14:02:36 -08:00
tgi	chore: move embedding deps to RAG tool where they are needed (#1210)	2025-02-21 11:33:41 -08:00
together	chore: move embedding deps to RAG tool where they are needed (#1210)	2025-02-21 11:33:41 -08:00
vllm-gpu	chore: move embedding deps to RAG tool where they are needed (#1210)	2025-02-21 11:33:41 -08:00
__init__.py	Auto-generate distro yamls + docs (#468)	2024-11-18 14:57:06 -08:00
template.py	build: format codebase imports using ruff linter (#1028)	2025-02-13 10:06:21 -08:00