Groq has never supported raw completions anyway, so this makes it easier to switch it to LiteLLM. Our entire test suite passes. I also updated all the openai-compat providers so they work with API keys passed from headers (`provider_data`).

## Test Plan

```bash
LLAMA_STACK_CONFIG=groq \
  pytest -s -v tests/client-sdk/inference/test_text_inference.py \
  --inference-model=groq/llama-3.3-70b-versatile --vision-inference-model=""
```

Also tested the (openai, anthropic, gemini) providers. No regressions.
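For context, a minimal sketch of how a per-request API key can be supplied through `provider_data`; the header name, JSON key, port, and endpoint path here are assumptions for illustration, not verified against this change:

```bash
# Hypothetical example: pass a Groq API key per request instead of
# configuring it in the server's run config. The header name
# (X-LlamaStack-Provider-Data) and the "groq_api_key" field are assumptions.
curl http://localhost:8321/v1/inference/chat-completion \
  -H "Content-Type: application/json" \
  -H 'X-LlamaStack-Provider-Data: {"groq_api_key": "<your-groq-key>"}' \
  -d '{
    "model_id": "groq/llama-3.3-70b-versatile",
    "messages": [{"role": "user", "content": "Hello"}]
  }'
```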