llama-stack-mirror/docs/source/distributions/self_hosted_distro
Ashwin Bharambe 928a39d17b
feat(providers): Groq now uses LiteLLM openai-compat (#1303)
Groq has never supported raw completions anyhow. So this makes it easier
to switch it to LiteLLM. All our test suite passes.

I also updated all the openai-compat providers so they work with api
keys passed from headers. `provider_data`

## Test Plan

```bash
LLAMA_STACK_CONFIG=groq \
   pytest -s -v tests/client-sdk/inference/test_text_inference.py \
   --inference-model=groq/llama-3.3-70b-versatile --vision-inference-model=""
```

Also tested (openai, anthropic, gemini) providers. No regressions.
2025-02-27 13:16:50 -08:00
..
bedrock.md fix: Get distro_codegen.py working with default deps and enabled in pre-commit hooks (#1123) 2025-02-19 18:39:20 -08:00
cerebras.md chore: move embedding deps to RAG tool where they are needed (#1210) 2025-02-21 11:33:41 -08:00
dell-tgi.md Update default port from 5000 -> 8321 2025-01-16 15:26:48 -08:00
dell.md chore: move embedding deps to RAG tool where they are needed (#1210) 2025-02-21 11:33:41 -08:00
fireworks.md chore: move embedding deps to RAG tool where they are needed (#1210) 2025-02-21 11:33:41 -08:00
groq.md feat(providers): Groq now uses LiteLLM openai-compat (#1303) 2025-02-27 13:16:50 -08:00
meta-reference-gpu.md fix: Readthedocs cannot parse comments, resulting in docs bugs (#1033) 2025-02-10 16:35:16 -05:00
meta-reference-quantized-gpu.md fix: Readthedocs cannot parse comments, resulting in docs bugs (#1033) 2025-02-10 16:35:16 -05:00
nvidia.md add nvidia distribution (#565) 2025-01-15 14:04:43 -08:00
ollama.md feat: Add model context protocol tools with ollama provider (#1283) 2025-02-26 15:38:18 -08:00
remote-vllm.md chore: move embedding deps to RAG tool where they are needed (#1210) 2025-02-21 11:33:41 -08:00
sambanova.md fix: Readthedocs cannot parse comments, resulting in docs bugs (#1033) 2025-02-10 16:35:16 -05:00
tgi.md chore: move embedding deps to RAG tool where they are needed (#1210) 2025-02-21 11:33:41 -08:00
together.md chore: move embedding deps to RAG tool where they are needed (#1210) 2025-02-21 11:33:41 -08:00