llama-stack/llama_stack/templates
Ashwin Bharambe 928a39d17b
feat(providers): Groq now uses LiteLLM openai-compat (#1303)
Groq has never supported raw completions anyway, which makes it easier
to switch it to LiteLLM. Our entire test suite passes.

I also updated all the openai-compat providers so they work with API
keys passed via request headers (`provider_data`).
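
As a rough illustration of what passing a key through `provider_data` could look like on the client side, here is a minimal sketch. The header name `X-LlamaStack-Provider-Data`, the field name `groq_api_key`, and the helper `provider_data_headers` are assumptions made for illustration, not something this commit message confirms; check them against the version of Llama Stack you are running.

```python
import json

# Assumed header name for per-request provider data; verify for your version.
PROVIDER_DATA_HEADER = "X-LlamaStack-Provider-Data"


def provider_data_headers(**provider_data: str) -> dict[str, str]:
    """Build HTTP headers that carry provider data as a JSON-encoded string.

    Hypothetical helper: it only serializes the keyword arguments and does
    not contact any server.
    """
    return {PROVIDER_DATA_HEADER: json.dumps(provider_data)}


# "groq_api_key" is an assumed field name; the key value is a placeholder.
headers = provider_data_headers(groq_api_key="gsk-example")
print(headers)
```

The resulting dictionary can be merged into the headers of whichever HTTP client is used, so the key travels with each request rather than living in the server's configuration.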

## Test Plan

```bash
LLAMA_STACK_CONFIG=groq \
   pytest -s -v tests/client-sdk/inference/test_text_inference.py \
   --inference-model=groq/llama-3.3-70b-versatile --vision-inference-model=""
```

Also tested the openai, anthropic, and gemini providers. No regressions.
2025-02-27 13:16:50 -08:00
| Name | Last commit | Date |
| --- | --- | --- |
| bedrock | ModelAlias -> ProviderModelEntry | 2025-02-20 14:02:36 -08:00 |
| cerebras | chore: move embedding deps to RAG tool where they are needed (#1210) | 2025-02-21 11:33:41 -08:00 |
| ci-tests | feat: add (openai, anthropic, gemini) providers via litellm (#1267) | 2025-02-25 22:07:33 -08:00 |
| dell | chore: move embedding deps to RAG tool where they are needed (#1210) | 2025-02-21 11:33:41 -08:00 |
| dev | feat(providers): Groq now uses LiteLLM openai-compat (#1303) | 2025-02-27 13:16:50 -08:00 |
| experimental-post-training | feat: [post training] support save hf safetensor format checkpoint (#845) | 2025-02-25 23:29:08 -08:00 |
| fireworks | test: add a ci-tests distro template for running e2e tests (#1237) | 2025-02-24 14:43:21 -08:00 |
| groq | feat(providers): Groq now uses LiteLLM openai-compat (#1303) | 2025-02-27 13:16:50 -08:00 |
| hf-endpoint | fix!: update eval-tasks -> benchmarks (#1032) | 2025-02-13 16:40:58 -08:00 |
| hf-serverless | chore: move embedding deps to RAG tool where they are needed (#1210) | 2025-02-21 11:33:41 -08:00 |
| meta-reference-gpu | fix!: update eval-tasks -> benchmarks (#1032) | 2025-02-13 16:40:58 -08:00 |
| meta-reference-quantized-gpu | fix!: update eval-tasks -> benchmarks (#1032) | 2025-02-13 16:40:58 -08:00 |
| nvidia | feat: add nemo retriever text embedding models to nvidia inference provider (#1218) | 2025-02-26 21:18:34 -08:00 |
| ollama | feat: Add model context protocol tools with ollama provider (#1283) | 2025-02-26 15:38:18 -08:00 |
| passthrough | feat: inference passthrough provider (#1166) | 2025-02-19 21:47:00 -08:00 |
| remote-vllm | chore: move embedding deps to RAG tool where they are needed (#1210) | 2025-02-21 11:33:41 -08:00 |
| sambanova | ModelAlias -> ProviderModelEntry | 2025-02-20 14:02:36 -08:00 |
| tgi | chore: move embedding deps to RAG tool where they are needed (#1210) | 2025-02-21 11:33:41 -08:00 |
| together | chore: move embedding deps to RAG tool where they are needed (#1210) | 2025-02-21 11:33:41 -08:00 |
| vllm-gpu | chore: move embedding deps to RAG tool where they are needed (#1210) | 2025-02-21 11:33:41 -08:00 |
| __init__.py | Auto-generate distro yamls + docs (#468) | 2024-11-18 14:57:06 -08:00 |
| template.py | build: format codebase imports using ruff linter (#1028) | 2025-02-13 10:06:21 -08:00 |