llama-stack

forked from phoenix-oss/llama-stack-mirror

History

Ashwin Bharambe 928a39d17b feat(providers): Groq now uses LiteLLM openai-compat (#1303 ) Groq has never supported raw completions anyhow. So this makes it easier to switch it to LiteLLM. All our test suite passes. I also updated all the openai-compat providers so they work with api keys passed from headers. `provider_data` ## Test Plan ```bash LLAMA_STACK_CONFIG=groq \ pytest -s -v tests/client-sdk/inference/test_text_inference.py \ --inference-model=groq/llama-3.3-70b-versatile --vision-inference-model="" ``` Also tested (openai, anthropic, gemini) providers. No regressions.		2025-02-27 13:16:50 -08:00
..
groq	feat(providers): Groq now uses LiteLLM openai-compat (#1303 )	2025-02-27 13:16:50 -08:00
__init__.py	Remove "routing_table" and "routing_key" concepts for the user (#201 )	2024-10-10 10:24:13 -07:00
conftest.py	refactor: move OpenAI compat utilities from nvidia to openai_compat (#1258 )	2025-02-25 22:02:11 -08:00
fixtures.py	refactor: move OpenAI compat utilities from nvidia to openai_compat (#1258 )	2025-02-25 22:02:11 -08:00
pasta.jpeg	Enable vision models for (Together, Fireworks, Meta-Reference, Ollama) (#376 )	2024-11-05 16:22:33 -08:00
test_embeddings.py	Fix precommit check after moving to ruff (#927 )	2025-02-02 06:46:45 -08:00
test_model_registration.py	test: skip model registration for unsupported providers (#1030 )	2025-02-19 22:39:13 -08:00
test_prompt_adapter.py	fix: pass tool_prompt_format to chat_formatter (#1198 )	2025-02-20 21:38:35 -08:00
test_text_inference.py	feat: completing text /chat-completion and /completion tests (#1223 )	2025-02-25 11:37:04 -08:00
test_vision_inference.py	build: configure ruff from pyproject.toml (#1100 )	2025-02-14 09:01:57 -08:00
utils.py	Fix precommit check after moving to ruff (#927 )	2025-02-02 06:46:45 -08:00