llama-stack

History

Ashwin Bharambe 928a39d17b feat(providers): Groq now uses LiteLLM openai-compat (#1303 ) Groq has never supported raw completions anyhow. So this makes it easier to switch it to LiteLLM. All our test suite passes. I also updated all the openai-compat providers so they work with api keys passed from headers. `provider_data` ## Test Plan ```bash LLAMA_STACK_CONFIG=groq \ pytest -s -v tests/client-sdk/inference/test_text_inference.py \ --inference-model=groq/llama-3.3-70b-versatile --vision-inference-model="" ``` Also tested (openai, anthropic, gemini) providers. No regressions.		2025-02-27 13:16:50 -08:00
..
__init__.py	chore: move all Llama Stack types from llama-models to llama-stack (#1098 )	2025-02-14 09:10:59 -08:00
embedding_mixin.py	fix: dont assume SentenceTransformer is imported	2025-02-25 16:53:01 -08:00
litellm_openai_mixin.py	feat(providers): Groq now uses LiteLLM openai-compat (#1303 )	2025-02-27 13:16:50 -08:00
model_registry.py	feat(providers): support non-llama models for inference providers (#1200 )	2025-02-21 13:21:28 -08:00
openai_compat.py	fix(test): update client-sdk tests to handle tool format parametrization better (#1287 )	2025-02-26 21:16:00 -08:00
prompt_adapter.py	fix: set default tool_prompt_format in inference api (#1214 )	2025-02-24 12:38:37 -08:00