llama-stack

forked from phoenix-oss/llama-stack-mirror

Ben Browning 602e949a46 fix: OpenAI Completions API and Fireworks (#1997)	2025-04-21 11:49:12 -07:00

# What does this PR do?

We were passing a dict into the compat mixin for OpenAI Completions when using Llama models with Fireworks, and that was breaking some strong typing code that was added in openai_compat.py. We shouldn't have been converting these params to a dict in that case anyway, so this adjusts things to pass the params in as their actual original types when calling the OpenAIChatCompletionToLlamaStackMixin.

## Test Plan

All of the fireworks provider verification tests were failing due to some OpenAI compatibility cleanup in #1962. The changes in that PR were good to make, and this just cleans up the fireworks provider code to stop passing in untyped dicts to some of those `openai_compat.py` methods since we have the original strongly-typed parameters we can pass in.

```
llama stack run --image-type venv tests/verifications/openai-api-verification-run.yaml
```

```
python -m pytest -s -v tests/verifications/openai_api/test_chat_completion.py --provider=fireworks-llama-stack
```

Before this PR, all of the fireworks OpenAI verification tests were failing. Now, most of them are passing.

Signed-off-by: Ben Browning <bbrownin@redhat.com>

inline	fix: OAI compat endpoint for meta reference inference provider (#1962)	2025-04-17 11:16:04 -07:00
registry	fix: use torchao 0.8.0 for inference (#1925)	2025-04-10 13:39:20 -07:00
remote	fix: OpenAI Completions API and Fireworks (#1997)	2025-04-21 11:49:12 -07:00
tests	refactor: move all llama code to models/llama out of meta reference (#1887)	2025-04-07 15:03:58 -07:00
utils	fix: OAI compat endpoint for meta reference inference provider (#1962)	2025-04-17 11:16:04 -07:00
__init__.py	API Updates (#73)	2024-09-17 19:51:35 -07:00
datatypes.py	feat: add health to all providers through providers endpoint (#1418)	2025-04-14 11:59:36 +02:00