This doesn't get Groq to 100% on the OpenAI API verification tests, but it does get it to 88.2% with Llama Stack in the middle, compared to 61.8% when using an OpenAI client against Groq directly. The groq provider doesn't use litellm under the covers in its openai_chat_completion endpoint; instead it uses an AsyncOpenAI client directly, with some special handling to improve the conformance of responses for response_format usage and tool calling.

Signed-off-by: Ben Browning <bbrownin@redhat.com>
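As a rough illustration of that pattern, here is a minimal sketch of an AsyncOpenAI client pointed at Groq's OpenAI-compatible endpoint. The `openai_chat_completion` wrapper and its json_schema fallback are assumptions for illustration only; the provider's actual conformance handling lives in the llama-stack source and may differ.

```python
import os

from openai import AsyncOpenAI

# Groq exposes an OpenAI-compatible API, so the provider can talk to it
# with a plain AsyncOpenAI client rather than going through litellm.
# (Base URL hard-coded here for the sketch; the real provider reads its
# configuration instead.)
client = AsyncOpenAI(
    base_url="https://api.groq.com/openai/v1",
    api_key=os.environ["GROQ_API_KEY"],
)


async def openai_chat_completion(**params):
    """Forward a chat completion to Groq, smoothing over known quirks.

    The response_format handling below is a hypothetical example of the
    kind of conformance fix described above, not the provider's actual logic.
    """
    response_format = params.get("response_format")
    if isinstance(response_format, dict) and response_format.get("type") == "json_schema":
        # Illustrative fallback: downgrade to plain JSON mode if the
        # backend rejects full json_schema response formats.
        params["response_format"] = {"type": "json_object"}
    return await client.chat.completions.create(**params)
```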