llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-23 17:33:57 +00:00

Ben Browning fa9e2dd543 fix: Don't cache clients for passthrough auth providers	2025-07-11 10:18:35 -04:00

Some of our inference providers support passthrough authentication via `x-llamastack-provider-data` header values. This fixes the providers that support passthrough auth to not cache their clients to the backend providers (mostly OpenAI client instances) so that the client connecting to Llama Stack has to provide those auth values on each and every request.

Signed-off-by: Ben Browning <bbrownin@redhat.com>
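
The behavior described in this commit message can be illustrated with a minimal sketch. It is not the code from the commit: `BackendClient`, `PassthroughAdapter`, `get_client`, and the `api_key` field name are all hypothetical, and the sketch assumes the provider data from the `x-llamastack-provider-data` header has already been parsed into a dict. The only point it shows is that the adapter builds a fresh backend client per request instead of storing one on the instance.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class BackendClient:
    """Hypothetical stand-in for a backend SDK client (e.g. an OpenAI client)."""

    api_key: str


class PassthroughAdapter:
    """Hypothetical adapter that never caches its backend client."""

    def get_client(self, provider_data: Optional[dict]) -> BackendClient:
        # Assumption: provider_data is the already-parsed content of the
        # x-llamastack-provider-data header, with a hypothetical "api_key" field.
        api_key = (provider_data or {}).get("api_key")
        if not api_key:
            raise ValueError("missing api_key in x-llamastack-provider-data")
        # A new client is constructed on every call and never stored on self,
        # so one caller's credentials cannot be reused for another request.
        return BackendClient(api_key=api_key)


adapter = PassthroughAdapter()
first = adapter.get_client({"api_key": "key-a"})
second = adapter.get_client({"api_key": "key-b"})
```

In this sketch each request carries its own credentials, and a request without them fails rather than silently reusing a client created for an earlier caller.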
anthropic	ci: test safety with starter (#2628)	2025-07-09 16:53:50 +02:00
bedrock	ci: test safety with starter (#2628)	2025-07-09 16:53:50 +02:00
cerebras	ci: test safety with starter (#2628)	2025-07-09 16:53:50 +02:00
cerebras_openai_compat	feat: introduce APIs for retrieving chat completion requests (#2145)	2025-05-18 21:43:19 -07:00
databricks	ci: test safety with starter (#2628)	2025-07-09 16:53:50 +02:00
fireworks	ci: test safety with starter (#2628)	2025-07-09 16:53:50 +02:00
fireworks_openai_compat	feat: introduce APIs for retrieving chat completion requests (#2145)	2025-05-18 21:43:19 -07:00
gemini	ci: test safety with starter (#2628)	2025-07-09 16:53:50 +02:00
groq	fix: Don't cache clients for passthrough auth providers	2025-07-11 10:18:35 -04:00
groq_openai_compat	feat: introduce APIs for retrieving chat completion requests (#2145)	2025-05-18 21:43:19 -07:00
llama_openai_compat	feat: introduce APIs for retrieving chat completion requests (#2145)	2025-05-18 21:43:19 -07:00
nvidia	ci: test safety with starter (#2628)	2025-07-09 16:53:50 +02:00
ollama	refactor: set proper name for embedding all-minilm:l6-v2 and update to use "starter" in detailed_tutorial (#2627)	2025-07-06 09:07:37 +05:30
openai	fix: Don't cache clients for passthrough auth providers	2025-07-11 10:18:35 -04:00
passthrough	feat: consolidate most distros into "starter" (#2516)	2025-07-04 15:58:03 +02:00
runpod	ci: test safety with starter (#2628)	2025-07-09 16:53:50 +02:00
sambanova	ci: test safety with starter (#2628)	2025-07-09 16:53:50 +02:00
sambanova_openai_compat	feat: introduce APIs for retrieving chat completion requests (#2145)	2025-05-18 21:43:19 -07:00
tgi	feat: consolidate most distros into "starter" (#2516)	2025-07-04 15:58:03 +02:00
together	fix: Don't cache clients for passthrough auth providers	2025-07-11 10:18:35 -04:00
together_openai_compat	feat: introduce APIs for retrieving chat completion requests (#2145)	2025-05-18 21:43:19 -07:00
vllm	refactor(env)!: enhanced environment variable substitution (#2490)	2025-06-26 08:20:08 +05:30
watsonx	fix: allow default empty vars for conditionals (#2570)	2025-07-01 14:42:05 +02:00
__init__.py	`impls` -> `inline`, `adapters` -> `remote` (#381)	2024-11-06 14:54:05 -08:00