llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-15 04:12:38 +00:00

History

Matthew Farrellee f754e1b65b chore: remove deprecated inference.chat_completion implementations vllm - - requires max_tokens be set, use config value - set tool_choice to none if no tools provided		2025-10-02 10:39:30 -04:00
..
agent	fix: adding mime type of application/json support (#3452 )	2025-09-29 11:27:31 -07:00
agents	fix: Ensure that tool calls with no arguments get handled correctly (#3560 )	2025-10-01 08:36:57 -04:00
batches	feat(batches, completions): add /v1/completions support to /v1/batches (#3309 )	2025-09-05 11:59:57 -07:00
files	feat(files): fix expires_after API shape (#3604 )	2025-09-29 21:29:15 -07:00
inference	chore: remove deprecated inference.chat_completion implementations	2025-10-02 10:39:30 -04:00
inline	fix: mcp tool with array type should include items (#3602 )	2025-09-29 23:11:41 -07:00
nvidia	feat: add static embedding metadata to dynamic model listings for providers using OpenAIMixin (#3547 )	2025-09-25 17:17:00 -04:00
utils	chore: add provider-data-api-key support to openaimixin (#3639 )	2025-10-01 13:44:59 -07:00
vector_io	chore(api): remove deprecated embeddings impls (#3301 )	2025-09-29 14:45:09 -04:00
test_bedrock.py	fix: AWS Bedrock inference profile ID conversion for region-specific endpoints (#3386 )	2025-09-11 11:41:53 +02:00
test_configs.py	chore(rename): move llama_stack.distribution to llama_stack.core (#2975 )	2025-07-30 23:30:53 -07:00