llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-10-23 16:37:28 +00:00

History

Matthew Farrellee d266c59c2a chore: remove deprecated inference.chat_completion implementations (#3654 ) # What does this PR do? remove unused chat_completion implementations vllm features ported - - requires max_tokens be set, use config value - set tool_choice to none if no tools provided ## Test Plan ci		2025-10-03 07:55:34 -04:00
..
agent	feat(tools)!: substantial clean up of "Tool" related datatypes (#3627 )	2025-10-02 15:12:03 -07:00
agents	fix: responses <> chat completion input conversion (#3645 )	2025-10-02 16:01:08 -07:00
batches	feat(batches, completions): add /v1/completions support to /v1/batches (#3309 )	2025-09-05 11:59:57 -07:00
files	feat(files): fix expires_after API shape (#3604 )	2025-09-29 21:29:15 -07:00
inference	chore: remove deprecated inference.chat_completion implementations (#3654 )	2025-10-03 07:55:34 -04:00
inline	feat(tools)!: substantial clean up of "Tool" related datatypes (#3627 )	2025-10-02 15:12:03 -07:00
nvidia	feat: add static embedding metadata to dynamic model listings for providers using OpenAIMixin (#3547 )	2025-09-25 17:17:00 -04:00
utils	chore: OpenAIMixin implements ModelsProtocolPrivate (#3662 )	2025-10-02 21:32:02 -07:00
vector_io	feat: implement keyword and hybrid search for Weaviate provider (#3264 )	2025-10-03 10:22:30 +02:00
test_bedrock.py	fix: AWS Bedrock inference profile ID conversion for region-specific endpoints (#3386 )	2025-09-11 11:41:53 +02:00
test_configs.py	chore(rename): move llama_stack.distribution to llama_stack.core (#2975 )	2025-07-30 23:30:53 -07:00