llama-stack-mirror

Mirror of https://github.com/meta-llama/llama-stack.git, synced 2025-12-03 18:00:36 +00:00

Matthew Farrellee d266c59c2a chore: remove deprecated inference.chat_completion implementations (#3654)	2025-10-03 07:55:34 -04:00

# What does this PR do?

remove unused chat_completion implementations

vllm features ported -
- requires max_tokens be set, use config value
- set tool_choice to none if no tools provided

## Test Plan

ci

agents	test: add unit test to ensure all config types are instantiable (#1601)	2025-03-12 22:29:58 -07:00
datasetio	chore(misc): make tests and starter faster (#3042)	2025-08-05 14:55:05 -07:00
eval	feat: add static embedding metadata to dynamic model listings for providers using OpenAIMixin (#3547)	2025-09-25 17:17:00 -04:00
files/s3	fix(expires_after): make sure multipart/form-data is properly parsed (#3612)	2025-09-30 16:14:03 -04:00
inference	chore: remove deprecated inference.chat_completion implementations (#3654)	2025-10-03 07:55:34 -04:00
post_training	fix: remove inference.completion from docs (#3589)	2025-09-29 13:14:41 -07:00
safety	refactor(logging): rename llama_stack logger categories (#3065)	2025-08-21 17:31:04 -07:00
tool_runtime	feat(tools)!: substantial clean up of "Tool" related datatypes (#3627)	2025-10-02 15:12:03 -07:00
vector_io	feat: implement keyword and hybrid search for Weaviate provider (#3264)	2025-10-03 10:22:30 +02:00
__init__.py	`impls` -> `inline`, `adapters` -> `remote` (#381)	2024-11-06 14:54:05 -08:00