llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-03 09:53:45 +00:00

Matthew Farrellee 53b15725b6 chore(apis): unpublish deprecated /v1/inference apis (#3297)	2025-09-27 11:20:06 -07:00

# What does this PR do?

unpublish (make unavailable to users) the following apis -

- `/v1/inference/completion`, replaced by `/v1/openai/v1/completions`
- `/v1/inference/chat-completion`, replaced by `/v1/openai/v1/chat/completions`
- `/v1/inference/embeddings`, replaced by `/v1/openai/v1/embeddings`
- `/v1/inference/batch-completion`, replaced by `/v1/openai/v1/batches`
- `/v1/inference/batch-chat-completion`, replaced by `/v1/openai/v1/batches`

note: the implementations are still available for internal use, e.g. agents uses chat-completion.
inline	chore(api): remove batch inference (#3261)	2025-09-26 14:35:34 -07:00
registry	docs: provider and distro codegen migration (#3531)	2025-09-24 14:01:29 -07:00
remote	chore(api): remove batch inference (#3261)	2025-09-26 14:35:34 -07:00
utils	chore(apis): unpublish deprecated /v1/inference apis (#3297)	2025-09-27 11:20:06 -07:00
__init__.py	API Updates (#73)	2024-09-17 19:51:35 -07:00
datatypes.py	feat: combine ProviderSpec datatypes (#3378)	2025-09-18 16:10:00 +02:00
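
The route replacements listed in the commit message for #3297 can be summarized as a lookup table. The sketch below is illustrative only: `DEPRECATED_TO_REPLACEMENT` and `replacement_for` are hypothetical names that do not come from the repository, and the table covers route paths only. The commit message does not describe request or response shapes, so callers should not assume the payloads are interchangeable.

```python
# Hypothetical helper, not part of llama-stack: the paths are copied from the
# commit message for #3297, everything else is an assumption for illustration.
DEPRECATED_TO_REPLACEMENT = {
    "/v1/inference/completion": "/v1/openai/v1/completions",
    "/v1/inference/chat-completion": "/v1/openai/v1/chat/completions",
    "/v1/inference/embeddings": "/v1/openai/v1/embeddings",
    "/v1/inference/batch-completion": "/v1/openai/v1/batches",
    "/v1/inference/batch-chat-completion": "/v1/openai/v1/batches",
}


def replacement_for(path: str) -> str:
    """Return the replacement route for an unpublished path.

    Paths that are not in the table are returned unchanged.
    A trailing slash is ignored when looking up the path.
    """
    return DEPRECATED_TO_REPLACEMENT.get(path.rstrip("/"), path)
```

Both batch routes map to the same replacement, so the mapping cannot be inverted to recover the original path.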