llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-10-04 04:04:14 +00:00

History

Varsha Prasad Narsing 531b1451dc feat: Add /v1/embeddings endpoint to batches API This PR extends the Llama Stack Batches API to support the /v1/embeddings endpoint, enabling efficient batch processing of embedding requests alongside the existing /v1/chat/completions and /v1/completions support. Signed-off-by: Varsha Prasad Narsing <varshaprasad96@gmail.com>		2025-09-29 12:00:28 -07:00
..
inline	feat: Add /v1/embeddings endpoint to batches API	2025-09-29 12:00:28 -07:00
registry	docs: provider and distro codegen migration (#3531 )	2025-09-24 14:01:29 -07:00
remote	chore: recordings for fireworks (inference + openai) (#3573 )	2025-09-27 11:22:30 -07:00
utils	chore: introduce write queue for response_store (#3497 )	2025-09-29 10:36:16 -07:00
__init__.py	API Updates (#73 )	2024-09-17 19:51:35 -07:00
datatypes.py	feat: combine ProviderSpec datatypes (#3378 )	2025-09-18 16:10:00 +02:00