llama-stack-mirror/llama_stack/providers
Varsha Prasad Narsing 531b1451dc feat: Add /v1/embeddings endpoint to batches API
This PR extends the Llama Stack Batches API to support the /v1/embeddings endpoint, enabling efficient batch processing of embedding requests alongside the existing /v1/chat/completions and /v1/completions support.

Signed-off-by: Varsha Prasad Narsing <varshaprasad96@gmail.com>
2025-09-29 12:00:28 -07:00
..
inline feat: Add /v1/embeddings endpoint to batches API 2025-09-29 12:00:28 -07:00
registry docs: provider and distro codegen migration (#3531) 2025-09-24 14:01:29 -07:00
remote chore: recordings for fireworks (inference + openai) (#3573) 2025-09-27 11:22:30 -07:00
utils chore: introduce write queue for response_store (#3497) 2025-09-29 10:36:16 -07:00
__init__.py API Updates (#73) 2024-09-17 19:51:35 -07:00
datatypes.py feat: combine ProviderSpec datatypes (#3378) 2025-09-18 16:10:00 +02:00