llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-10-11 13:44:38 +00:00

History

Varsha Prasad Narsing 531b1451dc feat: Add /v1/embeddings endpoint to batches API This PR extends the Llama Stack Batches API to support the /v1/embeddings endpoint, enabling efficient batch processing of embedding requests alongside the existing /v1/chat/completions and /v1/completions support. Signed-off-by: Varsha Prasad Narsing <varshaprasad96@gmail.com>		2025-09-29 12:00:28 -07:00
..
agent	fix: adding mime type of application/json support (#3452 )	2025-09-29 11:27:31 -07:00
agents	chore: introduce write queue for response_store (#3497 )	2025-09-29 10:36:16 -07:00
batches	feat: Add /v1/embeddings endpoint to batches API	2025-09-29 12:00:28 -07:00
files	feat(files, s3, expiration): add expires_after support to S3 files provider (#3283 )	2025-08-29 16:17:24 -07:00
inference	feat: add static embedding metadata to dynamic model listings for providers using OpenAIMixin (#3547 )	2025-09-25 17:17:00 -04:00
nvidia	feat: add static embedding metadata to dynamic model listings for providers using OpenAIMixin (#3547 )	2025-09-25 17:17:00 -04:00
utils	feat(internal): add image_url download feature to OpenAIMixin (#3516 )	2025-09-26 17:32:16 -04:00
vector_io	feat: migrate to FIPS-validated cryptographic algorithms (#3423 )	2025-09-12 11:18:19 +02:00
test_bedrock.py	fix: AWS Bedrock inference profile ID conversion for region-specific endpoints (#3386 )	2025-09-11 11:41:53 +02:00
test_configs.py	chore(rename): move llama_stack.distribution to llama_stack.core (#2975 )	2025-07-30 23:30:53 -07:00