llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-03 09:53:45 +00:00

History

ehhuang 8ab6684a94 chore: introduce write queue for response_store (#3497 ) # What does this PR do? Mirroring the same changes that was used for inference_store: https://github.com/llamastack/llama-stack/pull/3383 Will follow up with a shared internal API for managing these write queues. ## Test Plan existing tests		2025-09-29 10:36:16 -07:00
..
agent	feat: Add items and title to ToolParameter/ToolParamDefinition (#3003 )	2025-09-27 11:35:29 -07:00
agents	chore: introduce write queue for response_store (#3497 )	2025-09-29 10:36:16 -07:00
batches	feat(batches, completions): add /v1/completions support to /v1/batches (#3309 )	2025-09-05 11:59:57 -07:00
files	feat(files, s3, expiration): add expires_after support to S3 files provider (#3283 )	2025-08-29 16:17:24 -07:00
inference	feat: add static embedding metadata to dynamic model listings for providers using OpenAIMixin (#3547 )	2025-09-25 17:17:00 -04:00
nvidia	feat: add static embedding metadata to dynamic model listings for providers using OpenAIMixin (#3547 )	2025-09-25 17:17:00 -04:00
utils	feat(internal): add image_url download feature to OpenAIMixin (#3516 )	2025-09-26 17:32:16 -04:00
vector_io	feat: migrate to FIPS-validated cryptographic algorithms (#3423 )	2025-09-12 11:18:19 +02:00
test_bedrock.py	fix: AWS Bedrock inference profile ID conversion for region-specific endpoints (#3386 )	2025-09-11 11:41:53 +02:00
test_configs.py	chore(rename): move llama_stack.distribution to llama_stack.core (#2975 )	2025-07-30 23:30:53 -07:00