llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-03 18:00:36 +00:00

History

Ashwin Bharambe 606f4cf281 fix(expires_after): make sure multipart/form-data is properly parsed (#3612 ) https://github.com/llamastack/llama-stack/pull/3604 broke multipart form data field parsing for the Files API since it changed its shape -- so as to match the API exactly to the OpenAI spec even in the generated client code. The underlying reason is that multipart/form-data cannot transport structured nested fields. Each field must be str-serialized. The client (specifically the OpenAI client whose behavior we must match), transports sub-fields as `expires_after[anchor]` and `expires_after[seconds]`, etc. We must be able to handle these fields somehow on the server without compromising the shape of the YAML spec. This PR "fixes" this by adding a dependency to convert the data. The main trade-off here is that we must add this `Depends()` annotation on every provider implementation for Files. This is a headache, but a much more reasonable one (in my opinion) given the alternatives. ## Test Plan Tests as shown in https://github.com/llamastack/llama-stack/pull/3604#issuecomment-3351090653 pass.		2025-09-30 16:14:03 -04:00
..
agents	fix: mcp tool with array type should include items (#3602 )	2025-09-29 23:11:41 -07:00
batches	feat(batches, completions): add /v1/completions support to /v1/batches (#3309 )	2025-09-05 11:59:57 -07:00
datasetio	chore(misc): make tests and starter faster (#3042 )	2025-08-05 14:55:05 -07:00
eval	feat: update eval runner to use openai endpoints (#3588 )	2025-09-29 13:13:53 -07:00
files/localfs	fix(expires_after): make sure multipart/form-data is properly parsed (#3612 )	2025-09-30 16:14:03 -04:00
inference	chore(api): remove batch inference (#3261 )	2025-09-26 14:35:34 -07:00
ios/inference	chore: removed executorch submodule (#1265 )	2025-02-25 21:57:21 -08:00
post_training	chore(pre-commit): add pre-commit hook to enforce llama_stack logger usage (#3061 )	2025-08-20 07:15:35 -04:00
safety	feat: use /v1/chat/completions for safety model inference (#3591 )	2025-09-30 11:01:44 -07:00
scoring	feat: create HTTP DELETE API endpoints to unregister ScoringFn and Benchmark resources in Llama Stack (#3371 )	2025-09-15 12:43:38 -07:00
telemetry	chore: remove extra logging (#3574 )	2025-09-27 11:22:54 -07:00
tool_runtime	chore: Updating documentation, adding exception handling for Vector Stores in RAG Tool, more tests on migration, and migrate off of inference_api for context_retriever for RAG (#3367 )	2025-09-11 14:20:11 +02:00
vector_io	refactor: use generic WeightedInMemoryAggregator for hybrid search in SQLiteVecIndex (#3303 )	2025-09-02 10:38:35 -07:00
__init__.py	`impls` -> `inline`, `adapters` -> `remote` (#381 )	2024-11-06 14:54:05 -08:00