llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-03 09:53:45 +00:00

History

mergify[bot] a6c3a9cadf Some checks failed Integration Auth Tests / test-matrix (oauth2_token) (push) Failing after 3s Details Integration Tests (Replay) / generate-matrix (push) Successful in 6s Details SqlStore Integration Tests / test-postgres (3.12) (push) Failing after 48s Details SqlStore Integration Tests / test-postgres (3.13) (push) Failing after 53s Details Vector IO Integration Tests / test-matrix (push) Failing after 1m10s Details Unit Tests / unit-tests (3.13) (push) Failing after 2m41s Details Unit Tests / unit-tests (3.12) (push) Failing after 2m44s Details Pre-commit / pre-commit (push) Successful in 3m22s Details Integration Tests (Replay) / Integration Tests (, , , client=, ) (push) Failing after 3m16s Details fix: harden storage semantics (backport #4118 ) (#4138 ) Fixes issues in the storage system by guaranteeing immediate durability for responses and ensuring background writers stay alive. Three related fixes: * Responses to the OpenAI-compatible API now write directly to Postgres/SQLite inside the request instead of detouring through an async queue that might never drain; this restores the expected read-after-write behavior and removes the "response not found" races reported by users. * The access-control shim was stamping owner_principal/access_attributes as SQL NULL, which Postgres interprets as non-public rows; fixing it to use the empty-string/JSON-null pattern means conversations and responses stored without an authenticated user stay queryable (matching SQLite). * The inference-store queue remains for batching, but its worker tasks now start lazily on the live event loop so server startup doesn't cancel them—writes keep flowing even when the stack is launched via llama stack run. Closes #4115 ### Test Plan Added a matrix entry to test our "base" suite against Postgres as the store.<hr>This is an automatic backport of pull request #4118 done by [Mergify](https://mergify.com). --------- Co-authored-by: Ashwin Bharambe <ashwin.bharambe@gmail.com>		2025-11-12 13:01:21 -08:00
..
bedrock	feat: use SecretStr for inference provider auth credentials (#3724 )	2025-10-10 07:32:50 -07:00
common	chore(rename): move llama_stack.distribution to llama_stack.core (#2975 )	2025-07-30 23:30:53 -07:00
datasetio	chore(misc): make tests and starter faster (#3042 )	2025-08-05 14:55:05 -07:00
files	fix(expires_after): make sure multipart/form-data is properly parsed (#3612 )	2025-09-30 16:14:03 -04:00
inference	fix: harden storage semantics (backport #4118 ) (#4138 )	2025-11-12 13:01:21 -08:00
kvstore	feat(stores)!: use backend storage references instead of configs (#3697 )	2025-10-20 13:20:09 -07:00
memory	fix: remove consistency checks (#3881 )	2025-10-21 14:40:14 -07:00
responses	fix: harden storage semantics (backport #4118 ) (#4138 )	2025-11-12 13:01:21 -08:00
scoring	chore: enable pyupgrade fixes (#1806 )	2025-05-01 14:23:50 -07:00
sqlstore	fix: harden storage semantics (backport #4118 ) (#4138 )	2025-11-12 13:01:21 -08:00
telemetry	test(telemetry): Telemetry Tests (#3805 )	2025-10-17 10:43:33 -07:00
tools	feat(tools)!: substantial clean up of "Tool" related datatypes (#3627 )	2025-10-02 15:12:03 -07:00
vector_io	feat: migrate to FIPS-validated cryptographic algorithms (#3423 )	2025-09-12 11:18:19 +02:00
__init__.py	API Updates (#73 )	2024-09-17 19:51:35 -07:00
pagination.py	chore(refact): move paginate_records fn outside of datasetio (#2137 )	2025-05-12 10:56:14 -07:00
scheduler.py	refactor(logging): rename llama_stack logger categories (#3065 )	2025-08-21 17:31:04 -07:00