llama-stack-mirror/llama_stack/providers/utils/inference
mergify[bot] b9299a20ed
fix: enable SQLite WAL mode to prevent database locking errors (backport #4048) (#4226)
Fixes a race condition causing "database is locked" errors during
concurrent writes to SQLite, particularly in streaming responses with
guardrails, where multiple inference calls write simultaneously.

Enable Write-Ahead Logging (WAL) mode for SQLite, which allows multiple
concurrent readers and one writer without blocking. Set busy_timeout to
5s so that SQLite retries on contention instead of failing immediately.
Remove the logic that disabled write queues for SQLite, since WAL mode
eliminates the locking issues that prompted disabling them.
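The two pragmas described above can be sketched as follows. This is an illustrative example using Python's stdlib `sqlite3`, not the repository's actual connection setup; the `connect_wal` helper name is hypothetical.

```python
import os
import sqlite3
import tempfile


def connect_wal(path: str, busy_timeout_ms: int = 5000) -> sqlite3.Connection:
    """Open a SQLite connection with WAL mode and a busy timeout.

    Hypothetical helper for illustration; llama-stack applies these
    pragmas in its own store initialization, not via this function.
    """
    conn = sqlite3.connect(path)
    # WAL allows concurrent readers alongside a single writer; the
    # pragma returns the journal mode actually in effect.
    mode = conn.execute("PRAGMA journal_mode=WAL").fetchone()[0]
    assert mode == "wal", f"expected WAL mode, got {mode!r}"
    # On lock contention, retry for up to busy_timeout_ms milliseconds
    # instead of raising "database is locked" immediately.
    conn.execute(f"PRAGMA busy_timeout={busy_timeout_ms}")
    return conn


if __name__ == "__main__":
    db_path = os.path.join(tempfile.mkdtemp(), "demo.db")
    conn = connect_wal(db_path)
    conn.execute("CREATE TABLE kv (k TEXT PRIMARY KEY, v TEXT)")
    conn.execute("INSERT INTO kv VALUES ('a', '1')")
    conn.commit()
    print(conn.execute("PRAGMA journal_mode").fetchone()[0])  # wal
```

Note that `journal_mode=WAL` is persistent per database file, while `busy_timeout` applies per connection, so the timeout pragma must be issued on every new connection.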

Fixes: test_output_safety_guardrails_safe_content[stream=True] flake

---

This is an automatic backport of pull request #4048 done by
[Mergify](https://mergify.com).

Signed-off-by: Charlie Doern <cdoern@redhat.com>
Co-authored-by: Ashwin Bharambe <ashwin.bharambe@gmail.com>
2025-11-24 11:30:57 -08:00
__init__.py chore: enable pyupgrade fixes (#1806) 2025-05-01 14:23:50 -07:00
embedding_mixin.py fix(inference): enable routing of models with provider_data alone (backport #3928) (#4142) 2025-11-12 13:41:27 -08:00
inference_store.py fix: enable SQLite WAL mode to prevent database locking errors (backport #4048) (#4226) 2025-11-24 11:30:57 -08:00
litellm_openai_mixin.py feat(api)!: support extra_body to embeddings and vector_stores APIs (#3794) 2025-10-12 19:01:52 -07:00
model_registry.py fix: allowed_models config did not filter models (backport #4030) (#4223) 2025-11-24 11:29:53 -08:00
openai_compat.py fix: Update watsonx.ai provider to use LiteLLM mixin and list all models (#3674) 2025-10-08 07:29:43 -04:00
openai_mixin.py fix: allowed_models config did not filter models (backport #4030) (#4223) 2025-11-24 11:29:53 -08:00
prompt_adapter.py chore!: Safety api refactoring to use OpenAIMessageParam (#3796) 2025-10-12 08:01:00 -07:00