Ashwin Bharambe 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								e9b4278a51 
								
							 
						 
						
							
							
								
								feat(responses)!: improve responses + conversations implementations ( #3810 )  
							
							... 
							
							
							
							This PR updates the Conversation item related types and improves a
couple critical parts of the implemenation:
- it creates a streaming output item for the final assistant message
output by
  the model. until now we only added content parts and included that
  message in the final response.
- rewrites the conversation update code completely to account for items
  other than messages (tool calls, outputs, etc.)
## Test Plan
Used the test script from
https://github.com/llamastack/llama-stack-client-python/pull/281  for
this
```
TEST_API_BASE_URL=http://localhost:8321/v1  \
  pytest tests/integration/test_agent_turn_step_events.py::test_client_side_function_tool -xvs
``` 
							
						 
						
							2025-10-15 09:36:11 -07:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									ehhuang 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								14a94e9894 
								
							 
						 
						
							
							
								
								fix: responses <> chat completion input conversion ( #3645 )  
							
							... 
							
							
	
		
			
	 
	
	
		
	
	
		
			
				
	Integration Auth Tests / test-matrix (oauth2_token) (push) Failing after 2s 
				
			 
		
			
				
	SqlStore Integration Tests / test-postgres (3.12) (push) Failing after 1s 
				
			 
		
			
				
	Python Package Build Test / build (3.12) (push) Failing after 2s 
				
			 
		
			
				
	Integration Tests (Replay) / Integration Tests (, , , client=, ) (push) Failing after 5s 
				
			 
		
			
				
	Test External Providers Installed via Module / test-external-providers-from-module (venv) (push) Has been skipped 
				
			 
		
			
				
	API Conformance Tests / check-schema-compatibility (push) Successful in 10s 
				
			 
		
			
				
	Vector IO Integration Tests / test-matrix (push) Failing after 5s 
				
			 
		
			
				
	Python Package Build Test / build (3.13) (push) Failing after 3s 
				
			 
		
			
				
	SqlStore Integration Tests / test-postgres (3.13) (push) Failing after 9s 
				
			 
		
			
				
	Test External API and Providers / test-external (venv) (push) Failing after 6s 
				
			 
		
			
				
	Unit Tests / unit-tests (3.12) (push) Failing after 5s 
				
			 
		
			
				
	Unit Tests / unit-tests (3.13) (push) Failing after 6s 
				
			 
		
			
				
	UI Tests / ui-tests (22) (push) Successful in 33s 
				
			 
		
			
				
	Pre-commit / pre-commit (push) Successful in 1m27s 
				
			 
		
		
	 
 
	 
							
							# What does this PR do?
closes  #3268 
closes  #3498 
When resuming from previous response ID, currently we attempt to convert
from the stored responses input to chat completion messages, which is
not always possible, e.g. for tool calls where some data is lost once
converted from chat completion message to repsonses input format.
This PR stores the chat completion messages that correspond to the
_last_ call to chat completion, which is sufficient to be resumed from
in the next responses API call, where we load these saved messages and
skip conversion entirely.
Separate issue to optimize storage:
https://github.com/llamastack/llama-stack/issues/3646 
## Test Plan
existing CI tests 
							
						 
						
							2025-10-02 16:01:08 -07:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									ehhuang 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								ceca3c056f 
								
							 
						 
						
							
							
								
								chore: fix/add logging categories ( #3658 )  
							
							... 
							
							
							
							# What does this PR do?
These aren't controllable by LLAMA_STACK_LOGGING
```
tests/integration/agents/test_persistence.py::test_delete_agents_and_sessions SKIPPED (This ...) [  3%]
tests/integration/agents/test_persistence.py::test_get_agent_turns_and_steps SKIPPED (This t...) [  7%]
tests/integration/agents/test_openai_responses.py::test_responses_store[openai_client-txt=openai/gpt-4o-tools0-True] 
instantiating llama_stack_client
WARNING  2025-10-02 13:14:33,472 root:258 uncategorized: Unknown logging category: testing. Falling back to default 'root' level: 20                  
WARNING  2025-10-02 13:14:33,477 root:258 uncategorized: Unknown logging category: providers::utils. Falling back to default 'root' level: 20         
WARNING  2025-10-02 13:14:33,960 root:258 uncategorized: Unknown logging category: tokenizer_utils. Falling back to default 'root' level: 20          
WARNING  2025-10-02 13:14:33,962 root:258 uncategorized: Unknown logging category: models::llama. Falling back to default 'root' level: 20            
WARNING  2025-10-02 13:14:33,963 root:258 uncategorized: Unknown logging category: models::llama. Falling back to default 'root' level: 20            
WARNING  2025-10-02 13:14:33,968 root:258 uncategorized: Unknown logging category: providers::utils. Falling back to default 'root' level: 20         
WARNING  2025-10-02 13:14:33,974 root:258 uncategorized: Unknown logging category: providers::utils. Falling back to default 'root' level: 20         
WARNING  2025-10-02 13:14:33,978 root:258 uncategorized: Unknown logging category: providers::utils. Falling back to default 'root' level: 20         
WARNING  2025-10-02 13:14:35,350 root:258 uncategorized: Unknown logging category: providers::utils. Falling back to default 'root' level: 20         
WARNING  2025-10-02 13:14:35,366 root:258 uncategorized: Unknown logging category: providers::utils. Falling back to default 'root' level: 20         
WARNING  2025-10-02 13:14:35,489 root:258 uncategorized: Unknown logging category: providers::utils. Falling back to default 'root' level: 20         
WARNING  2025-10-02 13:14:35,490 root:258 uncategorized: Unknown logging category: inference_store. Falling back to default 'root' level: 20          
WARNING  2025-10-02 13:14:35,697 root:258 uncategorized: Unknown logging category: providers::utils. Falling back to default 'root' level: 20         
WARNING  2025-10-02 13:14:35,918 root:258 uncategorized: Unknown logging category: providers::utils. Falling back to default 'root' level: 20         
INFO     2025-10-02 13:14:35,945 llama_stack.providers.utils.inference.inference_store:74 inference_store: Write queue disabled for SQLite to avoid   
         concurrency issues                                                                                                                           
WARNING  2025-10-02 13:14:36,172 root:258 uncategorized: Unknown logging category: files. Falling back to default 'root' level: 20                    
WARNING  2025-10-02 13:14:36,218 root:258 uncategorized: Unknown logging category: providers::utils. Falling back to default 'root' level: 20         
WARNING  2025-10-02 13:14:36,219 root:258 uncategorized: Unknown logging category: vector_io. Falling back to default 'root' level: 20                
WARNING  2025-10-02 13:14:36,231 root:258 uncategorized: Unknown logging category: vector_io. Falling back to default 'root' level: 20                
WARNING  2025-10-02 13:14:36,255 root:258 uncategorized: Unknown logging category: tool_runtime. Falling back to default 'root' level: 20             
WARNING  2025-10-02 13:14:36,486 root:258 uncategorized: Unknown logging category: responses_store. Falling back to default 'root' level: 20          
WARNING  2025-10-02 13:14:36,503 root:258 uncategorized: Unknown logging category: openai::responses. Falling back to default 'root' level: 20        
INFO     2025-10-02 13:14:36,524 llama_stack.providers.utils.responses.responses_store:80 responses_store: Write queue disabled for SQLite to avoid   
         concurrency issues                                                                                                                           
WARNING  2025-10-02 13:14:36,528 root:258 uncategorized: Unknown logging category: providers::utils. Falling back to default 'root' level: 20         
WARNING  2025-10-02 13:14:36,703 root:258 uncategorized: Unknown logging category: uncategorized. Falling back to default 'root' level: 20 
```
## Test Plan 
							
						 
						
							2025-10-02 13:10:13 -07:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									ehhuang 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								8ab6684a94 
								
							 
						 
						
							
							
								
								chore: introduce write queue for response_store ( #3497 )  
							
							... 
							
							
							
							# What does this PR do?
Mirroring the same changes that was used for inference_store:
https://github.com/llamastack/llama-stack/pull/3383 
Will follow up with a shared internal API for managing these write
queues.
## Test Plan
existing tests 
							
						 
						
							2025-09-29 10:36:16 -07:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									ehhuang 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								f44eb935c4 
								
							 
						 
						
							
							
								
								chore: simplify authorized sqlstore ( #3496 )  
							
							... 
							
							
	
		
			
	 
	
	
		
	
	
		
			
				
	SqlStore Integration Tests / test-postgres (3.12) (push) Failing after 1s 
				
			 
		
			
				
	Integration Auth Tests / test-matrix (oauth2_token) (push) Failing after 1s 
				
			 
		
			
				
	SqlStore Integration Tests / test-postgres (3.13) (push) Failing after 0s 
				
			 
		
			
				
	Test External Providers Installed via Module / test-external-providers-from-module (venv) (push) Has been skipped 
				
			 
		
			
				
	Python Package Build Test / build (3.12) (push) Failing after 1s 
				
			 
		
			
				
	Python Package Build Test / build (3.13) (push) Failing after 1s 
				
			 
		
			
				
	Integration Tests (Replay) / Integration Tests (, , , client=, ) (push) Failing after 2s 
				
			 
		
			
				
	Unit Tests / unit-tests (3.13) (push) Failing after 3s 
				
			 
		
			
				
	Update ReadTheDocs / update-readthedocs (push) Failing after 3s 
				
			 
		
			
				
	Test External API and Providers / test-external (venv) (push) Failing after 4s 
				
			 
		
			
				
	Vector IO Integration Tests / test-matrix (push) Failing after 4s 
				
			 
		
			
				
	UI Tests / ui-tests (22) (push) Successful in 35s 
				
			 
		
			
				
	API Conformance Tests / check-schema-compatibility (push) Successful in 6s 
				
			 
		
			
				
	Unit Tests / unit-tests (3.12) (push) Failing after 3s 
				
			 
		
			
				
	Pre-commit / pre-commit (push) Successful in 1m19s 
				
			 
		
		
	 
 
	 
							
							# What does this PR do?
This PR is generated with AI and reviewed by me.
Refactors the AuthorizedSqlStore class to store the access policy as an
instance variable rather than passing it as a parameter to each method
call. This simplifies the API.
# Test Plan
existing tests 
							
						 
						
							2025-09-19 16:13:56 -07:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Ashwin Bharambe 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								2665f00102 
								
							 
						 
						
							
							
								
								chore(rename): move llama_stack.distribution to llama_stack.core ( #2975 )  
							
							... 
							
							
							
							We would like to rename the term `template` to `distribution`. To
prepare for that, this is a precursor.
cc @leseb 
							
						 
						
							2025-07-30 23:30:53 -07:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Sébastien Han 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								8d8e90d78e 
								
							 
						 
						
							
							
								
								fix: add missing argument and methods ( #2550 )  
							
							... 
							
							
							
							# What does this PR do?
Resolves:
```
mypy.....................................................................Failed
- hook id: mypy
- exit code: 1
llama_stack/providers/utils/responses/responses_store.py:119: error: Missing positional argument "policy" in call to "fetch_one" of "AuthorizedSqlStore"  [call-arg]
llama_stack/providers/utils/responses/responses_store.py:122: error: "AuthorizedSqlStore" has no attribute "delete"  [attr-defined]
Found 2 errors in 1 file (checked 403 source files)
```
Signed-off-by: Sébastien Han <seb@redhat.com> 
							
						 
						
							2025-06-30 14:55:37 +02:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Krzysztof Malczuk 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								be9bf68246 
								
							 
						 
						
							
							
								
								feat: Add webmethod for deleting openai responses ( #2160 )  
							
							... 
							
							
	
		
			
	 
	
	
		
	
	
		
			
				
	Integration Tests / test-matrix (library, 3.13, datasets) (push) Failing after 16s 
				
			 
		
			
				
	Integration Tests / test-matrix (http, 3.13, datasets) (push) Failing after 11s 
				
			 
		
			
				
	Integration Tests / test-matrix (library, 3.13, inference) (push) Failing after 12s 
				
			 
		
			
				
	Integration Tests / test-matrix (http, 3.13, scoring) (push) Failing after 12s 
				
			 
		
			
				
	Integration Tests / test-matrix (library, 3.13, post_training) (push) Failing after 9s 
				
			 
		
			
				
	Integration Tests / test-matrix (library, 3.12, inspect) (push) Failing after 11s 
				
			 
		
			
				
	Integration Tests / test-matrix (library, 3.12, tool_runtime) (push) Failing after 11s 
				
			 
		
			
				
	Integration Tests / test-matrix (library, 3.13, tool_runtime) (push) Failing after 8s 
				
			 
		
			
				
	Integration Tests / test-matrix (library, 3.12, post_training) (push) Failing after 12s 
				
			 
		
			
				
	Integration Tests / test-matrix (library, 3.13, providers) (push) Failing after 12s 
				
			 
		
			
				
	Integration Tests / test-matrix (library, 3.13, inspect) (push) Failing after 12s 
				
			 
		
			
				
	Integration Tests / test-matrix (library, 3.13, scoring) (push) Failing after 11s 
				
			 
		
			
				
	Integration Tests / test-matrix (http, 3.12, providers) (push) Failing after 17s 
				
			 
		
			
				
	Integration Tests / test-matrix (http, 3.13, agents) (push) Failing after 11s 
				
			 
		
			
				
	Vector IO Integration Tests / test-matrix (3.12, inline::faiss) (push) Failing after 5s 
				
			 
		
			
				
	Vector IO Integration Tests / test-matrix (3.12, remote::chromadb) (push) Failing after 7s 
				
			 
		
			
				
	Integration Tests / test-matrix (library, 3.13, vector_io) (push) Failing after 16s 
				
			 
		
			
				
	Vector IO Integration Tests / test-matrix (3.12, inline::sqlite-vec) (push) Failing after 18s 
				
			 
		
			
				
	Vector IO Integration Tests / test-matrix (3.13, inline::sqlite-vec) (push) Failing after 19s 
				
			 
		
			
				
	Vector IO Integration Tests / test-matrix (3.12, remote::pgvector) (push) Failing after 21s 
				
			 
		
			
				
	Test External Providers / test-external-providers (venv) (push) Failing after 9s 
				
			 
		
			
				
	Vector IO Integration Tests / test-matrix (3.13, remote::pgvector) (push) Failing after 19s 
				
			 
		
			
				
	Unit Tests / unit-tests (3.12) (push) Failing after 9s 
				
			 
		
			
				
	Update ReadTheDocs / update-readthedocs (push) Failing after 7s 
				
			 
		
			
				
	Unit Tests / unit-tests (3.13) (push) Failing after 10s 
				
			 
		
			
				
	Vector IO Integration Tests / test-matrix (3.13, inline::faiss) (push) Failing after 39s 
				
			 
		
			
				
	Vector IO Integration Tests / test-matrix (3.13, remote::chromadb) (push) Failing after 37s 
				
			 
		
			
				
	Python Package Build Test / build (3.13) (push) Failing after 33s 
				
			 
		
			
				
	Python Package Build Test / build (3.12) (push) Failing after 36s 
				
			 
		
			
				
	Pre-commit / pre-commit (push) Failing after 1m19s 
				
			 
		
		
	 
 
	 
							
							# What does this PR do?
This PR creates a webmethod for deleting open AI responses, adds and
implementation for it and makes an integration test for the OpenAI
delete response method.
[//]: # (If resolving an issue, uncomment and update the line below)
# (Closes  #2077 )
## Test Plan
Ran the standard tests and the pre-commit hooks and the unit tests.
# (## Documentation)
For this pr I made the routes and implementation based on the current
get and create methods. The unit tests were not able to handle this test
due to the mock interface in use, which did not allow for effective CRUD
to be tested. I instead created an integration test to match the
existing ones in the test_openai_responses. 
							
						 
						
							2025-06-30 11:28:02 +02:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									ehhuang 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								d3b60507d7 
								
							 
						 
						
							
							
								
								feat: support auth attributes in inference/responses stores ( #2389 )  
							
							... 
							
							
							
							# What does this PR do?
Inference/Response stores now store user attributes when inserting, and
respects them when fetching.
## Test Plan
pytest tests/unit/utils/test_sqlstore.py 
							
						 
						
							2025-06-20 10:24:45 -07:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Sébastien Han 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								c20388c424 
								
							 
						 
						
							
							
								
								ci: add python package build test ( #2457 )  
							
							... 
							
							
							
							# What does this PR do?
We now test a package build on every PRs.
Closes: https://github.com/meta-llama/llama-stack/issues/2406 
Signed-off-by: Sébastien Han <seb@redhat.com> 
							
						 
						
							2025-06-19 18:57:32 +05:30 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									ehhuang 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								15f630e5da 
								
							 
						 
						
							
							
								
								feat: support pagination in inference/responses stores ( #2397 )  
							
							... 
							
							
	
		
			
	 
	
	
		
	
	
		
			
				
	Integration Tests / test-matrix (http, 3.12, agents) (push) Failing after 23s 
				
			 
		
			
				
	Integration Tests / test-matrix (library, 3.11, datasets) (push) Failing after 5s 
				
			 
		
			
				
	Integration Tests / test-matrix (library, 3.10, vector_io) (push) Failing after 7s 
				
			 
		
			
				
	Integration Tests / test-matrix (http, 3.10, vector_io) (push) Failing after 27s 
				
			 
		
			
				
	Integration Tests / test-matrix (http, 3.12, vector_io) (push) Failing after 19s 
				
			 
		
			
				
	Integration Tests / test-matrix (library, 3.10, post_training) (push) Failing after 10s 
				
			 
		
			
				
	Integration Tests / test-matrix (library, 3.10, tool_runtime) (push) Failing after 11s 
				
			 
		
			
				
	Integration Tests / test-matrix (library, 3.11, inspect) (push) Failing after 8s 
				
			 
		
			
				
	Integration Tests / test-matrix (library, 3.11, inference) (push) Failing after 9s 
				
			 
		
			
				
	Integration Tests / test-matrix (library, 3.11, providers) (push) Failing after 8s 
				
			 
		
			
				
	Integration Tests / test-matrix (library, 3.11, tool_runtime) (push) Failing after 8s 
				
			 
		
			
				
	Integration Tests / test-matrix (library, 3.11, scoring) (push) Failing after 10s 
				
			 
		
			
				
	Integration Tests / test-matrix (library, 3.11, agents) (push) Failing after 11s 
				
			 
		
			
				
	Integration Tests / test-matrix (library, 3.11, post_training) (push) Failing after 10s 
				
			 
		
			
				
	Integration Tests / test-matrix (library, 3.12, agents) (push) Failing after 7s 
				
			 
		
			
				
	Integration Tests / test-matrix (library, 3.11, vector_io) (push) Failing after 10s 
				
			 
		
			
				
	Integration Tests / test-matrix (library, 3.12, inference) (push) Failing after 7s 
				
			 
		
			
				
	Integration Tests / test-matrix (library, 3.12, datasets) (push) Failing after 10s 
				
			 
		
			
				
	Integration Tests / test-matrix (library, 3.12, inspect) (push) Failing after 10s 
				
			 
		
			
				
	Integration Tests / test-matrix (library, 3.12, post_training) (push) Failing after 8s 
				
			 
		
			
				
	Integration Tests / test-matrix (library, 3.12, scoring) (push) Failing after 8s 
				
			 
		
			
				
	Integration Tests / test-matrix (library, 3.12, providers) (push) Failing after 10s 
				
			 
		
			
				
	Integration Tests / test-matrix (library, 3.12, vector_io) (push) Failing after 44s 
				
			 
		
			
				
	Integration Tests / test-matrix (library, 3.12, tool_runtime) (push) Failing after 46s 
				
			 
		
			
				
	Test External Providers / test-external-providers (venv) (push) Failing after 41s 
				
			 
		
			
				
	Unit Tests / unit-tests (3.10) (push) Failing after 52s 
				
			 
		
			
				
	Unit Tests / unit-tests (3.12) (push) Failing after 18s 
				
			 
		
			
				
	Unit Tests / unit-tests (3.11) (push) Failing after 20s 
				
			 
		
			
				
	Unit Tests / unit-tests (3.13) (push) Failing after 16s 
				
			 
		
			
				
	Pre-commit / pre-commit (push) Successful in 2m0s 
				
			 
		
		
	 
 
	 
							
							# What does this PR do?
## Test Plan
added unit tests 
							
						 
						
							2025-06-16 22:43:35 -07:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									ehhuang 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								15b0a67555 
								
							 
						 
						
							
							
								
								feat: add responses input items api ( #2239 )  
							
							... 
							
							
							
							# What does this PR do?
TSIA
## Test Plan
added integration and unit tests 
							
						 
						
							2025-05-24 07:05:53 -07:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									ehhuang 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								5844c2da68 
								
							 
						 
						
							
							
								
								feat: add list responses API ( #2233 )  
							
							... 
							
							
							
							# What does this PR do?
This is not part of the official OpenAI API, but we'll use this for the
logs UI.
In order to support more filtering options, I'm adopting the newly
introduced sql store in in place of the kv store.
## Test Plan
Added integration/unit tests. 
							
						 
						
							2025-05-23 13:16:48 -07:00