llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-03 18:00:36 +00:00

History

Ashwin Bharambe 7c63aebd64 feat(responses)!: add reasoning and annotation added events (#3793 ) Implements missing streaming events from OpenAI Responses API spec: - reasoning text/summary events for o1/o3 models, - refusal events for safety moderation - annotation events for citations, - and file search streaming events. Added optional reasoning_content field to chat completion chunks to support non-standard provider extensions. NOTE: OpenAI does _not_ fill reasoning_content when users use the chat_completion APIs. This means there is no way for us to implement Responses (with reasoning) by using OpenAI chat completions! We'd need to transparently punt to OpenAI's responses endpoints if we wish to do that. For others though (vLLM, etc.) we can use it. ## Test Plan File search streaming test passes: ``` ./scripts/integration-tests.sh --stack-config server:ci-tests \ --suite responses --setup gpt --inference-mode replay --pattern test_response_file_search_streaming_events ``` Need more complex setup and validation for reasoning tests (need a vLLM powered OSS model maybe gpt-oss which can return reasoning_content). I will do that in a followup PR.		2025-10-11 16:47:14 -07:00
..
agents	feat(responses)!: add reasoning and annotation added events (#3793 )	2025-10-11 16:47:14 -07:00
batches	chore!: add double routes for v1/openai/v1 (#3636 )	2025-10-02 16:11:05 +02:00
benchmarks	feat: introduce API leveling, post_training, eval to v1alpha (#3449 )	2025-09-26 16:18:07 +02:00
common	feat: Add support for Conversations in Responses API (#3743 )	2025-10-10 11:57:40 -07:00
conversations	feat: Add OpenAI Conversations API (#3429 )	2025-10-03 08:47:18 -07:00
datasetio	feat(api): implement v1beta leveling, and additional alpha (#3594 )	2025-10-01 09:18:11 -07:00
datasets	feat(api): implement v1beta leveling, and additional alpha (#3594 )	2025-10-01 09:18:11 -07:00
eval	feat: introduce API leveling, post_training, eval to v1alpha (#3449 )	2025-09-26 16:18:07 +02:00
files	docs: API docstrings cleanup for better documentation rendering (#3661 )	2025-10-06 10:46:33 -07:00
inference	feat(responses)!: add reasoning and annotation added events (#3793 )	2025-10-11 16:47:14 -07:00
inspect	fix(auth): allow unauthenticated access to health and version endpoints (#3736 )	2025-10-10 13:41:43 -07:00
models	docs: API docstrings cleanup for better documentation rendering (#3661 )	2025-10-06 10:46:33 -07:00
post_training	feat: introduce API leveling, post_training, eval to v1alpha (#3449 )	2025-09-26 16:18:07 +02:00
prompts	docs: API docstrings cleanup for better documentation rendering (#3661 )	2025-10-06 10:46:33 -07:00
providers	docs: API docstrings cleanup for better documentation rendering (#3661 )	2025-10-06 10:46:33 -07:00
safety	docs: API docstrings cleanup for better documentation rendering (#3661 )	2025-10-06 10:46:33 -07:00
scoring	feat: introduce API leveling, post_training, eval to v1alpha (#3449 )	2025-09-26 16:18:07 +02:00
scoring_functions	feat: introduce API leveling, post_training, eval to v1alpha (#3449 )	2025-09-26 16:18:07 +02:00
shields	feat: introduce API leveling, post_training, eval to v1alpha (#3449 )	2025-09-26 16:18:07 +02:00
synthetic_data_generation	feat: introduce API leveling, post_training, eval to v1alpha (#3449 )	2025-09-26 16:18:07 +02:00
telemetry	feat(api): implement v1beta leveling, and additional alpha (#3594 )	2025-10-01 09:18:11 -07:00
tools	feat(tools)!: substantial clean up of "Tool" related datatypes (#3627 )	2025-10-02 15:12:03 -07:00
vector_dbs	chore!: BREAKING CHANGE removing VectorDB APIs (#3774 )	2025-10-11 14:07:08 -07:00
vector_io	chore!: add double routes for v1/openai/v1 (#3636 )	2025-10-02 16:11:05 +02:00
__init__.py	API Updates (#73 )	2024-09-17 19:51:35 -07:00
datatypes.py	chore!: BREAKING CHANGE removing VectorDB APIs (#3774 )	2025-10-11 14:07:08 -07:00
resource.py	feat: Adding OpenAI Prompts API (#3319 )	2025-09-08 11:05:13 -04:00
version.py	feat: introduce API leveling, post_training, eval to v1alpha (#3449 )	2025-09-26 16:18:07 +02:00