llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-04 10:10:36 +00:00

History

ashwinb 47d5af703c chore(responses): Refactor Responses Impl to be civilized (#3138 ) # What does this PR do? Refactors the OpenAI responses implementation by extracting streaming and tool execution logic into separate modules. This improves code organization by: 1. Creating a new `StreamingResponseOrchestrator` class in `streaming.py` to handle the streaming response generation logic 2. Moving tool execution functionality to a dedicated `ToolExecutor` class in `tool_executor.py` ## Test Plan Existing tests		2025-08-15 00:05:35 +00:00
..
apply.sh	chore: setup for performance benchmarking (#3096 )	2025-08-13 10:58:22 -07:00
locust-k8s.yaml	chore: setup for performance benchmarking (#3096 )	2025-08-13 10:58:22 -07:00
locustfile.py	chore: setup for performance benchmarking (#3096 )	2025-08-13 10:58:22 -07:00
openai-mock-deployment.yaml	chore: setup for performance benchmarking (#3096 )	2025-08-13 10:58:22 -07:00
openai-mock-server.py	chore(responses): Refactor Responses Impl to be civilized (#3138 )	2025-08-15 00:05:35 +00:00
stack-configmap.yaml	chore: setup for performance benchmarking (#3096 )	2025-08-13 10:58:22 -07:00
stack-k8s.yaml.template	chore: setup for performance benchmarking (#3096 )	2025-08-13 10:58:22 -07:00
stack_run_config.yaml	chore: setup for performance benchmarking (#3096 )	2025-08-13 10:58:22 -07:00