llama-stack-mirror/docs/source/distributions/k8s-benchmark
ashwinb 47d5af703c
chore(responses): Refactor Responses Impl to be civilized (#3138)
# What does this PR do?
Refactors the OpenAI responses implementation by extracting streaming and tool execution logic into separate modules. This improves code organization by:

1. Creating a new `StreamingResponseOrchestrator` class in `streaming.py` to handle the streaming response generation logic
2. Moving tool execution functionality to a dedicated `ToolExecutor` class in `tool_executor.py`

## Test Plan

Existing tests
2025-08-15 00:05:35 +00:00
..
apply.sh chore: setup for performance benchmarking (#3096) 2025-08-13 10:58:22 -07:00
locust-k8s.yaml chore: setup for performance benchmarking (#3096) 2025-08-13 10:58:22 -07:00
locustfile.py chore: setup for performance benchmarking (#3096) 2025-08-13 10:58:22 -07:00
openai-mock-deployment.yaml chore: setup for performance benchmarking (#3096) 2025-08-13 10:58:22 -07:00
openai-mock-server.py chore(responses): Refactor Responses Impl to be civilized (#3138) 2025-08-15 00:05:35 +00:00
stack-configmap.yaml chore: setup for performance benchmarking (#3096) 2025-08-13 10:58:22 -07:00
stack-k8s.yaml.template chore: setup for performance benchmarking (#3096) 2025-08-13 10:58:22 -07:00
stack_run_config.yaml chore: setup for performance benchmarking (#3096) 2025-08-13 10:58:22 -07:00