llama-stack-mirror/llama_stack/providers/remote/inference/vllm
Wen Zhou ea964a13ec fix: add missing extra_body to client.chat.completions.create() call
- test requires vLLM as provider, current is skipped in GH Action
- test:
>export VLLM_URL="http://localhost:8000"
>pytest tests/integration/inference/test_openai_completion.py::test_openai_chat_completion_extra_body -v --stack-config="inference=remote::vllm"

Signed-off-by: Wen Zhou <wenzhou@redhat.com>
2025-07-11 13:02:11 +02:00
..
__init__.py Fix precommit check after moving to ruff (#927) 2025-02-02 06:46:45 -08:00
config.py refactor(env)!: enhanced environment variable substitution (#2490) 2025-06-26 08:20:08 +05:30
vllm.py fix: add missing extra_body to client.chat.completions.create() call 2025-07-11 13:02:11 +02:00