llama-stack-mirror/tests/integration/recordings/responses/b9f6e724ae06.json
Matthew Farrellee f4ab154ade
feat: add dynamic model registration support to TGI inference (#3417)
# What does this PR do?

Adds dynamic model registration support to the TGI inference provider.

Adds a new `overwrite_completion_id` feature to `OpenAIMixin` to work around TGI
always returning `id=""`.
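
To illustrate the idea, here is a minimal sketch of replacing empty completion ids in a stream of chunks. It is not the `OpenAIMixin` implementation: the function name `fill_completion_ids`, the `cmpl-` prefix, and the choice to share one generated id across the whole stream are all assumptions made for this example.

```python
import uuid
from typing import Iterable, Iterator


def fill_completion_ids(chunks: Iterable[dict], prefix: str = "cmpl-") -> Iterator[dict]:
    """Yield chunks, replacing an empty or missing "id" with a generated one.

    Hypothetical helper: one id is generated per stream and reused for every
    chunk that arrives without an id. Chunks that already carry an id are
    passed through unchanged.
    """
    generated_id = f"{prefix}{uuid.uuid4()}"
    for chunk in chunks:
        if not chunk.get("id"):
            # Copy instead of mutating the caller's dict.
            chunk = {**chunk, "id": generated_id}
        yield chunk


# Chunks shaped like the TGI output, which carries id="".
stream = [{"id": "", "text": " several"}, {"id": "", "text": " times"}]
for chunk in fill_completion_ids(stream):
    print(chunk["id"], repr(chunk["text"]))
```

Sharing one id per stream keeps the chunks of a single completion correlated; generating a fresh id per chunk would be an equally valid reading of "overwrite".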

## Test Plan

tgi: `docker run --gpus all --shm-size 1g -p 8080:80 -v /data:/data ghcr.io/huggingface/text-generation-inference --model-id Qwen/Qwen3-0.6B`

stack: `TGI_URL=http://localhost:8080 uv run llama stack build --image-type venv --distro ci-tests --run`

test: `./scripts/integration-tests.sh --stack-config http://localhost:8321 --setup tgi --subdirs inference --pattern openai`
2025-09-15 15:52:40 -04:00


{
"request": {
"method": "POST",
"url": "http://localhost:8080/v1/v1/completions",
"headers": {},
"body": {
"model": "Qwen/Qwen3-0.6B",
"prompt": "Respond to this question and explain your answer. Complete the sentence using one word: Roses are red, violets are ",
"max_tokens": 50,
"stream": true
},
"endpoint": "/v1/completions",
"model": "Qwen/Qwen3-0.6B"
},
"response": {
"body": [
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": " several"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": " several"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": " times"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": " more"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": " popular"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": " than"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": " ____"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": ".\n"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": "Answer"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": ":\n\n"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": "The"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": " roses"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": " are"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": " red"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": ","
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": " v"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": "io"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": "lets"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": " are"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": " several"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": " several"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": " times"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": " more"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": " popular"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": " than"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": " **"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": "numbers"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": "**"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": ".\n\n"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": "Explanation"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": ":"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": " \""
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": "se"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": "veral"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": " several"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": " times"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": " more"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": " popular"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": " than"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": "\""
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": " can"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": " be"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": " replaced"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": " with"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": " \""
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": "numbers"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": "\""
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": " as"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "",
"index": 0,
"logprobs": null,
"text": " the"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": null
}
},
{
"__type__": "openai.types.completion.Completion",
"__data__": {
"id": "",
"choices": [
{
"finish_reason": "length",
"index": 0,
"logprobs": null,
"text": " number"
}
],
"created": 1757550367,
"model": "Qwen/Qwen3-0.6B",
"object": "text_completion",
"system_fingerprint": "3.3.5-dev0-sha-1b90c50",
"usage": {
"completion_tokens": 50,
"prompt_tokens": 25,
"total_tokens": 75,
"completion_tokens_details": null,
"prompt_tokens_details": null
}
}
}
],
"is_streaming": true
}
}
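
The recording above stores one `Completion` object per streamed chunk, each carrying a text fragment in `choices[0].text`. Only the last chunk has a non-empty `finish_reason` (`"length"`) and a `usage` block. A minimal sketch of reassembling the full completion from such a recording follows. The helper name `join_stream_text` is hypothetical, and the sample dict is a trimmed stand-in with the same shape rather than the full file.

```python
from typing import Optional, Tuple


def join_stream_text(recording: dict) -> Tuple[str, Optional[str]]:
    """Concatenate choices[0].text over all chunks of a recorded stream.

    Returns the joined text and the last non-empty finish_reason, or None
    if no chunk reported one.
    """
    parts = []
    finish_reason = None
    for chunk in recording["response"]["body"]:
        choice = chunk["__data__"]["choices"][0]
        parts.append(choice["text"])
        if choice["finish_reason"]:
            finish_reason = choice["finish_reason"]
    return "".join(parts), finish_reason


# Trimmed stand-in for the recording: two chunks instead of fifty.
sample = {
    "response": {
        "body": [
            {"__data__": {"choices": [{"finish_reason": "", "text": " several"}]}},
            {"__data__": {"choices": [{"finish_reason": "length", "text": " number"}]}},
        ],
        "is_streaming": True,
    }
}

text, reason = join_stream_text(sample)
print(repr(text), reason)  # ' several number' length
```

Against the real file, the same function could be called on the result of `json.load`. The recording holds 50 chunks, which matches `completion_tokens: 50` and the request's `max_tokens: 50`.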