llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-04 02:03:44 +00:00

History

Cesare Pompeiano 0dbf79c328 Some checks failed Integration Auth Tests / test-matrix (oauth2_token) (push) Failing after 4s Details Integration Tests (Replay) / Integration Tests (, , , client=, ) (push) Failing after 4s Details Test Llama Stack Build / build-single-provider (push) Failing after 3s Details Test Llama Stack Build / generate-matrix (push) Successful in 5s Details SqlStore Integration Tests / test-postgres (3.13) (push) Failing after 9s Details SqlStore Integration Tests / test-postgres (3.12) (push) Failing after 9s Details Test External Providers Installed via Module / test-external-providers-from-module (venv) (push) Has been skipped Details Python Package Build Test / build (3.12) (push) Failing after 1s Details Python Package Build Test / build (3.13) (push) Failing after 1s Details Vector IO Integration Tests / test-matrix (push) Failing after 9s Details Test Llama Stack Build / build-custom-container-distribution (push) Failing after 3s Details API Conformance Tests / check-schema-compatibility (push) Successful in 13s Details Test Llama Stack Build / build-ubi9-container-distribution (push) Failing after 4s Details Unit Tests / unit-tests (3.12) (push) Failing after 4s Details Unit Tests / unit-tests (3.13) (push) Failing after 3s Details Test External API and Providers / test-external (venv) (push) Failing after 5s Details Test Llama Stack Build / build (push) Failing after 31s Details UI Tests / ui-tests (22) (push) Successful in 46s Details Pre-commit / pre-commit (push) Successful in 2m13s Details fix: Fixed WatsonX remote inference provider (#3801 ) # What does this PR do? This PR fixes issues with the WatsonX provider so it works correctly with LiteLLM. The main problem was that WatsonX requests failed because the provider data validator didn’t properly handle the API key and project ID. This was fixed by updating the WatsonXProviderDataValidator and ensuring the provider data is loaded correctly. The openai_chat_completion method was also updated to match the behavior of other providers while adding WatsonX-specific fields like project_id. It still calls await super().openai_chat_completion.__func__(self, params) to keep the existing setup and tracing logic. After these changes, WatsonX requests now run correctly. ## Test Plan The changes were tested by running chat completion requests and confirming that credentials and project parameters are passed correctly. I have tested with my WatsonX credentials, by using the cli with `uv run llama-stack-client inference chat-completion --session` --------- Signed-off-by: Sébastien Han <seb@redhat.com> Co-authored-by: Sébastien Han <seb@redhat.com>		2025-10-14 14:52:32 +02:00
..
anthropic	feat: use SecretStr for inference provider auth credentials (#3724 )	2025-10-10 07:32:50 -07:00
azure	feat: use SecretStr for inference provider auth credentials (#3724 )	2025-10-10 07:32:50 -07:00
bedrock	feat(api)!: support extra_body to embeddings and vector_stores APIs (#3794 )	2025-10-12 19:01:52 -07:00
cerebras	feat(api)!: support extra_body to embeddings and vector_stores APIs (#3794 )	2025-10-12 19:01:52 -07:00
databricks	feat(api)!: BREAKING CHANGE: support passing `extra_body` through to providers (#3777 )	2025-10-10 16:21:44 -07:00
fireworks	feat: use SecretStr for inference provider auth credentials (#3724 )	2025-10-10 07:32:50 -07:00
gemini	feat: use SecretStr for inference provider auth credentials (#3724 )	2025-10-10 07:32:50 -07:00
groq	feat: use SecretStr for inference provider auth credentials (#3724 )	2025-10-10 07:32:50 -07:00
llama_openai_compat	feat(api)!: support extra_body to embeddings and vector_stores APIs (#3794 )	2025-10-12 19:01:52 -07:00
nvidia	feat(api)!: support extra_body to embeddings and vector_stores APIs (#3794 )	2025-10-12 19:01:52 -07:00
ollama	feat: use SecretStr for inference provider auth credentials (#3724 )	2025-10-10 07:32:50 -07:00
openai	feat: use SecretStr for inference provider auth credentials (#3724 )	2025-10-10 07:32:50 -07:00
passthrough	feat(api)!: support extra_body to embeddings and vector_stores APIs (#3794 )	2025-10-12 19:01:52 -07:00
runpod	feat(api)!: BREAKING CHANGE: support passing `extra_body` through to providers (#3777 )	2025-10-10 16:21:44 -07:00
sambanova	feat: use SecretStr for inference provider auth credentials (#3724 )	2025-10-10 07:32:50 -07:00
tgi	feat(api)!: support extra_body to embeddings and vector_stores APIs (#3794 )	2025-10-12 19:01:52 -07:00
together	feat(api)!: support extra_body to embeddings and vector_stores APIs (#3794 )	2025-10-12 19:01:52 -07:00
vertexai	feat: use SecretStr for inference provider auth credentials (#3724 )	2025-10-10 07:32:50 -07:00
vllm	feat(api)!: BREAKING CHANGE: support passing `extra_body` through to providers (#3777 )	2025-10-10 16:21:44 -07:00
watsonx	fix: Fixed WatsonX remote inference provider (#3801 )	2025-10-14 14:52:32 +02:00
__init__.py	`impls` -> `inline`, `adapters` -> `remote` (#381 )	2024-11-06 14:54:05 -08:00