llama-stack-mirror/llama_stack/providers/remote/inference
mergify[bot] 0899f78943
fix: Avoid model_limits KeyError (backport #4060) (#4283)
# What does this PR do?
It avoids model_limit KeyError while trying to get embedding models for
Watsonx


Closes https://github.com/llamastack/llama-stack/issues/4059

## Test Plan

Start server with watsonx distro:
```bash
llama stack list-deps watsonx | xargs -L1 uv pip install
uv run llama stack run watsonx
```
Run 
```python
client = LlamaStackClient(base_url=base_url)
client.models.list()
```
Check if there is any embedding model available (currently there is not
a single one)<hr>This is an automatic backport of pull request #4060
done by [Mergify](https://mergify.com).

Co-authored-by: Wojciech-Rebisz <147821486+Wojciech-Rebisz@users.noreply.github.com>
2025-12-03 10:56:24 +01:00
..
anthropic feat: use SecretStr for inference provider auth credentials (#3724) 2025-10-10 07:32:50 -07:00
azure feat: use SecretStr for inference provider auth credentials (#3724) 2025-10-10 07:32:50 -07:00
bedrock feat(api)!: support extra_body to embeddings and vector_stores APIs (#3794) 2025-10-12 19:01:52 -07:00
cerebras feat(api)!: support extra_body to embeddings and vector_stores APIs (#3794) 2025-10-12 19:01:52 -07:00
databricks feat(api)!: BREAKING CHANGE: support passing extra_body through to providers (#3777) 2025-10-10 16:21:44 -07:00
fireworks feat: use SecretStr for inference provider auth credentials (#3724) 2025-10-10 07:32:50 -07:00
gemini feat(gemini): Support gemini-embedding-001 and fix models/ prefix in metadata keys (#3813) 2025-10-15 12:22:10 -04:00
groq feat: use SecretStr for inference provider auth credentials (#3724) 2025-10-10 07:32:50 -07:00
llama_openai_compat feat(api)!: support extra_body to embeddings and vector_stores APIs (#3794) 2025-10-12 19:01:52 -07:00
nvidia chore: remove build.py (#3869) 2025-10-20 16:28:15 -07:00
ollama feat: use SecretStr for inference provider auth credentials (#3724) 2025-10-10 07:32:50 -07:00
openai feat: use SecretStr for inference provider auth credentials (#3724) 2025-10-10 07:32:50 -07:00
passthrough feat(api)!: support extra_body to embeddings and vector_stores APIs (#3794) 2025-10-12 19:01:52 -07:00
runpod feat(api)!: BREAKING CHANGE: support passing extra_body through to providers (#3777) 2025-10-10 16:21:44 -07:00
sambanova feat: use SecretStr for inference provider auth credentials (#3724) 2025-10-10 07:32:50 -07:00
tgi feat(api)!: support extra_body to embeddings and vector_stores APIs (#3794) 2025-10-12 19:01:52 -07:00
together feat(api)!: support extra_body to embeddings and vector_stores APIs (#3794) 2025-10-12 19:01:52 -07:00
vertexai fix: update hard-coded google model names (backport #4212) (#4229) 2025-11-24 11:32:14 -08:00
vllm feat(api)!: BREAKING CHANGE: support passing extra_body through to providers (#3777) 2025-10-10 16:21:44 -07:00
watsonx fix: Avoid model_limits KeyError (backport #4060) (#4283) 2025-12-03 10:56:24 +01:00
__init__.py impls -> inline, adapters -> remote (#381) 2024-11-06 14:54:05 -08:00