llama-stack-mirror/src/llama_stack/providers/remote/inference
Charlie Doern d5cd0eea14
feat!: standardize base_url for inference (#4177)
# What does this PR do?

Completes #3732 by removing runtime URL transformations and requiring
users to provide full URLs in configuration. All providers now use
'base_url' consistently and respect the exact URL provided without
appending paths like /v1 or /openai/v1 at runtime.

BREAKING CHANGE: Users must update configs to include full URL paths
(e.g., http://localhost:11434/v1 instead of http://localhost:11434).
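
As a rough illustration of the config migration described above, the following sketch rewrites an old-style URL into the full form. The helper name `with_explicit_path` and its default suffix are hypothetical and not part of this PR; the correct suffix depends on the provider (the PR mentions both /v1 and /openai/v1).

```python
from urllib.parse import urlsplit, urlunsplit


def with_explicit_path(old_url: str, suffix: str = "/v1") -> str:
    """Hypothetical migration helper (not part of llama-stack).

    Appends `suffix` to the URL path unless the path already ends with it,
    so the configured value carries the full path that was previously
    added at runtime.
    """
    parts = urlsplit(old_url)
    # Drop any trailing slash so we never produce "//v1".
    path = parts.path.rstrip("/")
    if not path.endswith(suffix):
        path += suffix
    return urlunsplit((parts.scheme, parts.netloc, path, parts.query, parts.fragment))


if __name__ == "__main__":
    # Old-style config value without a path.
    print(with_explicit_path("http://localhost:11434"))
    # Already-migrated value is left unchanged.
    print(with_explicit_path("http://localhost:11434/v1"))
```

Both calls print `http://localhost:11434/v1`, so the helper is safe to run over configs that were already updated.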

Closes #3732 

## Test Plan

Existing tests should still pass despite the URL changes, because the
default URLs were updated to match.

Adds a unit test that enforces URL standardization across remote inference
providers: it verifies that all of them use a 'base_url' field of type
HttpUrl | None.

Signed-off-by: Charlie Doern <cdoern@redhat.com>
2025-11-19 08:44:28 -08:00
anthropic fix: rename llama_stack_api dir (#4155) 2025-11-13 15:04:36 -08:00
azure feat!: standardize base_url for inference (#4177) 2025-11-19 08:44:28 -08:00
bedrock fix: rename llama_stack_api dir (#4155) 2025-11-13 15:04:36 -08:00
cerebras feat!: standardize base_url for inference (#4177) 2025-11-19 08:44:28 -08:00
databricks feat!: standardize base_url for inference (#4177) 2025-11-19 08:44:28 -08:00
fireworks feat!: standardize base_url for inference (#4177) 2025-11-19 08:44:28 -08:00
gemini fix: rename llama_stack_api dir (#4155) 2025-11-13 15:04:36 -08:00
groq feat!: standardize base_url for inference (#4177) 2025-11-19 08:44:28 -08:00
llama_openai_compat feat!: standardize base_url for inference (#4177) 2025-11-19 08:44:28 -08:00
nvidia feat!: standardize base_url for inference (#4177) 2025-11-19 08:44:28 -08:00
oci fix: rename llama_stack_api dir (#4155) 2025-11-13 15:04:36 -08:00
ollama feat!: standardize base_url for inference (#4177) 2025-11-19 08:44:28 -08:00
openai feat!: standardize base_url for inference (#4177) 2025-11-19 08:44:28 -08:00
passthrough feat!: standardize base_url for inference (#4177) 2025-11-19 08:44:28 -08:00
runpod feat!: standardize base_url for inference (#4177) 2025-11-19 08:44:28 -08:00
sambanova feat!: standardize base_url for inference (#4177) 2025-11-19 08:44:28 -08:00
tgi feat!: standardize base_url for inference (#4177) 2025-11-19 08:44:28 -08:00
together feat!: standardize base_url for inference (#4177) 2025-11-19 08:44:28 -08:00
vertexai fix: rename llama_stack_api dir (#4155) 2025-11-13 15:04:36 -08:00
vllm feat!: standardize base_url for inference (#4177) 2025-11-19 08:44:28 -08:00
watsonx feat!: standardize base_url for inference (#4177) 2025-11-19 08:44:28 -08:00
__init__.py chore(package): migrate to src/ layout (#3920) 2025-10-27 12:02:21 -07:00