llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-03 09:53:45 +00:00

History

Charlie Doern d5cd0eea14 feat!: standardize base_url for inference (#4177 ) # What does this PR do? Completes #3732 by removing runtime URL transformations and requiring users to provide full URLs in configuration. All providers now use 'base_url' consistently and respect the exact URL provided without appending paths like /v1 or /openai/v1 at runtime. BREAKING CHANGE: Users must update configs to include full URL paths (e.g., http://localhost:11434/v1 instead of http://localhost:11434). Closes #3732 ## Test Plan Existing tests should pass even with the URL changes, due to default URLs being altered. Add unit test to enforce URL standardization across remote inference providers (verifies all use 'base_url' field with HttpUrl \| None type) Signed-off-by: Charlie Doern <cdoern@redhat.com>		2025-11-19 08:44:28 -08:00
..
__init__.py	chore(package): migrate to src/ layout (#3920 )	2025-10-27 12:02:21 -07:00
build.yaml	chore(package): migrate to src/ layout (#3920 )	2025-10-27 12:02:21 -07:00
doc_template.md	docs: add documentation on how to use custom run yaml in docker (#3949 )	2025-10-28 16:05:44 -07:00
nvidia.py	chore(package): migrate to src/ layout (#3920 )	2025-10-27 12:02:21 -07:00
run-with-safety.yaml	feat!: standardize base_url for inference (#4177 )	2025-11-19 08:44:28 -08:00
run.yaml	feat!: standardize base_url for inference (#4177 )	2025-11-19 08:44:28 -08:00