Mirror of https://github.com/meta-llama/llama-stack.git (synced 2025-10-04 04:04:14 +00:00)
We tried to always keep Ollama enabled. However, doing so leaves the provider implementation half-assed: should it error when it cannot connect to Ollama or not? What happens during the periodic model refresh? And so on. Instead, do the same thing we do for vLLM: use the `OLLAMA_URL` environment variable to conditionally enable the provider (see the sketch below).

## Test Plan

Run `uv run llama stack build --template starter --image-type venv --run` with and without `OLLAMA_URL` set. Verify using `llama-stack-client provider list` that ollama is correctly enabled.
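A minimal sketch of the conditional-enablement idea described above. This is not the repository's actual implementation: the helper name `ollama_provider_entry` and the exact shape of the returned provider-spec dictionary are illustrative assumptions; only the gating on `OLLAMA_URL` reflects the change itself.

```python
import os


def ollama_provider_entry() -> dict | None:
    """Return an Ollama provider entry only when OLLAMA_URL is set.

    Hypothetical helper illustrating the commit's approach: instead of
    always registering the provider (and then deciding whether connection
    failures or model-refresh errors are fatal), skip registration
    entirely when the environment variable is absent.
    """
    url = os.environ.get("OLLAMA_URL")
    if not url:
        return None  # provider disabled: no connection attempts, no refresh errors
    return {
        "provider_id": "ollama",
        "provider_type": "remote::ollama",
        "config": {"url": url},
    }


if __name__ == "__main__":
    entry = ollama_provider_entry()
    print(entry if entry else "ollama provider disabled (OLLAMA_URL not set)")
```

Gating on the environment variable keeps the failure semantics simple: an unset `OLLAMA_URL` means the provider never appears in `provider list` at all, rather than appearing in a half-working state whose error behavior is ambiguous.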
Directory listing:

- ci-tests
- dell
- meta-reference-gpu
- nvidia
- open-benchmark
- postgres-demo
- starter
- watsonx
- __init__.py
- template.py