refactor structure

2025-12-21 22:22:25 +00:00 · 2024-10-29 14:04:41 -07:00 · 2024-10-29 14:04:41 -07:00 · 42104361a3
commit 42104361a3
parent 9ddc28eca7
13 changed files with 293 additions and 562 deletions
--- a/distributions/ollama/README.md
+++ b/distributions/ollama/README.md
@@ -92,6 +92,19 @@ llama stack run ./gpu/run.yaml

 ### Model Serving

+#### Downloading model via Ollama
+
+You can use `ollama` to download and manage models.
+
+```
+ollama pull llama3.1:8b-instruct-fp16
+ollama pull llama3.1:70b-instruct-fp16
+```
+
+> [!NOTE]
+> See [OLLAMA_SUPPORTED_MODELS](https://github.com/meta-llama/llama-stack/blob/main/llama_stack/providers/adapters/inference/ollama/ollama.py) for the list of supported Ollama models.
+
+
 To serve a new model with `ollama`
 ```
 ollama run <model_name>