mirror of
https://github.com/meta-llama/llama-stack.git
synced 2025-12-26 13:08:03 +00:00
refactor: set proper name for embedding all-minilm:l6-v2 model
- we are using all-minilm:l6-v2 but the model we download from ollama is all-minilm:latest - even currently they are exactly the same model but if l12-v2 is updated, "latest" might not still valid. - the only change in this PR is pin the model from ollama Signed-off-by: Wen Zhou <wenzhou@redhat.com>
This commit is contained in:
parent
ea966565f6
commit
214b1fe1ae
4 changed files with 5 additions and 5 deletions
|
|
@ -7,7 +7,7 @@ FROM --platform=linux/amd64 ollama/ollama:latest
|
|||
RUN ollama serve & \
|
||||
sleep 5 && \
|
||||
ollama pull llama3.2:3b-instruct-fp16 && \
|
||||
ollama pull all-minilm:latest
|
||||
ollama pull all-minilm:l6-v2
|
||||
|
||||
# Set the entrypoint to start ollama serve
|
||||
ENTRYPOINT ["ollama", "serve"]
|
||||
|
|
|
|||
Loading…
Add table
Add a link
Reference in a new issue