llama-stack/llama_stack/providers/utils/inference
Dinesh Yeduguru 516e1a3e59
add embedding model by default to distribution templates (#617)
# What does this PR do?
Adds the sentence transformer provider and the `all-MiniLM-L6-v2`
embedding model to the default models to register in the run.yaml for
all providers.

## Test Plan
llama stack build --template together --image-type conda
llama stack run
~/.llama/distributions/llamastack-together/together-run.yaml
2024-12-13 12:48:00 -08:00
..
__init__.py Added support for llama 3.3 model (#601) 2024-12-10 20:03:31 -08:00
embedding_mixin.py Make embedding generation go through inference (#606) 2024-12-12 11:47:50 -08:00
model_registry.py add embedding model by default to distribution templates (#617) 2024-12-13 12:48:00 -08:00
openai_compat.py Enable vision models for (Together, Fireworks, Meta-Reference, Ollama) (#376) 2024-11-05 16:22:33 -08:00
prompt_adapter.py use logging instead of prints (#499) 2024-11-21 11:32:53 -08:00