llama-stack-mirror/llama_stack/distribution
Ashwin Bharambe 81ce39a607
feat(api): Add options for supporting various embedding models (#1192)
We need to support:
- asymmetric embedding models (#934)
- truncation policies (#933)
- varying dimensional output (#932) 

## Test Plan

```bash
$ cd llama_stack/providers/tests/inference
$ pytest -s -v -k fireworks test_embeddings.py \
   --inference-model nomic-ai/nomic-embed-text-v1.5 --env EMBEDDING_DIMENSION=784
$  pytest -s -v -k together test_embeddings.py \
   --inference-model togethercomputer/m2-bert-80M-8k-retrieval --env EMBEDDING_DIMENSION=784
$ pytest -s -v -k ollama test_embeddings.py \
   --inference-model all-minilm:latest --env EMBEDDING_DIMENSION=784
```
2025-02-20 22:27:12 -08:00
..
routers feat(api): Add options for supporting various embedding models (#1192) 2025-02-20 22:27:12 -08:00
server fix: some telemetry APIs don't currently work (#1188) 2025-02-20 14:09:25 -08:00
store build: format codebase imports using ruff linter (#1028) 2025-02-13 10:06:21 -08:00
ui fix!: update eval-tasks -> benchmarks (#1032) 2025-02-13 16:40:58 -08:00
utils fix: Fixing some small issues with the build scripts (#1132) 2025-02-19 22:20:49 -08:00
__init__.py API Updates (#73) 2024-09-17 19:51:35 -07:00
build.py fix: improve stack build on venv (#980) 2025-02-14 09:22:03 -08:00
build_conda_env.sh feat: Add a new template for dell (#978) 2025-02-06 14:14:39 -08:00
build_container.sh Fix uv pip install timeout issue for PyTorch (#929) 2025-02-03 06:39:35 -08:00
build_venv.sh style: env var in build_venv 2025-02-19 22:32:59 -08:00
client.py chore: move all Llama Stack types from llama-models to llama-stack (#1098) 2025-02-14 09:10:59 -08:00
common.sh fix: Fixing some small issues with the build scripts (#1132) 2025-02-19 22:20:49 -08:00
configure.py build: format codebase imports using ruff linter (#1028) 2025-02-13 10:06:21 -08:00
configure_container.sh More generic image type for OCI-compliant container technologies (#802) 2025-01-17 16:37:42 -08:00
datatypes.py fix!: update eval-tasks -> benchmarks (#1032) 2025-02-13 16:40:58 -08:00
distribution.py fix!: update eval-tasks -> benchmarks (#1032) 2025-02-13 16:40:58 -08:00
inspect.py fix: improve signal handling and update dependencies (#1044) 2025-02-13 08:07:59 -08:00
library_client.py style: remove prints in codebase (#1146) 2025-02-18 19:41:37 -08:00
request_headers.py Add X-LlamaStack-Client-Version, rename ProviderData -> Provider-Data (#735) 2025-01-09 11:51:36 -08:00
resolver.py fix!: update eval-tasks -> benchmarks (#1032) 2025-02-13 16:40:58 -08:00
stack.py feat: adding endpoints for files and uploads (#1070) 2025-02-20 13:09:00 -08:00
start_conda_env.sh feat: Add HTTPS serving option (#1000) 2025-02-07 09:39:08 -08:00
start_container.sh feat: Add HTTPS serving option (#1000) 2025-02-07 09:39:08 -08:00
start_venv.sh feat: add support for running in a venv (#1018) 2025-02-12 11:13:04 -05:00