llama-stack-mirror/llama_stack/templates
Ashwin Bharambe dc84bc755a
fix: revert to using faiss for ollama distro (#1530)
This is unfortunate because `sqlite-vec` seems promising, but its pip
package is not quite complete. It does not have a binary for arm64 (I
think, or maybe it even lacks 64-bit builds?), so the arm64
container fails with:
```
File "/usr/local/lib/python3.10/site-packages/sqlite_vec/__init__.py", line 17, in load
    conn.load_extension(loadable_path())
sqlite3.OperationalError: /usr/local/lib/python3.10/site-packages/sqlite_vec/vec0.so: wrong ELF class: ELFCLASS32
```
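
The `wrong ELF class: ELFCLASS32` message means the loader found a 32-bit shared object where it expected a 64-bit one. As a rough way to confirm this for a given file, the sketch below reads the `EI_CLASS` byte (offset 4 of the ELF header, where 1 means 32-bit and 2 means 64-bit). The `elf_class` helper name is hypothetical and not part of this patch.

```shell
# Hypothetical helper: print the ELF class of a file by inspecting its header.
elf_class() {
    # The first four bytes of any ELF file are 0x7f 'E' 'L' 'F'.
    magic=$(od -An -tx1 -N4 "$1" | tr -d ' \n')
    if [ "$magic" != "7f454c46" ]; then
        echo "not-elf"
        return 1
    fi
    # EI_CLASS lives at offset 4: 1 = ELFCLASS32, 2 = ELFCLASS64.
    class_byte=$(od -An -tu1 -j4 -N1 "$1" | tr -d ' \n')
    case "$class_byte" in
        1) echo "ELFCLASS32" ;;
        2) echo "ELFCLASS64" ;;
        *) echo "unknown" ;;
    esac
}
```

Inside the container, this could be pointed at the path from the traceback, for example `elf_class /usr/local/lib/python3.10/site-packages/sqlite_vec/vec0.so`, and compared against the output of `uname -m`.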

To get around this, I tried to install from source via
`uv pip install sqlite-vec --no-binary=sqlite-vec`. However, the package
lacks a source distribution as well, which makes that impossible.
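
One way to check up front whether a package publishes a source distribution is to look at its PyPI JSON metadata, where each uploaded file carries a `packagetype` field (`sdist` or `bdist_wheel`). The sketch below assumes the metadata has already been saved to a local file; the `has_sdist` helper and the `meta.json` file name are hypothetical. Note that it matches an sdist from any release listed in the file, not only the latest one.

```shell
# Hypothetical helper: succeed if the PyPI JSON metadata in file "$1"
# lists at least one file whose packagetype is "sdist".
has_sdist() {
    grep -Eq '"packagetype":[[:space:]]*"sdist"' "$1"
}

# Example usage (requires network access, so it is left commented out):
#   curl -s https://pypi.org/pypi/sqlite-vec/json -o meta.json
#   if has_sdist meta.json; then echo "sdist available"; else echo "wheels only"; fi
```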

## Test Plan

Build the container locally using: 

```bash
LLAMA_STACK_DIR=. llama stack build --template ollama --image-type container
```

Run the container as: 

```bash
podman run --privileged -it -p $LLAMA_STACK_PORT:$LLAMA_STACK_PORT \
    -v ~/.llama:/root/.llama \
    --env INFERENCE_MODEL=$INFERENCE_MODEL \
    --env OLLAMA_URL=http://host.containers.internal:11434 \
    -v ~/local/llama-stack:/app/llama-stack-source \
    localhost/distribution-ollama:dev --port $LLAMA_STACK_PORT
```
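
The run command relies on `LLAMA_STACK_PORT` and `INFERENCE_MODEL` being set in the calling shell; if either is empty, the port mapping or model argument silently expands to nothing. A small guard along these lines could catch that before starting the container. The `require_env` helper is hypothetical and not part of the repository.

```shell
# Hypothetical helper: fail early if any named environment variable is unset or empty.
require_env() {
    for name in "$@"; do
        # Indirect lookup via eval keeps this portable to plain POSIX sh.
        eval "value=\${$name:-}"
        if [ -z "$value" ]; then
            echo "missing required variable: $name" >&2
            return 1
        fi
    done
}
```

It can be chained in front of the run step, for example `require_env LLAMA_STACK_PORT INFERENCE_MODEL && podman run ...`.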

Verify the container starts up correctly. Without this patch, it would
encounter the ELFCLASS32 error.
2025-03-10 16:15:17 -07:00

| Name | Last commit | Date |
| --- | --- | --- |
| bedrock | refactor(test): unify vector_io tests and make them configurable (#1398) | 2025-03-04 13:37:45 -08:00 |
| cerebras | refactor(test): unify vector_io tests and make them configurable (#1398) | 2025-03-04 13:37:45 -08:00 |
| ci-tests | refactor(test): unify vector_io tests and make them configurable (#1398) | 2025-03-04 13:37:45 -08:00 |
| dell | chore: remove straggler references to llama-models (#1345) | 2025-03-01 14:26:03 -08:00 |
| dev | refactor(test): unify vector_io tests and make them configurable (#1398) | 2025-03-04 13:37:45 -08:00 |
| experimental-post-training | feat: [post training] support save hf safetensor format checkpoint (#845) | 2025-02-25 23:29:08 -08:00 |
| fireworks | refactor(test): move tools, evals, datasetio, scoring and post training tests (#1401) | 2025-03-04 14:53:47 -08:00 |
| groq | fix: register provider model name and HF alias in run.yaml (#1304) | 2025-02-27 16:39:23 -08:00 |
| hf-endpoint | refactor(test): unify vector_io tests and make them configurable (#1398) | 2025-03-04 13:37:45 -08:00 |
| hf-serverless | refactor(test): unify vector_io tests and make them configurable (#1398) | 2025-03-04 13:37:45 -08:00 |
| meta-reference-gpu | refactor(test): unify vector_io tests and make them configurable (#1398) | 2025-03-04 13:37:45 -08:00 |
| meta-reference-quantized-gpu | refactor(test): unify vector_io tests and make them configurable (#1398) | 2025-03-04 13:37:45 -08:00 |
| nvidia | fix: register provider model name and HF alias in run.yaml (#1304) | 2025-02-27 16:39:23 -08:00 |
| ollama | fix: revert to using faiss for ollama distro (#1530) | 2025-03-10 16:15:17 -07:00 |
| open-benchmark | fix: update the open benchmark eval doc (#1497) | 2025-03-07 15:05:27 -08:00 |
| passthrough | feat: inference passthrough provider (#1166) | 2025-02-19 21:47:00 -08:00 |
| remote-vllm | refactor(test): move tools, evals, datasetio, scoring and post training tests (#1401) | 2025-03-04 14:53:47 -08:00 |
| sambanova | refactor(test): unify vector_io tests and make them configurable (#1398) | 2025-03-04 13:37:45 -08:00 |
| tgi | refactor(test): unify vector_io tests and make them configurable (#1398) | 2025-03-04 13:37:45 -08:00 |
| together | refactor(test): move tools, evals, datasetio, scoring and post training tests (#1401) | 2025-03-04 14:53:47 -08:00 |
| vllm-gpu | feat: updated inline vllm inference provider (#880) | 2025-03-07 13:38:23 -08:00 |
| `__init__.py` | Auto-generate distro yamls + docs (#468) | 2024-11-18 14:57:06 -08:00 |
| template.py | refactor(test): unify vector_io tests and make them configurable (#1398) | 2025-03-04 13:37:45 -08:00 |