llama-stack-mirror/docs/source
Ben Browning 8ede67b809 More work on file_search verification test
This gets the file_search verification test working against Ollama,
Fireworks, and api.openai.com. Llama Stack does not implement the
entire vector store API yet, so the test still contains a small hack
that switches between using only OpenAI-compatible APIs and using the
LlamaStackClient to insert content into our vector stores.

Apart from inserting file contents, the test works the same for all
of these providers and uses only the OpenAI client.
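
A rough sketch of what that switch could look like, not the code in
the test itself. The helper names, and the assumption that anything
other than api.openai.com is a Llama Stack server exposing its
OpenAI-compatible API under /v1/openai/v1, are hypothetical and only
mirror the base URLs used in the commands below:

```python
from urllib.parse import urlparse

# Assumption: the OpenAI-compatible API of a Llama Stack server lives
# under this suffix, as in the --base-url values used in this message.
OPENAI_COMPAT_SUFFIX = "/v1/openai/v1"


def inserts_via_llama_stack(base_url: str) -> bool:
    """Hypothetical helper: decide how file contents get inserted.

    Returns True when the LlamaStackClient should insert content into
    the vector store, False when only OpenAI-compatible APIs are used.
    """
    host = urlparse(base_url).hostname or ""
    # Assumption: only api.openai.com is treated as "OpenAI directly".
    return host != "api.openai.com"


def llama_stack_server_url(base_url: str) -> str:
    """Hypothetical helper: strip the OpenAI-compatible suffix to get
    the server root that a native Llama Stack client would talk to."""
    if base_url.endswith(OPENAI_COMPAT_SUFFIX):
        return base_url[: -len(OPENAI_COMPAT_SUFFIX)]
    return base_url
```

With a check like this, only the insertion step would branch; creating
the vector store and calling the Responses API could stay on the
OpenAI client for every provider.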

How to run the tests:

Ollama (the test is sometimes flaky with a small model):

```
ollama run llama3.2:3b

INFERENCE_MODEL="meta-llama/Llama-3.2-3B-Instruct" \
llama stack run ./llama_stack/templates/ollama/run.yaml \
  --image-type venv \
  --env OLLAMA_URL="http://0.0.0.0:11434"

pytest -sv \
  'tests/verifications/openai_api/test_responses.py::test_response_non_streaming_file_search' \
  --base-url=http://localhost:8321/v1/openai/v1 \
  --model meta-llama/Llama-3.2-3B-Instruct
```

Fireworks via Llama Stack:

```
llama stack run llama_stack/templates/fireworks/run.yaml

pytest -sv \
  'tests/verifications/openai_api/test_responses.py::test_response_non_streaming_file_search' \
  --base-url=http://localhost:8321/v1/openai/v1 \
  --model meta-llama/Llama-3.3-70B-Instruct
```

OpenAI directly:

```
pytest -sv \
  'tests/verifications/openai_api/test_responses.py::test_response_non_streaming_file_search' \
  --base-url=https://api.openai.com/v1 \
  --model gpt-4o
```

Signed-off-by: Ben Browning <bbrownin@redhat.com>
2025-06-13 09:36:04 -04:00
building_applications docs(mcp): add a few lines for how to specify Auth headers in MCP tools (#2336) 2025-06-02 14:28:38 -07:00
concepts docs(mcp): add a few lines for how to specify Auth headers in MCP tools (#2336) 2025-06-02 14:28:38 -07:00
contributing docs: revamp testing documentation (#2155) 2025-05-13 11:28:29 -07:00
distributions More work on file_search verification test 2025-06-13 09:36:04 -04:00
getting_started docs: Add missing dependencies in quickstart demo command (#2347) 2025-06-03 18:01:36 +02:00
introduction docs: Remove mentions of focus on Llama models (#1690) 2025-03-19 00:17:22 -04:00
openai docs: Add OpenAI API compatibility page (#2316) 2025-06-04 06:51:52 -04:00
playground chore: simplify running the demo UI (#1907) 2025-04-09 11:22:29 -07:00
providers docs: add post training to providers list (#2280) 2025-05-28 09:32:00 -04:00
references chore: remove last instances of code-interpreter provider (#2143) 2025-05-12 10:54:43 -07:00
conf.py fix: use pypi browser agent (#2260) 2025-05-24 23:26:30 -07:00
index.md docs: Add OpenAI API compatibility page (#2316) 2025-06-04 06:51:52 -04:00