phoenix-oss/llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-27 19:02:01 +00:00

History

Ben Browning fa34468308 feat: File search tool for Responses API This is an initial working prototype of wiring up the `file_search` builtin tool for the Responses API to our existing rag knowledge search tool. I stubbed in a new test (that uses a hardcoded url hybrid of the OpenAI and Llama Stack clients for now, only until we finish landing the vector store APIs and insertion support). Note that this is currently under tests/verification only because it sometimes flakes with tool calling of the small Llama-3.2-3B model we run in CI (and that I use as an example below). We'd want to make the test a bit more robust in some way if we moved this over to tests/integration and ran it in CI. ``` ollama run llama3.2:3b INFERENCE_MODEL="meta-llama/Llama-3.2-3B-Instruct" \ llama stack run ./llama_stack/templates/ollama/run.yaml \ --image-type venv \ --env OLLAMA_URL="http://0.0.0.0:11434" pytest -sv 'tests/verifications/openai_api/test_responses.py::test_response_non_streaming_file_search' \ --base-url=http://localhost:8321/v1/openai/v1 \ --model meta-llama/Llama-3.2-3B-Instruct ``` Signed-off-by: Ben Browning <bbrownin@redhat.com>		2025-06-13 09:36:04 -04:00
..
client-sdk/post_training	feat: Add nemo customizer (#1448 )	2025-03-25 11:01:10 -07:00
common	feat(responses): implement full multi-turn support (#2295 )	2025-06-02 15:35:49 -07:00
external-provider/llama-stack-provider-ollama	chore: mark blobpath as optional (#2271 )	2025-05-27 10:55:24 +02:00
integration	feat: update openai tests to work with both clients (#2442 )	2025-06-12 16:30:23 -07:00
unit	feat(auth): allow token to be provided for use against jwks endpoint (#2394 )	2025-06-13 10:13:41 +02:00
verifications	feat: File search tool for Responses API	2025-06-13 09:36:04 -04:00
__init__.py	refactor(test): introduce --stack-config and simplify options (#1404 )	2025-03-05 17:02:02 -08:00
Containerfile	ci: use ollama container image with loaded models (#2410 )	2025-06-06 12:08:20 +02:00
README.md	docs: revamp testing documentation (#2155 )	2025-05-13 11:28:29 -07:00

README.md

Llama Stack Tests

Llama Stack has multiple layers of testing done to ensure continuous functionality and prevent regressions to the codebase.

Testing Type	Details
Unit	unit/README.md
Integration	integration/README.md
Verification	verifications/README.md