mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-17 06:12:36 +00:00


Llama Stack Unit Tests

Unit Tests

Unit tests verify individual components and functions in isolation. They are fast, reliable, and don't require external services.

Prerequisites

Python Environment: Ensure you have Python 3.12+ installed
uv Package Manager: Install uv if not already installed

Run the unit tests with:

./scripts/unit-tests.sh [PYTEST_ARGS]

Any additional arguments are passed to pytest. You can specify a test directory, a specific test file, or any pytest flags (e.g., -vvv for verbosity). If no test directory is specified, it defaults to tests/unit. For example, to run a single test file with verbose output:

./scripts/unit-tests.sh tests/unit/registry/test_registry.py -vvv

To run the tests against a Python version other than the default (currently 3.12), set the PYTHON_VERSION environment variable:

source .venv/bin/activate
PYTHON_VERSION=3.13 ./scripts/unit-tests.sh

Test Configuration

Test Discovery: Tests are automatically discovered in the tests/unit/ directory
Async Support: Tests use --asyncio-mode=auto for automatic async test handling
Coverage: Tests generate coverage reports in the htmlcov/ directory
Python Version: Defaults to Python 3.12, but can be overridden with the PYTHON_VERSION environment variable
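
As an illustration of the async support noted above, here is a minimal sketch of an async unit test. In auto mode, pytest-asyncio collects coroutine test functions without an explicit `@pytest.mark.asyncio` marker. The names `fetch_value` and `test_fetch_value_returns_expected_result` are hypothetical and do not come from the Llama Stack codebase.

```python
import asyncio


async def fetch_value(delay: float = 0.0) -> int:
    """Hypothetical coroutine standing in for real async code under test."""
    await asyncio.sleep(delay)
    return 42


async def test_fetch_value_returns_expected_result():
    # Under --asyncio-mode=auto, pytest-asyncio runs this coroutine as a
    # test; no @pytest.mark.asyncio decorator is needed.
    assert await fetch_value() == 42
```

A file containing a test like this, saved under tests/unit/, should be picked up by the normal test discovery described above.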

Coverage Reports

After running tests, you can view coverage reports:

# Open HTML coverage report in browser
open htmlcov/index.html  # macOS
xdg-open htmlcov/index.html  # Linux
start htmlcov/index.html  # Windows
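
If you prefer a single command that works on all three platforms, Python's standard `webbrowser` module can open the same report. This is a small sketch, not part of the repository's scripts; the helper names `coverage_report_uri` and `open_coverage_report` are hypothetical.

```python
import webbrowser
from pathlib import Path


def coverage_report_uri(report_dir: str = "htmlcov") -> str:
    """Return a file:// URI for the HTML coverage index page."""
    # resolve() makes the path absolute, which as_uri() requires.
    return (Path(report_dir) / "index.html").resolve().as_uri()


def open_coverage_report(report_dir: str = "htmlcov") -> bool:
    """Open the coverage report in the default browser."""
    return webbrowser.open(coverage_report_uri(report_dir))
```

Call `open_coverage_report()` from the repository root after a test run, since the default path is relative to the current working directory.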