llama-stack-mirror/tests/unit
Ashwin Bharambe db6488b379 fix: enforce allowed_models during inference requests
The `allowed_models` configuration was only filtering the model list endpoint but not enforcing restrictions during actual inference requests. This allowed users to bypass the restriction by directly requesting models not in the allowed list, potentially accessing expensive models when only cheaper ones were intended.

This change adds validation to all inference methods (`openai_chat_completion`, `openai_completion`, `openai_embeddings`) to reject requests for disallowed models with a clear error message.

**Implementation:**
- Added `_validate_model_allowed()` helper method that checks if a model is in the `allowed_models` list
- Called validation in all three inference methods before making API requests
- Validation occurs after resolving the provider model ID to ensure consistency (a rough sketch follows this list)
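
As a rough illustration, the helper might look like the sketch below. Only `_validate_model_allowed` and `allowed_models` come from the change description; the class name and everything else are hypothetical stand-ins, not the actual provider mixin:

class AllowedModelsGuard:
    """Hypothetical stand-in for the provider class that gained _validate_model_allowed()."""

    def __init__(self, allowed_models: list[str] | None = None):
        # None means "no restriction"; an empty list blocks every model.
        self.allowed_models = allowed_models

    def _validate_model_allowed(self, provider_model_id: str) -> None:
        # Called by openai_chat_completion, openai_completion, and openai_embeddings
        # after the provider model ID has been resolved, before any API request is made.
        if self.allowed_models is not None and provider_model_id not in self.allowed_models:
            raise ValueError(
                f"Model '{provider_model_id}' is not in the allowed_models list "
                f"for this provider ({self.allowed_models})"
            )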

**Test Plan:**
- Added unit tests verifying all inference methods respect `allowed_models`
- Tests cover allowed models (success), disallowed models (rejection), and the restriction edge cases (None allows all models, an empty list blocks all); a sketch follows after this list
- All existing tests continue to pass
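
A test along these lines, written against the hypothetical AllowedModelsGuard sketch above rather than the actual provider classes, might look like:

import pytest

def test_allowed_model_passes():
    guard = AllowedModelsGuard(allowed_models=["small-model"])
    guard._validate_model_allowed("small-model")  # in the list: no error raised

def test_disallowed_model_is_rejected():
    guard = AllowedModelsGuard(allowed_models=["small-model"])
    with pytest.raises(ValueError):
        guard._validate_model_allowed("expensive-model")  # not in the list

def test_none_allows_all_and_empty_list_blocks_all():
    # None disables the restriction entirely; an empty list rejects everything.
    AllowedModelsGuard(allowed_models=None)._validate_model_allowed("any-model")
    with pytest.raises(ValueError):
        AllowedModelsGuard(allowed_models=[])._validate_model_allowed("any-model")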

Fixes GHSA-5rjj-4jp6-fw39
2025-11-19 12:12:28 -08:00
cli fix: generate provider config when using --providers (#4044) 2025-11-03 11:37:58 -08:00
conversations chore: add storage sane defaults (#4182) 2025-11-18 15:22:26 -08:00
core chore: add storage sane defaults (#4182) 2025-11-18 15:22:26 -08:00
distribution fix: MCP authorization parameter implementation (#4052) 2025-11-14 08:54:42 -08:00
files refactor(storage): make { kvstore, sqlstore } as llama stack "internal" APIs (#4181) 2025-11-18 13:15:16 -08:00
models refactor: remove dead inference API code and clean up imports (#4093) 2025-11-10 15:29:24 -08:00
prompts/prompts refactor(storage): make { kvstore, sqlstore } as llama stack "internal" APIs (#4181) 2025-11-18 13:15:16 -08:00
providers fix: enforce allowed_models during inference requests 2025-11-19 12:12:28 -08:00
rag fix: rename llama_stack_api dir (#4155) 2025-11-13 15:04:36 -08:00
registry refactor(storage): make { kvstore, sqlstore } as llama stack "internal" APIs (#4181) 2025-11-18 13:15:16 -08:00
server refactor(storage): make { kvstore, sqlstore } as llama stack "internal" APIs (#4181) 2025-11-18 13:15:16 -08:00
tools fix: rename llama_stack_api dir (#4155) 2025-11-13 15:04:36 -08:00
utils refactor(storage): make { kvstore, sqlstore } as llama stack "internal" APIs (#4181) 2025-11-18 13:15:16 -08:00
__init__.py chore: Add fixtures to conftest.py (#2067) 2025-05-06 13:57:48 +02:00
conftest.py test: suppress expected error logs in SSE test (#3886) 2025-10-22 14:34:32 -07:00
fixtures.py refactor(storage): make { kvstore, sqlstore } as llama stack "internal" APIs (#4181) 2025-11-18 13:15:16 -08:00
README.md test: Measure and track code coverage (#2636) 2025-07-18 18:08:36 +02:00

Llama Stack Unit Tests

Unit Tests

Unit tests verify individual components and functions in isolation. They are fast, reliable, and don't require external services.

Prerequisites

  1. Python Environment: Ensure you have Python 3.12+ installed
  2. uv Package Manager: Install uv if not already installed

You can run the unit tests with:

./scripts/unit-tests.sh [PYTEST_ARGS]

Any additional arguments are passed to pytest. For example, you can specify a test directory, a specific test file, or any pytest flags (e.g., -vvv for verbosity). If no test directory is specified, it defaults to "tests/unit", e.g.:

./scripts/unit-tests.sh tests/unit/registry/test_registry.py -vvv

If you'd like to run the tests with a non-default version of Python (the default is currently 3.12), pass the PYTHON_VERSION environment variable as follows:

source .venv/bin/activate
PYTHON_VERSION=3.13 ./scripts/unit-tests.sh

Test Configuration

  • Test Discovery: Tests are automatically discovered in the tests/unit/ directory
  • Async Support: Tests use --asyncio-mode=auto for automatic async test handling (see the example after this list)
  • Coverage: Tests generate coverage reports in htmlcov/ directory
  • Python Version: Defaults to Python 3.12, but can be overridden with PYTHON_VERSION environment variable
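
With --asyncio-mode=auto, pytest-asyncio collects and awaits async def test functions without an explicit @pytest.mark.asyncio marker. A minimal, hypothetical example:

import asyncio

async def test_coroutine_is_awaited_automatically():
    # No @pytest.mark.asyncio needed: auto mode treats every
    # "async def test_*" coroutine as an asyncio test.
    await asyncio.sleep(0)
    assert True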

Coverage Reports

After running tests, you can view coverage reports:

# Open HTML coverage report in browser
open htmlcov/index.html  # macOS
xdg-open htmlcov/index.html  # Linux
start htmlcov/index.html  # Windows