llama-stack

forked from phoenix-oss/llama-stack-mirror

Ashwin Bharambe 2fe976ed0a refactor(test): introduce --stack-config and simplify options (#1404)	2025-03-05 17:02:02 -08:00

You now run the integration tests with these options:

```bash
Custom options:
  --stack-config=STACK_CONFIG
                        a 'pointer' to the stack. this can be either be:
                        (a) a template name like `fireworks`, or
                        (b) a path to a run.yaml file, or
                        (c) an adhoc config spec, e.g.
                        `inference=fireworks,safety=llama-guard,agents=meta-reference`
  --env=ENV             Set environment variables, e.g. --env KEY=value
  --text-model=TEXT_MODEL
                        comma-separated list of text models. Fixture name: text_model_id
  --vision-model=VISION_MODEL
                        comma-separated list of vision models. Fixture name: vision_model_id
  --embedding-model=EMBEDDING_MODEL
                        comma-separated list of embedding models. Fixture name: embedding_model_id
  --safety-shield=SAFETY_SHIELD
                        comma-separated list of safety shields. Fixture name: shield_id
  --judge-model=JUDGE_MODEL
                        comma-separated list of judge models. Fixture name: judge_model_id
  --embedding-dimension=EMBEDDING_DIMENSION
                        Output dimensionality of the embedding model to use for testing. Default: 384
  --record-responses    Record new API responses instead of using cached ones.
  --report=REPORT       Path where the test report should be written, e.g. --report=/path/to/report.md
```

Importantly, if you don't specify any of the models (text-model, vision-model, etc.) the relevant tests will get skipped!

This will make running tests somewhat more annoying since all options will need to be specified. We will make this easier by adding some easy wrapper yaml configs.

## Test Plan

Example:

```bash
ashwin@ashwin-mbp ~/local/llama-stack/tests/integration (unify_tests) $ LLAMA_STACK_CONFIG=fireworks pytest -s -v inference/test_text_inference.py \
   --text-model meta-llama/Llama-3.2-3B-Instruct
```
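The commit message above says `--stack-config` accepts three forms: a template name, a path to a run.yaml file, or an adhoc spec. As a rough illustration only, the sketch below shows one way such a value could be told apart and an adhoc spec split into API/provider pairs. The function names `classify_stack_config` and `parse_adhoc_spec`, the returned labels, and the detection rules (file suffix, presence of `=`) are all assumptions made for this example; they are not taken from the llama-stack code base and may differ from what the test suite actually does.

```python
from typing import Dict, Tuple, Union


def parse_adhoc_spec(spec: str) -> Dict[str, str]:
    """Split 'api=provider,api=provider' into a dict (hypothetical helper)."""
    providers: Dict[str, str] = {}
    for pair in spec.split(","):
        api, sep, provider = pair.partition("=")
        api, provider = api.strip(), provider.strip()
        # Every entry must look like 'api=provider' with both sides non-empty.
        if not sep or not api or not provider:
            raise ValueError(f"malformed entry in adhoc spec: {pair!r}")
        providers[api] = provider
    return providers


def classify_stack_config(value: str) -> Tuple[str, Union[str, Dict[str, str]]]:
    """Guess which of the three documented forms a value takes.

    The rules here are assumptions for illustration:
      * ends with .yaml/.yml -> treated as a path to a run.yaml file
      * contains '='         -> treated as an adhoc config spec
      * anything else        -> treated as a template name
    """
    if value.endswith((".yaml", ".yml")):
        return ("run_yaml", value)
    if "=" in value:
        return ("adhoc", parse_adhoc_spec(value))
    return ("template", value)


if __name__ == "__main__":
    for example in (
        "fireworks",
        "/path/to/run.yaml",
        "inference=fireworks,safety=llama-guard,agents=meta-reference",
    ):
        print(example, "->", classify_stack_config(example))
```

Checking the file suffix before looking for `=` keeps a path that happens to contain `=` from being misread as an adhoc spec; a real implementation might instead test whether the path exists on disk.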
routers	fix: don't import from llama_models (#1436)	2025-03-05 15:30:38 -08:00
server	feat: add more logs to agent_instance.py	2025-03-03 16:15:47 -08:00
store	refactor: move a few tests to top-level tests/ directory	2025-03-03 17:33:39 -08:00
ui	chore: Make README code blocks more easily copy pastable (#1420)	2025-03-05 09:11:01 -08:00
utils	chore: remove unused build dir (#1379)	2025-03-05 15:40:00 -08:00
__init__.py	API Updates (#73)	2024-09-17 19:51:35 -07:00
build.py	build(container): misc improvements (#1291)	2025-02-28 10:01:52 -08:00
build_conda_env.sh	chore: remove straggler references to llama-models (#1345)	2025-03-01 14:26:03 -08:00
build_container.sh	chore: remove straggler references to llama-models (#1345)	2025-03-01 14:26:03 -08:00
build_venv.sh	chore: remove straggler references to llama-models (#1345)	2025-03-01 14:26:03 -08:00
client.py	chore: move all Llama Stack types from llama-models to llama-stack (#1098)	2025-02-14 09:10:59 -08:00
common.sh	fix: Fixing some small issues with the build scripts (#1132)	2025-02-19 22:20:49 -08:00
configure.py	build: format codebase imports using ruff linter (#1028)	2025-02-13 10:06:21 -08:00
datatypes.py	fix!: update eval-tasks -> benchmarks (#1032)	2025-02-13 16:40:58 -08:00
distribution.py	chore(lint): update Ruff ignores for project conventions and maintainability (#1184)	2025-02-28 09:36:49 -08:00
inspect.py	fix: improve signal handling and update dependencies (#1044)	2025-02-13 08:07:59 -08:00
library_client.py	fix: raise error when request param failed to convert (#1339)	2025-03-01 10:39:05 -08:00
request_headers.py	Add X-LlamaStack-Client-Version, rename ProviderData -> Provider-Data (#735)	2025-01-09 11:51:36 -08:00
resolver.py	feat: record token usage for inference API (#1300)	2025-03-05 12:41:45 -08:00
stack.py	refactor(test): introduce --stack-config and simplify options (#1404)	2025-03-05 17:02:02 -08:00
start_stack.sh	feat: add a configurable category-based logger (#1352)	2025-03-02 18:51:14 -08:00