mirror of https://github.com/meta-llama/llama-stack.git synced 2025-10-12 13:57:57 +00:00

History

ehhuang 06e4cd8e02 Some checks failed SqlStore Integration Tests / test-postgres (3.12) (push) Failing after 0s Details SqlStore Integration Tests / test-postgres (3.13) (push) Failing after 0s Details Integration Auth Tests / test-matrix (oauth2_token) (push) Failing after 1s Details Python Package Build Test / build (3.12) (push) Failing after 1s Details Python Package Build Test / build (3.13) (push) Failing after 1s Details Integration Tests (Replay) / Integration Tests (, , , client=, ) (push) Failing after 3s Details Test External Providers Installed via Module / test-external-providers-from-module (venv) (push) Has been skipped Details Vector IO Integration Tests / test-matrix (push) Failing after 5s Details API Conformance Tests / check-schema-compatibility (push) Successful in 9s Details Test External API and Providers / test-external (venv) (push) Failing after 4s Details Unit Tests / unit-tests (3.12) (push) Failing after 4s Details Unit Tests / unit-tests (3.13) (push) Failing after 4s Details UI Tests / ui-tests (22) (push) Successful in 38s Details Pre-commit / pre-commit (push) Successful in 1m27s Details feat(api)!: BREAKING CHANGE: support passing `extra_body` through to providers (#3777 ) # What does this PR do? Allows passing through extra_body parameters to inference providers. With this, we removed the 2 vllm-specific parameters from completions API into `extra_body`. Before/After <img width="1883" height="324" alt="image" src="https://github.com/user-attachments/assets/acb27c08-c748-46c9-b1da-0de64e9908a1" /> closes #2720 ## Test Plan CI and added new test ``` ❯ uv run pytest -s -v tests/integration/ --stack-config=server:starter --inference-mode=record -k 'not( builtin_tool or safety_with_image or code_interpreter or test_rag ) and test_openai_completion_guided_choice' --setup=vllm --suite=base --color=yes Uninstalled 3 packages in 125ms Installed 3 packages in 19ms INFO 2025-10-10 14:29:54,317 tests.integration.conftest:118 tests: Applying setup 'vllm' for suite base INFO 2025-10-10 14:29:54,331 tests.integration.conftest:47 tests: Test stack config type: server (stack_config=server:starter) ============================================================================================================== test session starts ============================================================================================================== platform darwin -- Python 3.12.11, pytest-8.4.2, pluggy-1.6.0 -- /Users/erichuang/projects/llama-stack-1/.venv/bin/python cachedir: .pytest_cache metadata: {'Python': '3.12.11', 'Platform': 'macOS-15.6.1-arm64-arm-64bit', 'Packages': {'pytest': '8.4.2', 'pluggy': '1.6.0'}, 'Plugins': {'anyio': '4.9.0', 'html': '4.1.1', 'socket': '0.7.0', 'asyncio': '1.1.0', 'json-report': '1.5.0', 'timeout': '2.4.0', 'metadata': '3.1.1', 'cov': '6.2.1', 'nbval': '0.11.0'}} rootdir: /Users/erichuang/projects/llama-stack-1 configfile: pyproject.toml plugins: anyio-4.9.0, html-4.1.1, socket-0.7.0, asyncio-1.1.0, json-report-1.5.0, timeout-2.4.0, metadata-3.1.1, cov-6.2.1, nbval-0.11.0 asyncio: mode=Mode.AUTO, asyncio_default_fixture_loop_scope=None, asyncio_default_test_loop_scope=function collected 285 items / 284 deselected / 1 selected tests/integration/inference/test_openai_completion.py::test_openai_completion_guided_choice[txt=vllm/Qwen/Qwen3-0.6B] instantiating llama_stack_client Starting llama stack server with config 'starter' on port 8321... Waiting for server at http://localhost:8321... (0.0s elapsed) Waiting for server at http://localhost:8321... (0.5s elapsed) Waiting for server at http://localhost:8321... (5.1s elapsed) Waiting for server at http://localhost:8321... (5.6s elapsed) Waiting for server at http://localhost:8321... (10.1s elapsed) Waiting for server at http://localhost:8321... (10.6s elapsed) Server is ready at http://localhost:8321 llama_stack_client instantiated in 11.773s PASSEDTerminating llama stack server process... Terminating process 98444 and its group... Server process and children terminated gracefully ============================================================================================================= slowest 10 durations ============================================================================================================== 11.88s setup tests/integration/inference/test_openai_completion.py::test_openai_completion_guided_choice[txt=vllm/Qwen/Qwen3-0.6B] 3.02s call tests/integration/inference/test_openai_completion.py::test_openai_completion_guided_choice[txt=vllm/Qwen/Qwen3-0.6B] 0.01s teardown tests/integration/inference/test_openai_completion.py::test_openai_completion_guided_choice[txt=vllm/Qwen/Qwen3-0.6B] ================================================================================================ 1 passed, 284 deselected, 3 warnings in 16.21s ================================================================================================= ```		2025-10-10 16:21:44 -07:00
..
changelog.yml	chore(github-deps): bump actions/checkout from 4.2.2 to 5.0.0 (#3178 )	2025-08-20 16:51:40 -07:00
conformance.yml	feat(api)!: BREAKING CHANGE: support passing `extra_body` through to providers (#3777 )	2025-10-10 16:21:44 -07:00
install-script-ci.yml	chore(github-deps): bump actions/checkout from 4.2.2 to 5.0.0 (#3178 )	2025-08-20 16:51:40 -07:00
integration-auth-tests.yml	fix(auth): allow unauthenticated access to health and version endpoints (#3736 )	2025-10-10 13:41:43 -07:00
integration-sql-store-tests.yml	fix(ci): make all CI workflows have the correct concurrency defn	2025-08-21 16:05:25 -07:00
integration-tests.yml	fix(ci): remove responses from CI for now (#3773 )	2025-10-10 11:52:17 -07:00
integration-vector-io-tests.yml	chore(github-deps): bump actions/checkout from 4.2.2 to 5.0.0 (#3178 )	2025-08-20 16:51:40 -07:00
pre-commit.yml	fix: Improve pre-commit workflow error handling and feedback (#3400 )	2025-09-12 11:10:59 +02:00
precommit-trigger.yml	chore(github-deps): bump actions/github-script from 7.0.1 to 8.0.0 (#3685 )	2025-10-05 21:20:00 -07:00
providers-build.yml	chore: use uvicorn to start llama stack server everywhere (#3625 )	2025-10-06 14:27:40 +02:00
python-build-test.yml	chore!: remove model mgmt from CLI for Hugging Face CLI (#3700 )	2025-10-09 16:50:33 -07:00
README.md	fix: merge workflows to avoid GITHUB_TOKEN limitation	2025-10-03 12:04:02 -07:00
record-integration-tests.yml	feat(tests): make inference_recorder into api_recorder (include tool_invoke) (#3403 )	2025-10-09 14:27:51 -07:00
semantic-pr.yml	chore(github-deps): bump amannn/action-semantic-pull-request from 6.1.0 to 6.1.1 (#3248 )	2025-08-25 17:34:17 +02:00
stale_bot.yml	chore(github-deps): bump actions/stale from 10.0.0 to 10.1.0 (#3684 )	2025-10-08 12:16:54 +02:00
test-external-provider-module.yml	chore!: remove --env from `llama stack run` (#3711 )	2025-10-07 20:58:15 -07:00
test-external.yml	chore!: remove --env from `llama stack run` (#3711 )	2025-10-07 20:58:15 -07:00
ui-unit-tests.yml	chore(github-deps): bump actions/setup-node from 4.4.0 to 5.0.0 (#3353 )	2025-09-08 10:05:00 +02:00
unit-tests.yml	fix(ci): make all CI workflows have the correct concurrency defn	2025-08-21 16:05:25 -07:00

README.md

Llama Stack CI

Llama Stack uses GitHub Actions for Continuous Integration (CI). Below is a table detailing what CI the project includes and the purpose.

Name	File	Purpose
Update Changelog	changelog.yml	Creates PR for updating the CHANGELOG.md
API Conformance Tests	conformance.yml	Run the API Conformance test suite on the changes.
Installer CI	install-script-ci.yml	Test the installation script
Integration Auth Tests	integration-auth-tests.yml	Run the integration test suite with Kubernetes authentication
SqlStore Integration Tests	integration-sql-store-tests.yml	Run the integration test suite with SqlStore
Integration Tests (Replay)	integration-tests.yml	Run the integration test suites from tests/integration in replay mode
Vector IO Integration Tests	integration-vector-io-tests.yml	Run the integration test suite with various VectorIO providers
Pre-commit	pre-commit.yml	Run pre-commit checks
Pre-commit Bot	precommit-trigger.yml	Pre-commit bot for PR
Test Llama Stack Build	providers-build.yml	Test llama stack build
Python Package Build Test	python-build-test.yml	Test building the llama-stack PyPI project
Integration Tests (Record)	record-integration-tests.yml	Run the integration test suite from tests/integration
Check semantic PR titles	semantic-pr.yml	Ensure that PR titles follow the conventional commit spec
Close stale issues and PRs	stale_bot.yml	Run the Stale Bot action
Test External Providers Installed via Module	test-external-provider-module.yml	Test External Provider installation via Python module
Test External API and Providers	test-external.yml	Test the External API and Provider mechanisms
UI Tests	ui-unit-tests.yml	Run the UI test suite
Unit Tests	unit-tests.yml	Run the unit test suite