llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-10-23 08:33:09 +00:00

Author	SHA1	Message	Date
Eric Huang	558e0dc84b	merge commit for archive created by Sapling	2025-10-19 21:12:19 -07:00
Eric Huang	edb7e6aebe	dockerfile # What does this PR do? ## Test Plan	2025-10-19 21:12:12 -07:00
Eric Huang	58eb97d152	merge commit for archive created by Sapling	2025-10-19 21:11:16 -07:00
Eric Huang	af23850e83	dockerfile # What does this PR do? ## Test Plan	2025-10-19 21:11:10 -07:00
Eric Huang	b53c66e191	merge commit for archive created by Sapling	2025-10-19 16:44:46 -07:00
Eric Huang	4ebd4a60de	dockerfile # What does this PR do? ## Test Plan	2025-10-19 16:38:30 -07:00
ehhuang	8f9910ff24	Merge `a35d090fd9` into sapling-pr-archive-ehhuang	2025-10-18 17:23:35 -07:00
Eric Huang	a35d090fd9	dockerfile # What does this PR do? ## Test Plan	2025-10-18 17:23:32 -07:00
Eric Huang	d23a445cfd	merge commit for archive created by Sapling	2025-10-18 17:16:07 -07:00
Eric Huang	34f0ad0d25	dockerfile # What does this PR do? ## Test Plan	2025-10-18 17:16:02 -07:00
Eric Huang	11de912b66	merge commit for archive created by Sapling	2025-10-17 22:29:29 -07:00
Eric Huang	d227602376	dockerfile # What does this PR do? ## Test Plan	2025-10-17 22:27:52 -07:00
Eric Huang	37aef9d51d	merge commit for archive created by Sapling	2025-10-17 21:49:49 -07:00
Eric Huang	3d53dc201d	dockerfile # What does this PR do? ## Test Plan	2025-10-17 21:49:05 -07:00
Charlie Doern	b11bcfde11	refactor(build): rework CLI commands and build process (1/2) (#2974 ) # What does this PR do? 
This PR does a few things outlined in #2878, namely: 1. adds `llama stack list-deps`, a command that takes the build logic and, instead of executing one of the `build_...` scripts, displays all of the providers' dependencies using the `module` and `uv`. 2. deprecates `llama stack build` in favor of `llama stack list-deps`. 3. updates all tests to use `list-deps` alongside `build`. PR 2/2 will migrate `llama stack run`'s default behavior to be `llama stack build --run` and use the new `list-deps` command under the hood before running the server. Example of `llama stack list-deps starter`: ``` llama stack list-deps starter --format json { "name": "starter", "description": "Quick start template for running Llama Stack with several popular providers. This distribution is intended for CPU-only environments.", "apis": [ { "api": "inference", "provider": "remote::cerebras" }, { "api": "inference", "provider": "remote::ollama" }, { "api": "inference", "provider": "remote::vllm" }, { "api": "inference", "provider": "remote::tgi" }, { "api": "inference", "provider": "remote::fireworks" }, { "api": "inference", "provider": "remote::together" }, { "api": "inference", "provider": "remote::bedrock" }, { "api": "inference", "provider": "remote::nvidia" }, { "api": "inference", "provider": "remote::openai" }, { "api": "inference", "provider": "remote::anthropic" }, { "api": "inference", "provider": "remote::gemini" }, { "api": "inference", "provider": "remote::vertexai" }, { "api": "inference", "provider": "remote::groq" }, { "api": "inference", "provider": "remote::sambanova" }, { "api": "inference", "provider": "remote::azure" }, { "api": "inference", "provider": "inline::sentence-transformers" }, { "api": "vector_io", "provider": "inline::faiss" }, { "api": "vector_io", "provider": "inline::sqlite-vec" }, { "api": "vector_io", "provider": "inline::milvus" }, { "api": "vector_io", "provider": "remote::chromadb" }, { "api": "vector_io", "provider": "remote::pgvector" }, { 
"api": "files", "provider": "inline::localfs" }, { "api": "safety", "provider": "inline::llama-guard" }, { "api": "safety", "provider": "inline::code-scanner" }, { "api": "agents", "provider": "inline::meta-reference" }, { "api": "telemetry", "provider": "inline::meta-reference" }, { "api": "post_training", "provider": "inline::torchtune-cpu" }, { "api": "eval", "provider": "inline::meta-reference" }, { "api": "datasetio", "provider": "remote::huggingface" }, { "api": "datasetio", "provider": "inline::localfs" }, { "api": "scoring", "provider": "inline::basic" }, { "api": "scoring", "provider": "inline::llm-as-judge" }, { "api": "scoring", "provider": "inline::braintrust" }, { "api": "tool_runtime", "provider": "remote::brave-search" }, { "api": "tool_runtime", "provider": "remote::tavily-search" }, { "api": "tool_runtime", "provider": "inline::rag-runtime" }, { "api": "tool_runtime", "provider": "remote::model-context-protocol" }, { "api": "batches", "provider": "inline::reference" } ], "pip_dependencies": [ "pandas", "opentelemetry-exporter-otlp-proto-http", "matplotlib", "opentelemetry-sdk", "sentence-transformers", "datasets", "pymilvus[milvus-lite]>=2.4.10", "codeshield", "scipy", "torchvision", "tree_sitter", "h11>=0.16.0", "aiohttp", "pymongo", "tqdm", "pythainlp", "pillow", "torch", "emoji", "grpcio>=1.67.1,<1.71.0", "fireworks-ai", "langdetect", "psycopg2-binary", "asyncpg", "redis", "together", "torchao>=0.12.0", "openai", "sentencepiece", "aiosqlite", "google-cloud-aiplatform", "faiss-cpu", "numpy", "sqlite-vec", "nltk", "scikit-learn", "mcp>=1.8.1", "transformers", "boto3", "huggingface_hub", "ollama", "autoevals", "sqlalchemy[asyncio]", "torchtune>=0.5.0", "chromadb-client", "pypdf", "requests", "anthropic", "chardet", "aiosqlite", "fastapi", "fire", "httpx", "uvicorn", "opentelemetry-sdk", "opentelemetry-exporter-otlp-proto-http" ] } ``` <img width="1500" height="420" alt="Screenshot 2025-10-16 at 5 53 03 PM" 
src="https://github.com/user-attachments/assets/765929fb-93e2-44d7-9c3d-8918b70fc721" /> --------- Signed-off-by: Charlie Doern <cdoern@redhat.com>	2025-10-17 19:52:14 -07:00
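The `--format json` output shown in b11bcfde11 lists some packages more than once (for example `aiosqlite` and `opentelemetry-sdk`). A minimal sketch of post-processing such output follows, assuming only the JSON shape shown in the commit message (`apis` and `pip_dependencies` keys). The `SAMPLE` payload is a trimmed, hand-written stand-in, and the helper names `unique_dependencies` and `providers_by_api` are hypothetical; they are not part of llama-stack.

```python
import json

# Hypothetical, trimmed payload shaped like the `--format json` output above.
SAMPLE = """
{
  "name": "starter",
  "apis": [
    {"api": "inference", "provider": "remote::ollama"},
    {"api": "vector_io", "provider": "inline::faiss"},
    {"api": "vector_io", "provider": "inline::sqlite-vec"}
  ],
  "pip_dependencies": [
    "aiosqlite", "faiss-cpu", "opentelemetry-sdk",
    "aiosqlite", "opentelemetry-sdk"
  ]
}
"""


def unique_dependencies(report: dict) -> list[str]:
    """Drop repeated requirement strings while keeping first-seen order."""
    return list(dict.fromkeys(report["pip_dependencies"]))


def providers_by_api(report: dict) -> dict[str, list[str]]:
    """Group provider names under the API they implement."""
    grouped: dict[str, list[str]] = {}
    for entry in report["apis"]:
        grouped.setdefault(entry["api"], []).append(entry["provider"])
    return grouped


report = json.loads(SAMPLE)
deps = unique_dependencies(report)
grouped = providers_by_api(report)
print(deps)
print(grouped)
```

In practice the JSON text would come from the command's standard output rather than an inline string; how the real command formats or orders its output beyond what the commit message shows is not assumed here.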
Eric Huang	5fde82448e	merge commit for archive created by Sapling	2025-10-17 14:56:01 -07:00
Eric Huang	aad3fd2cf3	dockerfile # What does this PR do? ## Test Plan	2025-10-17 14:55:54 -07:00
Eric Huang	25b572f21d	merge commit for archive created by Sapling	2025-10-17 14:27:15 -07:00
Eric Huang	e7d850311e	dockerfile # What does this PR do? ## Test Plan	2025-10-17 14:27:04 -07:00
Eric Huang	03af593f6d	merge commit for archive created by Sapling	2025-10-17 14:15:02 -07:00
Eric Huang	78d61fd54b	dockerfile # What does this PR do? ## Test Plan	2025-10-17 14:14:52 -07:00
ehhuang	e2269fc9d0	Merge `9ea4c63dd7` into sapling-pr-archive-ehhuang	2025-10-17 14:02:31 -07:00
Eric Huang	9ea4c63dd7	chore: add telemetry setup to install.sh # What does this PR do? ## Test Plan	2025-10-17 14:02:16 -07:00
ehhuang	f12cdac4f4	Merge `28b154a6ca` into sapling-pr-archive-ehhuang	2025-10-17 14:00:50 -07:00
Eric Huang	28b154a6ca	chore: add telemetry setup to install.sh # What does this PR do? ## Test Plan	2025-10-17 14:00:45 -07:00
Eric Huang	08d6213aff	merge commit for archive created by Sapling	2025-10-17 13:33:02 -07:00
Eric Huang	0a1a69e6dd	dockerfile # What does this PR do? ## Test Plan	2025-10-17 13:32:48 -07:00
Emilio Garcia	943558af36	test(telemetry): Telemetry Tests (#3805 ) # What does this PR do? Adds a test and a standardized way to build future tests out for telemetry in llama stack. Contributes to https://github.com/llamastack/llama-stack/issues/3806 ## Test Plan This is the test plan 😎	2025-10-17 10:43:33 -07:00
Alexey Rybak	224c99560c	docs: update docstrings for better formatting (#3838 ) # What does this PR do? Updates docstrings for Conversations and Eval APIs to render better in the docs nav sidebar. Before: <img width="363" height="233" alt="Screenshot 2025-10-17 at 9 52 17 AM" src="https://github.com/user-attachments/assets/3a77f9e3-3b03-43ae-8584-a21d1f44d54d" /> After: <img width="410" height="206" alt="Screenshot 2025-10-17 at 9 52 11 AM" src="https://github.com/user-attachments/assets/fa5d428d-2bde-4453-84fd-9aceebe712e8" /> ## Test Plan * Manual testing	2025-10-17 10:41:50 -07:00
Alexey Rybak	c9f0bebcb7	chore: update API leveling docs with deprecation flag (#3837 ) # What does this PR do? Adds information on the `deprecated=True` flags to the documentation for extra clarity. ## Test Plan * Manual testing	2025-10-17 10:17:58 -07:00
Ashwin Bharambe	a701f68bd7	feat(ci): enable docker based server tests (#3833 )	2025-10-17 09:19:25 +02:00
Ashwin Bharambe	4c9d944380	fix(perf): make batches tests finish 30x faster (#3834 ) In replay mode, inference is instantaneous. We don't need to wait 15 seconds for the batch to be done. Fixing polling to do exponential backoff makes things work super fast.	2025-10-17 09:16:44 +02:00
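The polling change described in 4c9d944380 can be illustrated with a generic exponential-backoff loop. This is a sketch of the technique only, not the project's test code: the function name `poll_until_done`, its parameters, and the default delays are all assumptions made for illustration.

```python
import time
from typing import Callable


def poll_until_done(
    check: Callable[[], bool],
    initial_delay: float = 0.01,
    max_delay: float = 1.0,
    timeout: float = 30.0,
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Call `check` until it returns True, doubling the wait between calls.

    Returns the number of calls made. Raises TimeoutError if `timeout`
    seconds pass without success. `sleep` is injectable so the loop can
    be exercised without real waiting.
    """
    delay = initial_delay
    deadline = time.monotonic() + timeout
    attempts = 0
    while True:
        attempts += 1
        if check():
            return attempts
        if time.monotonic() >= deadline:
            raise TimeoutError(f"not done after {attempts} attempts")
        sleep(delay)
        # Exponential backoff: double the delay, capped at max_delay.
        delay = min(delay * 2, max_delay)


# Usage: a fake job that finishes on the third check, with sleeps recorded
# instead of performed.
delays: list[float] = []
states = iter([False, False, True])
attempts = poll_until_done(lambda: next(states), sleep=delays.append)
print(attempts, delays)
```

When the work finishes almost immediately, as in replay mode, the first short waits let the loop return quickly instead of paying a fixed multi-second sleep per check.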
Ashwin Bharambe	cd152f4240	feat(ci): add support for docker:distro in tests (#3832 ) Also a critical bug fix so test recordings can be found inside docker	2025-10-16 19:33:13 -07:00
ehhuang	b3099d40e2	fix(telemetry): remove dependency on old telemetry config (#3830 ) # What does this PR do? old telemetry config was removed in #3815 ## Test Plan ❯ OTEL_SERVICE_NAME=aloha OTEL_EXPORTER_OTLP_ENDPOINT=http://localhost:4318 uv run llama stack run starter <img width="1888" height="605" alt="image" src="https://github.com/user-attachments/assets/dd5cc9f0-213a-4dc6-9385-f61a3a13b4c3" />	2025-10-16 12:05:10 -07:00
ehhuang	61daef193e	Merge `6fbbb3e78b` into sapling-pr-archive-ehhuang	2025-10-16 11:33:32 -07:00
Eric Huang	6fbbb3e78b	fix(telemetry): remove dependency on old telemetry config # What does this PR do? old telemetry config was removed in #3815 ## Test Plan ❯ OTEL_SERVICE_NAME=aloha OTEL_EXPORTER_OTLP_ENDPOINT=http://localhost:4318 uv run llama stack run starter	2025-10-16 11:33:24 -07:00
ehhuang	07ff15d917	chore: distrogen enables telemetry by default (#3828 ) # What does this PR do? leftover from #3815 ## Test Plan CI --- [//]: # (BEGIN SAPLING FOOTER) Stack created with [Sapling](https://sapling-scm.com). Best reviewed with [ReviewStack](https://reviewstack.dev/llamastack/llama-stack/pull/3828). * #3830 * __->__ #3828	2025-10-16 11:29:51 -07:00
ehhuang	cdeb41f438	Merge `5a991b5634` into sapling-pr-archive-ehhuang	2025-10-16 11:29:11 -07:00
Eric Huang	5a991b5634	fix(telemetry): remove dependency on old telemetry config # What does this PR do? old telemetry config was removed in #3815 ## Test Plan ❯ OTEL_SERVICE_NAME=aloha OTEL_EXPORTER_OTLP_ENDPOINT=http://localhost:4318 uv run llama stack run starter	2025-10-16 11:29:06 -07:00
ehhuang	53ea3222ac	Merge `38976b5ac1` into sapling-pr-archive-ehhuang	2025-10-16 11:26:09 -07:00
Eric Huang	38976b5ac1	fix(telemetry): remove dependency on old telemetry config # What does this PR do? old telemetry config was removed in #3815 ## Test Plan ❯ OTEL_SERVICE_NAME=aloha OTEL_EXPORTER_OTLP_ENDPOINT=http://localhost:4318 uv run llama stack run starter	2025-10-16 11:26:01 -07:00
Eric Huang	c4662ac316	merge commit for archive created by Sapling	2025-10-16 11:21:19 -07:00
Eric Huang	3679612b5f	chore: distrogen enables telemetry by default # What does this PR do? ## Test Plan Telemetry provider was added to all distributions in the latest commit but the protocol mapping was missing, causing a KeyError when the stack tried to validate provider compliance.	2025-10-16 11:21:13 -07:00
ehhuang	41c54b7e16	Merge `b7c276ea6d` into sapling-pr-archive-ehhuang	2025-10-16 10:56:14 -07:00
Eric Huang	b7c276ea6d	chore: distrogen enables telemetry by default # What does this PR do? ## Test Plan Telemetry provider was added to all distributions in the latest commit but the protocol mapping was missing, causing a KeyError when the stack tried to validate provider compliance.	2025-10-16 10:56:07 -07:00
Eric Huang	70c96147ae	merge commit for archive created by Sapling	2025-10-16 10:47:44 -07:00
Eric Huang	60e7d2ac60	chore: distrogen enables telemetry by default # What does this PR do? ## Test Plan	2025-10-16 10:47:35 -07:00
Charlie Doern	f22aaef42f	chore!: remove telemetry API usage (#3815 ) # What does this PR do? Removes telemetry as a providable API from the codebase. This includes removing it from generated distributions as well as the provider registry, the router, etc. Since `setup_logger` is tied pretty strictly to `Api.telemetry` being in impls, we still need an "instantiated provider" in our implementations. However, it should not be auto-routed or provided. So in `validate_and_prepare_providers` (called from `resolve_impls`), if `run_config.telemetry.enabled` is set, the meta-reference "provider" is set up internally so that `log_event` will work when called. This is the neatest way I think we can remove telemetry from the provider configs without ripping apart the whole "telemetry is a provider" logic just yet; we can do that internally later without disrupting users. Telemetry is therefore removed from the registry, so that if a user puts `telemetry:` as an API in their build/run config it will error out, but it can still be used by us internally as we go through this transition. Relates to #3806 Signed-off-by: Charlie Doern <cdoern@redhat.com>	2025-10-16 10:39:32 -07:00
slekkala1	8c5705d39e	fix: test id not being set in headers (#3827 ) # What does this PR do? When the stack config is set to server in docker (STACK_CONFIG_ARG=--stack-config=http://localhost:8321), the env variable was not set correctly and the test ID was not set, causing the error below. This is needed for test-and-cut to work. E openai.BadRequestError: Error code: 400 - {'detail': 'Invalid value: Test ID is required for file ID allocation'} `5286461406` ## Test Plan CI	2025-10-16 10:29:07 -07:00
Bill Murdock	c19eb9854d	docs: Document known limitations of Responses (#3776 ) # What does this PR do? Adds a subpage of the OpenAI compatibility page in the documentation. This subpage documents known limitations of the Responses API. Closes #3575 --------- Signed-off-by: Bill Murdock <bmurdock@redhat.com>	2025-10-16 10:26:23 -07:00

3169 commits