llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-10-22 16:23:08 +00:00

Author	SHA1	Message	Date
ehhuang	5da5fe2983	Merge `0ef4f1cd6b` into sapling-pr-archive-ehhuang	2025-10-20 15:04:54 -07:00
Eric Huang	0ef4f1cd6b	chore: skip shutdown if otel_endpoint is not set # What does this PR do? Gets rid of the following error when the server is stopped with Ctrl+C: /Users/erichuang/projects/lst3/llama_stack/providers/inline/telemetry/meta_reference/telemetry.py:92 in shutdown: `trace.get_tracer_provider().force_flush()` AttributeError: 'ProxyTracerProvider' object has no attribute 'force_flush' ## Test Plan	2025-10-20 15:04:46 -07:00
ehhuang	f064b90161	Merge `76da90c1fc` into sapling-pr-archive-ehhuang	2025-10-20 14:58:14 -07:00
Eric Huang	76da90c1fc	chore: remove dead code # What does this PR do? ## Test Plan	2025-10-20 14:58:05 -07:00
ehhuang	6cb148dbe6	Merge `059d880bc0` into sapling-pr-archive-ehhuang	2025-10-20 14:53:48 -07:00
Eric Huang	059d880bc0	chore: install client first # What does this PR do? ## Test Plan	2025-10-20 14:53:42 -07:00
Ashwin Bharambe	94faec7bc5	chore(yaml)!: move registered resources to a sub-key (#3861 ) NOTE: this is a backwards incompatible change to the run-configs. A small QOL update, but this will prove useful when I do a rename for "vector_dbs" to "vector_stores" next. Moves all the `models, shields, ...` keys in run-config under a `registered_resources` sub-key.	2025-10-20 14:52:48 -07:00
Ashwin Bharambe	483d53cc37	feat(stainless): add stainless source of truth config (#3860 ) Source of truth for Stainless should be in this repository. This was long due.	2025-10-20 14:32:20 -07:00
Francisco Arceo	48581bf651	chore: Updating how default embedding model is set in stack (#3818 ) # What does this PR do? Refactor setting default vector store provider and embedding model to use an optional `vector_stores` config in the `StackRunConfig` and clean up code to do so (had to add back in some pieces of VectorDB). Also added remote Qdrant and Weaviate to starter distro (based on other PR where inference providers were added for UX). New config is simply (default for Starter distro): ```yaml vector_stores: default_provider_id: faiss default_embedding_model: provider_id: sentence-transformers model_id: nomic-ai/nomic-embed-text-v1.5 ``` ## Test Plan CI and Unit tests. --------- Signed-off-by: Francisco Javier Arceo <farceo@redhat.com> Co-authored-by: Ashwin Bharambe <ashwin.bharambe@gmail.com>	2025-10-20 14:22:45 -07:00
Ashwin Bharambe	2c43285e22	feat(stores)!: use backend storage references instead of configs (#3697 ) This PR changes configurations in a backward incompatible way. Run configs today repeat full SQLite/Postgres snippets everywhere a store is needed, which means duplicated credentials, extra connection pools, and lots of drift between files. This PR introduces named storage backends so the stack and providers can share a single catalog and reference those backends by name. ## Key Changes - Add `storage.backends` to `StackRunConfig`, register each KV/SQL backend once at startup, and validate that references point to the right family. - Move server stores under `storage.stores` with lightweight references (backend + namespace/table) instead of full configs. - Update every provider/config/doc to use the new reference style; docs/codegen now surface the simplified YAML. ## Migration Before: ```yaml metadata_store: type: sqlite db_path: ~/.llama/distributions/foo/registry.db inference_store: type: postgres host: ${env.POSTGRES_HOST} port: ${env.POSTGRES_PORT} db: ${env.POSTGRES_DB} user: ${env.POSTGRES_USER} password: ${env.POSTGRES_PASSWORD} conversations_store: type: postgres host: ${env.POSTGRES_HOST} port: ${env.POSTGRES_PORT} db: ${env.POSTGRES_DB} user: ${env.POSTGRES_USER} password: ${env.POSTGRES_PASSWORD} ``` After: ```yaml storage: backends: kv_default: type: kv_sqlite db_path: ~/.llama/distributions/foo/kvstore.db sql_default: type: sql_postgres host: ${env.POSTGRES_HOST} port: ${env.POSTGRES_PORT} db: ${env.POSTGRES_DB} user: ${env.POSTGRES_USER} password: ${env.POSTGRES_PASSWORD} stores: metadata: backend: kv_default namespace: registry inference: backend: sql_default table_name: inference_store max_write_queue_size: 10000 num_writers: 4 conversations: backend: sql_default table_name: openai_conversations ``` Provider configs follow the same pattern—for example, a Chroma vector adapter switches from: ```yaml providers: vector_io: - provider_id: chromadb 
provider_type: remote::chromadb config: url: ${env.CHROMADB_URL} kvstore: type: sqlite db_path: ~/.llama/distributions/foo/chroma.db ``` to: ```yaml providers: vector_io: - provider_id: chromadb provider_type: remote::chromadb config: url: ${env.CHROMADB_URL} persistence: backend: kv_default namespace: vector_io::chroma_remote ``` Once the backends are declared, everything else just points at them, so rotating credentials or swapping to Postgres happens in one place and the stack reuses a single connection pool.	2025-10-20 13:20:09 -07:00
Shabana Baig	add64e8e2a	feat: Add instructions parameter in response object (#3741) # Problem The current inline provider appends the user-provided instructions to messages as a system prompt, but the returned response object does not contain the instructions field (as specified in the OpenAI Responses spec). # What does this PR do? This pull request adds the instructions field to the response object definition and updates the inline provider. It also ensures that instructions from the previous response are not carried over to the next response (as specified in the OpenAI spec). Closes [#3566](https://github.com/llamastack/llama-stack/issues/3566) ## Test Plan - Tested manually for change in model response with respect to the supplied instructions field. - Added a unit test to check that the instructions from the previous response are not carried over to the next response. - Added integration tests to check the instructions parameter in the returned response object. - Added new recordings for the integration tests. --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-10-20 13:10:37 -07:00
Derek Higgins	1f38359d95	fix: nested claims mapping in OAuth2 token validation (#3814) The get_attributes_from_claims function was only checking for top-level claim keys, causing token validation to fail when using nested claims like "resource_access.llamastack.roles" (common in Keycloak JWT tokens). Updated the function to support dot notation for traversing nested claim structures. Give precedence to dot notation over literal keys with dots in claims mapping. Added test coverage. Closes: #3812 Signed-off-by: Derek Higgins <derekh@redhat.com>	2025-10-20 12:34:55 -07:00
dependabot[bot]	08cbb69ef7	chore(python-deps): bump sqlalchemy from 2.0.41 to 2.0.44 (#3848) Bumps [sqlalchemy](https://github.com/sqlalchemy/sqlalchemy) from 2.0.41 to 2.0.44. ## Release notes Sourced from [sqlalchemy's releases](https://github.com/sqlalchemy/sqlalchemy/releases). # 2.0.44 Released: October 10, 2025 ## platform - [platform] [bug] Unblocked automatic greenlet installation for Python 3.14 now that there are greenlet wheels on PyPI for Python 3.14. ## orm - [orm] [usecase] The way ORM Annotated Declarative interprets Python [PEP 695](https://peps.python.org/pep-0695) type aliases in `Mapped[]` annotations has been refined to expand the lookup scheme. A PEP 695 type can now be resolved based on either its direct presence in `_orm.registry.type_annotation_map` or its immediate resolved value, as long as a recursive lookup across multiple PEP 695 types is not required for it to resolve. This change reverses part of the restrictions introduced in 2.0.37 as part of [#11955](https://www.sqlalchemy.org/trac/ticket/11955), which deprecated (and disallowed in 2.1) the ability to resolve any PEP 695 type that was not explicitly present in `_orm.registry.type_annotation_map`. Recursive lookups of PEP 695 types remain deprecated in 2.0 and disallowed in version 2.1, as do implicit lookups of `NewType` types without an entry in `_orm.registry.type_annotation_map`. Additionally, new support has been added for generic PEP 695 aliases that refer to [PEP 593](https://peps.python.org/pep-0593) `Annotated` constructs containing `_orm.mapped_column()` configurations. See the sections below for examples. References: [#12829](https://www.sqlalchemy.org/trac/ticket/12829) - [orm] [bug] Fixed a caching issue where `_orm.with_loader_criteria()` would incorrectly reuse cached bound parameter values when used with `_sql.CompoundSelect` constructs such as `_sql.union()`. The issue was caused by the cache key for compound selects not including the execution options that are part of the `_sql.Executable` base class, which `_orm.with_loader_criteria()` uses to apply its criteria dynamically. The fix ensures that compound selects and other executable constructs properly include execution options in their cache key traversal. References: [#12905](https://www.sqlalchemy.org/trac/ticket/12905) ## engine - [engine] [bug] Implemented initial support for free-threaded Python by adding new tests and reworking the test harness to include Python 3.13t and Python 3.14t in ... (truncated) ## Commits - See full diff in [compare view](https://github.com/sqlalchemy/sqlalchemy/commits) Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-10-20 12:34:11 -07:00
dependabot[bot]	112a974005	chore(python-deps): bump ruff from 0.9.10 to 0.14.1 (#3846) Bumps [ruff](https://github.com/astral-sh/ruff) from 0.9.10 to 0.14.1. ## Release notes Sourced from [ruff's releases](https://github.com/astral-sh/ruff/releases) and [ruff's changelog](https://github.com/astral-sh/ruff/blob/main/CHANGELOG.md). ## 0.14.1 Released on 2025-10-16. ### Preview features - [formatter] Remove parentheses around multiple exception types on Python 3.14+ (#20768) - [`flake8-bugbear`] Omit annotation in preview fix for `B006` (#20877) - [`flake8-logging-format`] Avoid dropping implicitly concatenated pieces in the `G004` fix (#20793) - [`pydoclint`] Implement `docstring-extraneous-parameter` (`DOC102`) (#20376) - [`pyupgrade`] Extend `UP019` to detect `typing_extensions.Text` (`UP019`) (#20825) - [`pyupgrade`] Fix false negative for `TypeVar` with default argument in `non-pep695-generic-class` (`UP046`) (#20660) ### Bug fixes - Fix false negatives in `Truthiness::from_expr` for lambdas, generators, and f-strings (#20704) - Fix syntax error false positives for escapes and quotes in f-strings (#20867) - Fix syntax error false positives on parenthesized context managers (#20846) - [`fastapi`] Fix false positives for path parameters that FastAPI doesn't recognize (`FAST003`) (#20687) - [`flake8-pyi`] Fix operator precedence by adding parentheses when needed (`PYI061`) (#20508) - [`ruff`] Suppress diagnostic for f-string interpolations with debug text (`RUF010`) (#20525) ### Rule changes - [`airflow`] Add warning to `airflow.datasets.DatasetEvent` usage (`AIR301`) (#20551) - [`flake8-bugbear`] Mark `B905` and `B912` fixes as unsafe (#20695) - Use `DiagnosticTag` for more rules - changes display in editors (#20758, #20734) ### Documentation - Update Python compatibility from 3.13 to 3.14 in README.md (#20852) - Update `lint.flake8-type-checking.quoted-annotations` docs (#20765) - Update setup instructions for Zed 0.208.0+ (#20902) - [`flake8-datetimez`] Clarify docs for several rules (#20778) - Fix typo in `RUF015` description (#20873) ### Other changes - Reduce binary size (#20863) - Improved error recovery for unclosed strings (including f- and t-strings) (#20848) ### Contributors - @ntBre - @Paillat-dev - @terror - @pieterh-oai - @MichaReiser - @TaKO8Ki - @ageorgou ... (truncated) ## Commits - `2bffef5` Bump 0.14.1 (#20925) - `e64d772` Standardize syntax error construction (#20903) - `0369668` [`pydoclint`] Implement `docstring-extraneous-parameter` (`DOC102`) (#20376) - `058fc37` [ty] Fix panic 'missing root' when handling completion request (#20917) - `ec9faa3` [ty] Run file watching tests serial when using nextest (#20918) - `7155a62` [ty] Add version hint for failed stdlib attribute accesses (#20909) - `a67e069` More CI improvements (#20920) - `6a1e91c` [ty] Check typeshed VERSIONS for parent modules when reporting failed stdlib ... - `3db5d59` Don't use codspeed or depot runners in CI jobs on forks (#20894) - `d23826c` [ty] cache Type::is_redundant_with (#20477) - Additional commits viewable in [compare view](https://github.com/astral-sh/ruff/compare/0.9.10...0.14.1) Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-10-20 12:33:44 -07:00
ehhuang	9936f33f7e	chore: disable telemetry if otel endpoint isn't set (#3859 ) # What does this PR do? removes error: ConnectionError: HTTPConnectionPool(host='localhost', port=4318): Max retries exceeded with url: /v1/traces (Caused by NewConnectionError('<urllib3.connection.HTTPConnection object at 0x10fd98e60>: Failed to establish a new connection: [Errno 61] Connection refused')) ## Test Plan uv run llama stack run starter curl http://localhost:8321/v1/models observe no error in server logs	2025-10-20 11:42:57 -07:00
ehhuang	c29b8605b6	Merge `0dbdc448e7` into sapling-pr-archive-ehhuang	2025-10-20 11:20:50 -07:00
Eric Huang	0dbdc448e7	chore: disable telemetry if otel endpoint isn't set # What does this PR do? removes error: ConnectionError: HTTPConnectionPool(host='localhost', port=4318): Max retries exceeded with url: /v1/traces (Caused by NewConnectionError('<urllib3.connection.HTTPConnection object at 0x10fd98e60>: Failed to establish a new connection: [Errno 61] Connection refused')) ## Test Plan uv run llama stack run starter curl http://localhost:8321/v1/models observe no error in server logs	2025-10-20 11:20:42 -07:00
ehhuang	359df3a37c	chore: update doc (#3857 ) # What does this PR do? follows https://github.com/llamastack/llama-stack/pull/3839 ## Test Plan	2025-10-20 10:33:21 -07:00
ehhuang	3287cb1a03	Merge `353821635e` into sapling-pr-archive-ehhuang	2025-10-20 10:25:46 -07:00
Eric Huang	353821635e	chore: update doc # What does this PR do? ## Test Plan	2025-10-20 10:25:26 -07:00
ehhuang	21772de5d3	chore: use dockerfile for building containers (#3839) # What does this PR do? relates to #2878 We introduce a Containerfile which is used to replace the `llama stack build` command (removal in a separate PR). ``` llama stack build --distro starter --image-type venv --run ``` is replaced by ``` llama stack list-deps starter | xargs -L1 uv pip install llama stack run starter ``` - See the updated workflow files for e2e workflow. ## Test Plan CI ``` ❯ docker build . -f docker/Dockerfile --build-arg DISTRO_NAME=starter --build-arg INSTALL_MODE=editable --tag test_starter ❯ docker run -p 8321:8321 test_starter ❯ curl http://localhost:8321/v1/chat/completions \ -H "Content-Type: application/json" \ -d '{ "model": "gpt-4o-mini", "messages": [ { "role": "user", "content": "Hello!" } ] }' ``` --- Stack created with [Sapling](https://sapling-scm.com). Best reviewed with [ReviewStack](https://reviewstack.dev/llamastack/llama-stack/pull/3839). * #3855 * __->__ #3839	2025-10-20 10:23:01 -07:00
ehhuang	a4b68008f4	Merge `756aa0ade8` into sapling-pr-archive-ehhuang	2025-10-20 10:20:27 -07:00
Eric Huang	756aa0ade8	chore: update doc # What does this PR do? ## Test Plan	2025-10-20 10:20:14 -07:00
ehhuang	07fe3b4825	Merge `cb95964fb0` into sapling-pr-archive-ehhuang	2025-10-20 10:19:31 -07:00
Eric Huang	cb95964fb0	dockerfile # What does this PR do? ## Test Plan	2025-10-20 10:19:24 -07:00
Eric Huang	64e1c84374	merge commit for archive created by Sapling	2025-10-20 10:17:59 -07:00
Eric Huang	59503697e9	dockerfile # What does this PR do? ## Test Plan	2025-10-20 10:17:52 -07:00
Charlie Doern	573e783ff0	docs: fix sidebar of `Detailed Tutorial` (#3856) # What does this PR do? The sidebar currently has an extra `ii. Run the Script` entry because it's incorrectly put into the doc as an H3, not an H4 (like the other ones). Fixing this updates the sidebar. Signed-off-by: Charlie Doern <cdoern@redhat.com>	2025-10-20 13:10:50 -04:00
Eric Huang	dbb6a746bf	merge commit for archive created by Sapling	2025-10-20 10:02:43 -07:00
Eric Huang	32c7bfb3b9	dockerfile # What does this PR do? ## Test Plan	2025-10-20 10:02:37 -07:00
Eric Huang	d7ab19a291	merge commit for archive created by Sapling	2025-10-20 09:54:01 -07:00
Eric Huang	8dfff91067	dockerfile # What does this PR do? ## Test Plan	2025-10-20 09:53:53 -07:00
Eric Huang	257afc589d	merge commit for archive created by Sapling	2025-10-20 09:52:27 -07:00
Eric Huang	e1c75f7d0a	dockerfile # What does this PR do? ## Test Plan	2025-10-20 09:52:18 -07:00
Jiayi Ni	165b8b07f4	docs: Documentation update for NVIDIA Inference Provider (#3840) # What does this PR do? - Fix examples in the NVIDIA inference documentation to align with current API requirements. ## Test Plan N/A	2025-10-20 09:51:43 -07:00
Eric Huang	082ff1f94c	merge commit for archive created by Sapling	2025-10-19 21:57:19 -07:00
Eric Huang	35b929af83	chore: update doc # What does this PR do? ## Test Plan	2025-10-19 21:57:12 -07:00
Eric Huang	5b0298b46b	merge commit for archive created by Sapling	2025-10-19 21:47:52 -07:00
Eric Huang	bd2411354c	chore: update doc # What does this PR do? ## Test Plan	2025-10-19 21:47:45 -07:00
Eric Huang	e9f312ac89	merge commit for archive created by Sapling	2025-10-19 21:42:54 -07:00
Eric Huang	e38b6c09f7	chore: update doc # What does this PR do? ## Test Plan	2025-10-19 21:42:44 -07:00
ehhuang	043b9d93cd	Merge `e0dda3bb06` into sapling-pr-archive-ehhuang	2025-10-19 21:13:51 -07:00
Eric Huang	e0dda3bb06	chore: update doc # What does this PR do? ## Test Plan	2025-10-19 21:13:40 -07:00
Eric Huang	a9ff8c5c9f	dockerfile # What does this PR do? ## Test Plan	2025-10-19 21:13:36 -07:00
Eric Huang	558e0dc84b	merge commit for archive created by Sapling	2025-10-19 21:12:19 -07:00
Eric Huang	edb7e6aebe	dockerfile # What does this PR do? ## Test Plan	2025-10-19 21:12:12 -07:00
Eric Huang	58eb97d152	merge commit for archive created by Sapling	2025-10-19 21:11:16 -07:00
Eric Huang	af23850e83	dockerfile # What does this PR do? ## Test Plan	2025-10-19 21:11:10 -07:00
Eric Huang	b53c66e191	merge commit for archive created by Sapling	2025-10-19 16:44:46 -07:00
Eric Huang	4ebd4a60de	dockerfile # What does this PR do? ## Test Plan	2025-10-19 16:38:30 -07:00
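
The nested-claims fix in `1f38359d95` describes dot-notation traversal with precedence over literal dotted keys. The following is a minimal sketch of that lookup order, not the actual `get_attributes_from_claims` implementation; the helper name `lookup_claim` and the sample claims are hypothetical.

```python
from typing import Any

_MISSING = object()  # sentinel so that falsy claim values are still returned


def lookup_claim(claims: dict, key: str) -> Any:
    """Resolve `key` in `claims` (hypothetical helper).

    Dot-notation traversal is tried first; a literal key that happens to
    contain dots is only used as a fallback. Returns None when neither matches.
    """
    node: Any = claims
    for part in key.split("."):
        if isinstance(node, dict) and part in node:
            node = node[part]
        else:
            node = _MISSING
            break
    if node is not _MISSING:
        return node
    return claims.get(key)


# Keycloak-style token payload (illustrative data only).
claims = {
    "sub": "user-1",
    "resource_access": {"llamastack": {"roles": ["admin"]}},
    "a.b": "literal",
}
roles = lookup_claim(claims, "resource_access.llamastack.roles")
```

A key without dots degenerates to a plain top-level lookup, so existing flat mappings would keep working under this scheme.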

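The storage change in `2c43285e22` says references are validated to "point to the right family". As a rough sketch of what such a check could look like, the code below assumes that a store reference carrying `namespace` needs a `kv_*` backend and one carrying `table_name` needs a `sql_*` backend; that rule and the function name `validate_store_refs` are assumptions for illustration, not the project's code.

```python
def validate_store_refs(backends: dict, stores: dict) -> list:
    """Return error strings for bad references; empty list means all valid.

    Assumption: the backend family is the prefix of its `type`
    (`kv_sqlite` -> kv, `sql_postgres` -> sql).
    """
    errors = []
    for store_name, ref in stores.items():
        backend_name = ref.get("backend")
        backend = backends.get(backend_name)
        if backend is None:
            errors.append(f"{store_name}: unknown backend {backend_name!r}")
            continue
        family = backend["type"].split("_", 1)[0]
        wanted = "kv" if "namespace" in ref else "sql"
        if family != wanted:
            errors.append(
                f"{store_name}: needs a {wanted} backend, got {backend['type']}"
            )
    return errors


# Mirrors the shape of the "After" YAML in the commit message.
backends = {
    "kv_default": {"type": "kv_sqlite"},
    "sql_default": {"type": "sql_postgres"},
}
stores = {
    "metadata": {"backend": "kv_default", "namespace": "registry"},
    "inference": {"backend": "sql_default", "table_name": "inference_store"},
}
problems = validate_store_refs(backends, stores)
```

Collecting all errors instead of raising on the first one lets a single startup pass report every misconfigured store.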

3217 commits