Mirror of https://github.com/meta-llama/llama-stack.git (synced 2025-10-25 09:05:37 +00:00)

141 commits

	| Author | SHA1 | Message | Date | |
|---|---|---|---|---|
|  | 1136daf310 | fix: replace python-jose with PyJWT for JWT handling (#3756) # What does this PR do? This commit migrates the authentication system from python-jose to PyJWT to eliminate the dependency on the archived rsa package. The migration includes: - Refactored OAuth2TokenAuthProvider to use PyJWT's PyJWKClient for clean JWKS handling - Removed manual JWKS fetching, caching and key extraction logic in favor of PyJWT's built-in functionality The new implementation is cleaner, more maintainable, and follows PyJWT best practices while maintaining full backward compatibility. ## Test Plan Unit tests. Auth CI. --------- Signed-off-by: Sébastien Han <seb@redhat.com> | ||
|  | cfd2e303db | chore(python-deps): bump black from 25.1.0 to 25.9.0 (#3783) Bumps [black](https://github.com/psf/black) from 25.1.0 to 25.9.0. <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/psf/black/releases">black's releases</a>.</em></p> <blockquote> <h2>25.9.0</h2> <h3>Highlights</h3> <ul> <li>Remove support for pre-python 3.7 <code>await/async</code> as soft keywords/variable names (<a href="https://redirect.github.com/psf/black/issues/4676">#4676</a>)</li> </ul> <h3>Stable style</h3> <ul> <li>Fix crash while formatting a long <code>del</code> statement containing tuples (<a href="https://redirect.github.com/psf/black/issues/4628">#4628</a>)</li> <li>Fix crash while formatting expressions using the walrus operator in complex <code>with</code> statements (<a href="https://redirect.github.com/psf/black/issues/4630">#4630</a>)</li> <li>Handle <code># fmt: skip</code> followed by a comment at the end of file (<a href="https://redirect.github.com/psf/black/issues/4635">#4635</a>)</li> <li>Fix crash when a tuple appears in the <code>as</code> clause of a <code>with</code> statement (<a href="https://redirect.github.com/psf/black/issues/4634">#4634</a>)</li> <li>Fix crash when tuple is used as a context manager inside a <code>with</code> statement (<a href="https://redirect.github.com/psf/black/issues/4646">#4646</a>)</li> <li>Fix crash when formatting a <code>\</code> followed by a <code>\r</code> followed by a comment (<a href="https://redirect.github.com/psf/black/issues/4663">#4663</a>)</li> <li>Fix crash on a <code>\\r\n</code> (<a href="https://redirect.github.com/psf/black/issues/4673">#4673</a>)</li> <li>Fix crash on <code>await ...</code> (where <code>...</code> is a literal <code>Ellipsis</code>) (<a href="https://redirect.github.com/psf/black/issues/4676">#4676</a>)</li> <li>Fix crash on parenthesized expression inside a type parameter bound (<a 
href="https://redirect.github.com/psf/black/issues/4684">#4684</a>)</li> <li>Fix crash when using line ranges excluding indented single line decorated items (<a href="https://redirect.github.com/psf/black/issues/4670">#4670</a>)</li> </ul> <h3>Preview style</h3> <ul> <li>Fix a bug where one-liner functions/conditionals marked with <code># fmt: skip</code> would still be formatted (<a href="https://redirect.github.com/psf/black/issues/4552">#4552</a>)</li> <li>Improve <code>multiline_string_handling</code> with ternaries and dictionaries (<a href="https://redirect.github.com/psf/black/issues/4657">#4657</a>)</li> <li>Fix a bug where <code>string_processing</code> would not split f-strings directly after expressions (<a href="https://redirect.github.com/psf/black/issues/4680">#4680</a>)</li> <li>Wrap the <code>in</code> clause of comprehensions across lines if necessary (<a href="https://redirect.github.com/psf/black/issues/4699">#4699</a>)</li> <li>Remove parentheses around multiple exception types in <code>except</code> and <code>except*</code> without <code>as</code>. (<a href="https://redirect.github.com/psf/black/issues/4720">#4720</a>)</li> <li>Add <code>\r</code> style newlines to the potential newlines to normalize file newlines both from and to (<a href="https://redirect.github.com/psf/black/issues/4710">#4710</a>)</li> </ul> <h3>Parser</h3> <ul> <li>Rewrite tokenizer to improve performance and compliance (<a href="https://redirect.github.com/psf/black/issues/4536">#4536</a>)</li> <li>Fix bug where certain unusual expressions (e.g., lambdas) were not accepted in type parameter bounds and defaults. 
(<a href="https://redirect.github.com/psf/black/issues/4602">#4602</a>)</li> </ul> <h3>Performance</h3> <ul> <li>Avoid using an extra process when running with only one worker (<a href="https://redirect.github.com/psf/black/issues/4734">#4734</a>)</li> </ul> <h3>Integrations</h3> <ul> <li>Fix the version check in the vim file to reject Python 3.8 (<a href="https://redirect.github.com/psf/black/issues/4567">#4567</a>)</li> <li>Enhance GitHub Action <code>psf/black</code> to read Black version from an additional section in pyproject.toml: <code>[project.dependency-groups]</code> (<a href="https://redirect.github.com/psf/black/issues/4606">#4606</a>)</li> <li>Build gallery docker image with python3-slim and reduce image size (<a href="https://redirect.github.com/psf/black/issues/4686">#4686</a>)</li> </ul> <!-- raw HTML omitted --> </blockquote> <p>... (truncated)</p> </details> <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/psf/black/blob/main/CHANGES.md">black's changelog</a>.</em></p> <blockquote> <h2>25.9.0</h2> <h3>Highlights</h3> <ul> <li>Remove support for pre-python 3.7 <code>await/async</code> as soft keywords/variable names (<a href="https://redirect.github.com/psf/black/issues/4676">#4676</a>)</li> </ul> <h3>Stable style</h3> <ul> <li>Fix crash while formatting a long <code>del</code> statement containing tuples (<a href="https://redirect.github.com/psf/black/issues/4628">#4628</a>)</li> <li>Fix crash while formatting expressions using the walrus operator in complex <code>with</code> statements (<a href="https://redirect.github.com/psf/black/issues/4630">#4630</a>)</li> <li>Handle <code># fmt: skip</code> followed by a comment at the end of file (<a href="https://redirect.github.com/psf/black/issues/4635">#4635</a>)</li> <li>Fix crash when a tuple appears in the <code>as</code> clause of a <code>with</code> statement (<a href="https://redirect.github.com/psf/black/issues/4634">#4634</a>)</li> <li>Fix crash when tuple 
is used as a context manager inside a <code>with</code> statement (<a href="https://redirect.github.com/psf/black/issues/4646">#4646</a>)</li> <li>Fix crash when formatting a <code>\</code> followed by a <code>\r</code> followed by a comment (<a href="https://redirect.github.com/psf/black/issues/4663">#4663</a>)</li> <li>Fix crash on a <code>\\r\n</code> (<a href="https://redirect.github.com/psf/black/issues/4673">#4673</a>)</li> <li>Fix crash on <code>await ...</code> (where <code>...</code> is a literal <code>Ellipsis</code>) (<a href="https://redirect.github.com/psf/black/issues/4676">#4676</a>)</li> <li>Fix crash on parenthesized expression inside a type parameter bound (<a href="https://redirect.github.com/psf/black/issues/4684">#4684</a>)</li> <li>Fix crash when using line ranges excluding indented single line decorated items (<a href="https://redirect.github.com/psf/black/issues/4670">#4670</a>)</li> </ul> <h3>Preview style</h3> <ul> <li>Fix a bug where one-liner functions/conditionals marked with <code># fmt: skip</code> would still be formatted (<a href="https://redirect.github.com/psf/black/issues/4552">#4552</a>)</li> <li>Improve <code>multiline_string_handling</code> with ternaries and dictionaries (<a href="https://redirect.github.com/psf/black/issues/4657">#4657</a>)</li> <li>Fix a bug where <code>string_processing</code> would not split f-strings directly after expressions (<a href="https://redirect.github.com/psf/black/issues/4680">#4680</a>)</li> <li>Wrap the <code>in</code> clause of comprehensions across lines if necessary (<a href="https://redirect.github.com/psf/black/issues/4699">#4699</a>)</li> <li>Remove parentheses around multiple exception types in <code>except</code> and <code>except*</code> without <code>as</code>. 
(<a href="https://redirect.github.com/psf/black/issues/4720">#4720</a>)</li> <li>Add <code>\r</code> style newlines to the potential newlines to normalize file newlines both from and to (<a href="https://redirect.github.com/psf/black/issues/4710">#4710</a>)</li> </ul> <h3>Parser</h3> <ul> <li>Rewrite tokenizer to improve performance and compliance (<a href="https://redirect.github.com/psf/black/issues/4536">#4536</a>)</li> <li>Fix bug where certain unusual expressions (e.g., lambdas) were not accepted in type parameter bounds and defaults. (<a href="https://redirect.github.com/psf/black/issues/4602">#4602</a>)</li> </ul> <h3>Performance</h3> <ul> <li>Avoid using an extra process when running with only one worker (<a href="https://redirect.github.com/psf/black/issues/4734">#4734</a>)</li> </ul> <h3>Integrations</h3> <ul> <li>Fix the version check in the vim file to reject Python 3.8 (<a href="https://redirect.github.com/psf/black/issues/4567">#4567</a>)</li> <li>Enhance GitHub Action <code>psf/black</code> to read Black version from an additional section in pyproject.toml: <code>[project.dependency-groups]</code> (<a href="https://redirect.github.com/psf/black/issues/4606">#4606</a>)</li> </ul> <!-- raw HTML omitted --> </blockquote> <p>... (truncated)</p> </details> <details> <summary>Commits</summary> <ul> <li><a href=" | ||
|  | 055a7664f0 | chore(python-deps): bump blobfile from 3.0.0 to 3.1.0 (#3784) Bumps [blobfile](https://github.com/christopher-hesse/blobfile) from 3.0.0 to 3.1.0. <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/blobfile/blobfile/blob/master/CHANGES.md">blobfile's changelog</a>.</em></p> <blockquote> <h2>3.1.0</h2> <ul> <li>Improve <code>bf.join</code></li> <li>Add option to support blind writes</li> <li>Treat <code>EAI_NODATA</code> similarly to <code>EAI_NONAME</code> in DNS retry logic</li> </ul> </blockquote> </details> <details> <summary>Commits</summary> <ul> <li><a href=" | ||
|  | 13518e7562 | chore(python-deps): bump ollama from 0.5.1 to 0.6.0 (#3786) Bumps [ollama](https://github.com/ollama/ollama-python) from 0.5.1 to 0.6.0. <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/ollama/ollama-python/releases">ollama's releases</a>.</em></p> <blockquote> <h2>v0.6.0</h2> <h2>What's Changed</h2> <ul> <li> <p>client: add web search and web crawl capabilities by <a href="https://github.com/ParthSareen"><code>@ParthSareen</code></a> in <a href="https://redirect.github.com/ollama/ollama-python/pull/578">ollama/ollama-python#578</a></p> </li> <li> <p>client: load OLLAMA_API_KEY on init by <a href="https://github.com/ParthSareen"><code>@ParthSareen</code></a> in <a href="https://redirect.github.com/ollama/ollama-python/pull/583">ollama/ollama-python#583</a></p> </li> <li> <p>client/types: update web search and fetch API by <a href="https://github.com/npardal"><code>@npardal</code></a> in <a href="https://redirect.github.com/ollama/ollama-python/pull/584">ollama/ollama-python#584</a></p> </li> <li> <p>examples: add mcp server for web_search web_crawl by <a href="https://github.com/ParthSareen"><code>@ParthSareen</code></a> in <a href="https://redirect.github.com/ollama/ollama-python/pull/585">ollama/ollama-python#585</a></p> </li> <li> <p>examples: gpt oss browser tool by <a href="https://github.com/ParthSareen"><code>@ParthSareen</code></a> in <a href="https://redirect.github.com/ollama/ollama-python/pull/588">ollama/ollama-python#588</a></p> </li> </ul> <h2>New Contributors</h2> <ul> <li><a href="https://github.com/npardal"><code>@npardal</code></a> made their first contribution in <a href="https://redirect.github.com/ollama/ollama-python/pull/584">ollama/ollama-python#584</a></li> </ul> <p><strong>Full Changelog</strong>: <a href="https://github.com/ollama/ollama-python/compare/v0.5.4...v0.6.0">https://github.com/ollama/ollama-python/compare/v0.5.4...v0.6.0</a></p> <h2>v0.5.4</h2> <h2>What's Changed</h2> 
<ul> <li>examples: add gpt-oss browser example by <a href="https://github.com/ParthSareen"><code>@ParthSareen</code></a> in <a href="https://redirect.github.com/ollama/ollama-python/pull/558">ollama/ollama-python#558</a></li> <li>build(deps): bump actions/checkout from 4 to 5 by <a href="https://github.com/dependabot"><code>@dependabot</code></a>[bot] in <a href="https://redirect.github.com/ollama/ollama-python/pull/559">ollama/ollama-python#559</a></li> <li>examples/gpt-oss: fix examples by <a href="https://github.com/ParthSareen"><code>@ParthSareen</code></a> in <a href="https://redirect.github.com/ollama/ollama-python/pull/566">ollama/ollama-python#566</a></li> <li>Fix link for thinking-levels.py in documentation by <a href="https://github.com/btjanaka"><code>@btjanaka</code></a> in <a href="https://redirect.github.com/ollama/ollama-python/pull/567">ollama/ollama-python#567</a></li> <li>examples: fix gpt-oss-tools-stream for adding tool calls by <a href="https://github.com/ParthSareen"><code>@ParthSareen</code></a> in <a href="https://redirect.github.com/ollama/ollama-python/pull/568">ollama/ollama-python#568</a></li> <li>examples: resolve invalid tool usage status code 400 if llm makes a mistake gpt-oss by <a href="https://github.com/MarkWard0110"><code>@MarkWard0110</code></a> in <a href="https://redirect.github.com/ollama/ollama-python/pull/569">ollama/ollama-python#569</a></li> <li>build(deps): bump actions/setup-python from 5 to 6 by <a href="https://github.com/dependabot"><code>@dependabot</code></a>[bot] in <a href="https://redirect.github.com/ollama/ollama-python/pull/571">ollama/ollama-python#571</a></li> <li>feat: add dimensions to embed request by <a href="https://github.com/mxyng"><code>@mxyng</code></a> in <a href="https://redirect.github.com/ollama/ollama-python/pull/574">ollama/ollama-python#574</a></li> </ul> <h2>New Contributors</h2> <ul> <li><a href="https://github.com/btjanaka"><code>@btjanaka</code></a> made their first contribution in <a 
href="https://redirect.github.com/ollama/ollama-python/pull/567">ollama/ollama-python#567</a></li> <li><a href="https://github.com/MarkWard0110"><code>@MarkWard0110</code></a> made their first contribution in <a href="https://redirect.github.com/ollama/ollama-python/pull/569">ollama/ollama-python#569</a></li> </ul> <p><strong>Full Changelog</strong>: <a href="https://github.com/ollama/ollama-python/compare/v0.5.3...v0.5.4">https://github.com/ollama/ollama-python/compare/v0.5.3...v0.5.4</a></p> <h2>v0.5.3</h2> <h2>What's Changed</h2> <ul> <li>add support for 'high'/'medium'/'low' think values by <a href="https://github.com/drifkin"><code>@drifkin</code></a> in <a href="https://redirect.github.com/ollama/ollama-python/pull/553">ollama/ollama-python#553</a></li> </ul> <p><strong>Full Changelog</strong>: <a href="https://github.com/ollama/ollama-python/compare/v0.5.2...v0.5.3">https://github.com/ollama/ollama-python/compare/v0.5.2...v0.5.3</a></p> <h2>v0.5.2</h2> <h2>What's Changed</h2> <ul> <li> <p>types/examples: add tool_name to message and examples by <a href="https://github.com/ParthSareen"><code>@ParthSareen</code></a> in <a href="https://redirect.github.com/ollama/ollama-python/pull/537">ollama/ollama-python#537</a></p> </li> <li> <p>types: add <code>context_length</code> to ProcessResponse by <a href="https://github.com/ParthSareen"><code>@ParthSareen</code></a> in <a href="https://redirect.github.com/ollama/ollama-python/pull/538">ollama/ollama-python#538</a></p> </li> <li> <p>types: relax type for tools by <a href="https://github.com/ParthSareen"><code>@ParthSareen</code></a> in <a href="https://redirect.github.com/ollama/ollama-python/pull/550">ollama/ollama-python#550</a></p> </li> <li> <p>add license metadata to package by <a href="https://github.com/ViViDboarder"><code>@ViViDboarder</code></a> in <a href="https://redirect.github.com/ollama/ollama-python/pull/526">ollama/ollama-python#526</a></p> </li> </ul> <h2>New Contributors</h2> <ul> <li><a 
href="https://github.com/hwittenborn"><code>@hwittenborn</code></a> made their first contribution in <a href="https://redirect.github.com/ollama/ollama-python/pull/525">ollama/ollama-python#525</a></li> <li><a href="https://github.com/ViViDboarder"><code>@ViViDboarder</code></a> made their first contribution in <a href="https://redirect.github.com/ollama/ollama-python/pull/526">ollama/ollama-python#526</a></li> </ul> <!-- raw HTML omitted --> </blockquote> <p>... (truncated)</p> </details> <details> <summary>Commits</summary> <ul> <li><a href=" | ||
|  | dac1d7be1c | chore(python-deps): bump fire from 0.7.0 to 0.7.1 (#3787) Bumps [fire](https://github.com/google/python-fire) from 0.7.0 to 0.7.1. 
<details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/google/python-fire/releases">fire's releases</a>.</em></p> <blockquote> <h2>Python Fire v0.7.1</h2> <h2>What's Changed</h2> <ul> <li>Use Neutral theme for IPython Inspector, supporting newer IPython versions in <a href="https://redirect.github.com/google/python-fire/pull/588">google/python-fire#588</a></li> <li>Call inspectutils.GetClassAttrsDict on component, not None in <a href="https://redirect.github.com/google/python-fire/pull/606">google/python-fire#606</a></li> <li>Move to pyproject.toml, adding wheel support in pypi</li> <li>Use ty in place of pytype</li> <li>Update requirements <a href="https://github.com/dependabot"><code>@dependabot</code></a>[bot]</li> </ul> <p><strong>Full Changelog</strong>: <a href="https://github.com/google/python-fire/compare/v0.7.0...v0.7.1">https://github.com/google/python-fire/compare/v0.7.0...v0.7.1</a></p> </blockquote> </details> <details> <summary>Commits</summary> <ul> <li><a href=" | ||
|  | 2cb1b19efe | chore(python-deps): bump psycopg2-binary from 2.9.10 to 2.9.11 (#3785) Bumps [psycopg2-binary](https://github.com/psycopg/psycopg2) from 2.9.10 to 2.9.11. <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/psycopg/psycopg2/blob/master/NEWS">psycopg2-binary's changelog</a>.</em></p> <blockquote> <h2>Current release</h2> <p>What's new in psycopg 2.9.11 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^</p> <ul> <li>Add support for Python 3.14.</li> <li>Avoid a segfault passing more arguments than placeholders if Python is built with assertions enabled (🎫<code>[#1791](https://github.com/psycopg/psycopg2/issues/1791)</code>).</li> <li><code>~psycopg2.errorcodes</code> map and <code>~psycopg2.errors</code> classes updated to PostgreSQL 18.</li> <li>Drop support for Python 3.8.</li> </ul> <p>What's new in psycopg 2.9.10 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^</p> <ul> <li>Add support for Python 3.13.</li> <li>Receive notifications on commit (🎫<code>[#1728](https://github.com/psycopg/psycopg2/issues/1728)</code>).</li> <li><code>~psycopg2.errorcodes</code> map and <code>~psycopg2.errors</code> classes updated to PostgreSQL 17.</li> <li>Drop support for Python 3.7.</li> </ul> <p>What's new in psycopg 2.9.9 ^^^^^^^^^^^^^^^^^^^^^^^^^^^</p> <ul> <li>Add support for Python 3.12.</li> <li>Drop support for Python 3.6.</li> </ul> <p>What's new in psycopg 2.9.8 ^^^^^^^^^^^^^^^^^^^^^^^^^^^</p> <ul> <li>Wheel package bundled with PostgreSQL 16 libpq in order to add support for recent features, such as <code>sslcertmode</code>.</li> </ul> <p>What's new in psycopg 2.9.7 ^^^^^^^^^^^^^^^^^^^^^^^^^^^</p> <ul> <li>Fix propagation of exceptions raised during module initialization (🎫<code>[#1598](https://github.com/psycopg/psycopg2/issues/1598)</code>).</li> <li>Fix building when pg_config returns an empty string (🎫<code>[#1599](https://github.com/psycopg/psycopg2/issues/1599)</code>).</li> <li>Wheel package bundled with OpenSSL 1.1.1v.</li> </ul> <p>What's new in 
psycopg 2.9.6 ^^^^^^^^^^^^^^^^^^^^^^^^^^^</p> <!-- raw HTML omitted --> </blockquote> <p>... (truncated)</p> </details> <details> <summary>Commits</summary> <ul> <li><a href=" | ||
|  | 7ee0ee7843 | chore!: remove model mgmt from CLI for Hugging Face CLI (#3700) This change removes the `llama model` and `llama download` subcommands from the CLI, replacing them with recommendations to use the Hugging Face CLI instead. Rationale for this change: - The model management functionality was largely duplicating what Hugging Face CLI already provides, leading to unnecessary maintenance overhead (except the download source from Meta?) - Maintaining our own implementation required fixing bugs and keeping up with changes in model repositories and download mechanisms - The Hugging Face CLI is more mature, widely adopted, and better maintained - This allows us to focus on the core Llama Stack functionality rather than reimplementing model management tools Changes made: - Removed all model-related CLI commands and their implementations - Updated documentation to recommend using `huggingface-cli` for model downloads - Removed Meta-specific download logic and statements - Simplified the CLI to focus solely on stack management operations Users should now use: - `huggingface-cli download` for downloading models - `huggingface-cli scan-cache` for listing downloaded models This is a breaking change as it removes previously available CLI commands. Signed-off-by: Sébastien Han <seb@redhat.com> | ||
|  | a055a32ee4 | fix(tests): remove chroma and qdrant from vector io unit tests (#3759) These vector databases are already thoroughly tested in integration tests. Unit tests now focus on sqlite_vec, faiss, and pgvector with mocked dependencies, removing the need for external service dependencies. ## Changes: - Deleted test_qdrant.py unit test file - Removed chroma/qdrant fixtures and parametrization from conftest.py - Fixed SqliteKVStoreConfig import to use correct location - Removed chromadb, qdrant-client, pymilvus, milvus-lite, and weaviate-client from unit test dependencies in pyproject.toml | ||
|  | 92219fd8fb | chore(python-deps): bump pandas from 2.3.1 to 2.3.3 (#3689) Bumps [pandas](https://github.com/pandas-dev/pandas) from 2.3.1 to 2.3.3. <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/pandas-dev/pandas/releases">pandas's releases</a>.</em></p> <blockquote> <h2>Pandas 2.3.3</h2> <p>We are pleased to announce the release of pandas 2.3.3. This release includes some improvements and fixes to the future string data type (preview feature for the upcoming pandas 3.0). We recommend that all users upgrade to this version.</p> <p>See the <a href="https://pandas.pydata.org/pandas-docs/version/2.3/whatsnew/v2.3.3.html">full whatsnew</a> for a list of all the changes. 
Pandas 2.3.3 supports Python 3.9 and higher, and is the first release to support Python 3.14.</p> <p>The release will be available on the conda-forge channel:</p> <pre><code>conda install pandas --channel conda-forge </code></pre> <p>Or via PyPI:</p> <pre><code>python3 -m pip install --upgrade pandas </code></pre> <p>Please report any issues with the release on the <a href="https://github.com/pandas-dev/pandas/issues">pandas issue tracker</a>.</p> <p>Thanks to all the contributors who made this release possible.</p> <h2>Pandas 2.3.2</h2> <p>We are pleased to announce the release of pandas 2.3.2. This release includes some improvements and fixes to the future string data type (preview feature for the upcoming pandas 3.0). We recommend that all users upgrade to this version.</p> <p>See the <a href="https://pandas.pydata.org/pandas-docs/version/2.3/whatsnew/v2.3.2.html">full whatsnew</a> for a list of all the changes. Pandas 2.3.2 supports Python 3.9 and higher.</p> <p>The release will be available on the conda-forge channel:</p> <pre><code>conda install pandas --channel conda-forge </code></pre> <p>Or via PyPI:</p> <pre><code>python3 -m pip install --upgrade pandas </code></pre> <p>Please report any issues with the release on the <a href="https://github.com/pandas-dev/pandas/issues">pandas issue tracker</a>.</p> <p>Thanks to all the contributors who made this release possible.</p> </blockquote> </details> <details> <summary>Commits</summary> <ul> <li><a href=" | ||
|  | 45cf74db33 | chore(python-deps): bump requests from 2.32.4 to 2.32.5 (#3691) Bumps [requests](https://github.com/psf/requests) from 2.32.4 to 2.32.5. <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/psf/requests/releases">requests's releases</a>.</em></p> <blockquote> <h2>v2.32.5</h2> <h2>2.32.5 (2025-08-18)</h2> <p><strong>Bugfixes</strong></p> <ul> <li>The SSLContext caching feature originally introduced in 2.32.0 has created a new class of issues in Requests that have had negative impact across a number of use cases. The Requests team has decided to revert this feature as long term maintenance of it is proving to be unsustainable in its current iteration.</li> </ul> <p><strong>Deprecations</strong></p> <ul> <li>Added support for Python 3.14.</li> <li>Dropped support for Python 3.8 following its end of support.</li> </ul> </blockquote> </details> <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/psf/requests/blob/main/HISTORY.md">requests's changelog</a>.</em></p> <blockquote> <h2>2.32.5 (2025-08-18)</h2> <p><strong>Bugfixes</strong></p> <ul> <li>The SSLContext caching feature originally introduced in 2.32.0 has created a new class of issues in Requests that have had negative impact across a number of use cases. The Requests team has decided to revert this feature as long term maintenance of it is proving to be unsustainable in its current iteration.</li> </ul> <p><strong>Deprecations</strong></p> <ul> <li>Added support for Python 3.14.</li> <li>Dropped support for Python 3.8 following its end of support.</li> </ul> </blockquote> </details> <details> <summary>Commits</summary> <ul> <li><a href=" | ||
|  | 7ec7e0c1ac | chore: Add weaviate client to unit group in pyproject.toml and uv.lock (#3675) # What does this PR do? `uv add "weaviate-client>=4.16.4" --group unit` ## Test Plan Signed-off-by: Francisco Javier Arceo <farceo@redhat.com> | ||
|  | a20e8eac8c | feat: Add OpenAI Conversations API (#3429) # What does this PR do? Initial implementation for `Conversations` and `ConversationItems` using `AuthorizedSqlStore` with endpoints to: - CREATE - UPDATE - GET/RETRIEVE/LIST - DELETE Set `level=LLAMA_STACK_API_V1`. NOTE: This does not currently incorporate changes for Responses, that'll be done in a subsequent PR. Closes https://github.com/llamastack/llama-stack/issues/3235 ## Test Plan - Unit tests - Integration tests Also comparison of [OpenAPI spec for OpenAI API](https://github.com/openai/openai-openapi/tree/manual_spec) ```bash oasdiff breaking --fail-on ERR docs/static/llama-stack-spec.yaml https://raw.githubusercontent.com/openai/openai-openapi/refs/heads/manual_spec/openapi.yaml --strip-prefix-base "/v1/openai/v1" \ --match-path '(^/v1/openai/v1/conversations.*|^/conversations.*)' ``` Note I still have some uncertainty about this, I borrowed this info from @cdoern on https://github.com/llamastack/llama-stack/pull/3514 but need to spend more time to confirm it's working, at the moment it suggests it does. UPDATE on `oasdiff`, I investigated the OpenAI spec further and it looks like currently the spec does not list Conversations, so that analysis is useless. Noting for future reference. --------- Signed-off-by: Francisco Javier Arceo <farceo@redhat.com> | ||
|  | 4487b88ffe | build: Bump version to 0.2.23 | ||
|  | ececc323d3 | build: Bump version to 0.2.22 | ||
|  | 3defdf7d3a | fix: docker failing to start container[pydantic] (#3460) # What does this PR do? Pin to the latest pydantic version, 2.11.9, because an older version is sometimes picked up, which causes the container to fail to start in GitHub Actions. | ||
|  | 8e05c68d15 | chore: remove openai dependency from providers (#3398) # What does this PR do? The openai package is already a dependency of the llama-stack project itself, so let the project dictate which openai version we need and avoid potential breakage with unsatisfiable dependency resolution. Signed-off-by: Sébastien Han <seb@redhat.com> | ||
|  | 438c037b1f | chore(python-deps): bump openai from 1.102.0 to 1.106.1 (#3356) Bumps [openai](https://github.com/openai/openai-python) from 1.102.0 to 1.106.1. <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/openai/openai-python/releases">openai's releases</a>.</em></p> <blockquote> <h2>v1.106.1</h2> <h2>1.106.1 (2025-09-04)</h2> <p>Full Changelog: <a href="https://github.com/openai/openai-python/compare/v1.106.0...v1.106.1">v1.106.0...v1.106.1</a></p> <h3>Chores</h3> <ul> <li><strong>internal:</strong> move mypy configurations to <code>pyproject.toml</code> file (<a href=" | ||
|  | 369083c069 | chore(python-deps): bump locust from 2.39.1 to 2.40.1 (#3358) Bumps [locust](https://github.com/locustio/locust) from 2.39.1 to 2.40.1. <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/locustio/locust/releases">locust's releases</a>.</em></p> <blockquote> <h2>2.40.1</h2> <h2>What's Changed</h2> <ul> <li>Pytest plugin: Delay imports to avoid monkey patching until someone uses the fixtures by <a href="https://github.com/cyberw"><code>@cyberw</code></a> in <a href="https://redirect.github.com/locustio/locust/pull/3204">locustio/locust#3204</a></li> <li>Move pytest plugin to its own directory, to prevent accidental import by <a href="https://github.com/cyberw"><code>@cyberw</code></a> in <a href="https://redirect.github.com/locustio/locust/pull/3205">locustio/locust#3205</a></li> </ul> <p><strong>Full Changelog</strong>: <a href="https://github.com/locustio/locust/compare/2.40.0...2.40.1">https://github.com/locustio/locust/compare/2.40.0...2.40.1</a></p> <h2>2.40.0</h2> <h2>What's Changed</h2> <ul> <li>Refactor FastHttpSession to be more like HttpSession by <a href="https://github.com/cyberw"><code>@cyberw</code></a> in <a href="https://redirect.github.com/locustio/locust/pull/3198">locustio/locust#3198</a></li> <li>Update Dockerfile base to Python 3.13 by <a href="https://github.com/adaamz"><code>@adaamz</code></a> in <a href="https://redirect.github.com/locustio/locust/pull/3193">locustio/locust#3193</a></li> <li>Avoid exception in HttpUser if requests has lost track of the request it made by <a href="https://github.com/cyberw"><code>@cyberw</code></a> in <a href="https://redirect.github.com/locustio/locust/pull/3201">locustio/locust#3201</a></li> <li>Support pytests as locustfiles by <a href="https://github.com/cyberw"><code>@cyberw</code></a> in <a href="https://redirect.github.com/locustio/locust/pull/3200">locustio/locust#3200</a></li> </ul> <h2>New Contributors</h2> <ul> <li><a 
href="https://github.com/adaamz"><code>@adaamz</code></a> made their first contribution in <a href="https://redirect.github.com/locustio/locust/pull/3193">locustio/locust#3193</a></li> </ul> <p><strong>Full Changelog</strong>: <a href="https://github.com/locustio/locust/compare/2.39.1...2.40.0">https://github.com/locustio/locust/compare/2.39.1...2.40.0</a></p> </blockquote> </details> <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/locustio/locust/blob/master/CHANGELOG.md">locust's changelog</a>.</em></p> <blockquote> <h1>Detailed changelog</h1> <p>The most important changes can also be found in <a href="https://docs.locust.io/en/latest/changelog.html">the documentation</a>.</p> </blockquote> </details> <details> <summary>Commits</summary> <ul> <li><a href=" | ||
|  | a844c4f6e1 | chore(python-deps): bump pytest from 8.4.1 to 8.4.2 (#3359) Bumps [pytest](https://github.com/pytest-dev/pytest) from 8.4.1 to 8.4.2. <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/pytest-dev/pytest/releases">pytest's releases</a>.</em></p> <blockquote> <h2>8.4.2</h2> <h1>pytest 8.4.2 (2025-09-03)</h1> <h2>Bug fixes</h2> <ul> <li> <p><a href="https://redirect.github.com/pytest-dev/pytest/issues/13478">#13478</a>: Fixed a crash when using <code>console_output_style</code>{.interpreted-text role="confval"} with <code>times</code> and a module is skipped.</p> </li> <li> <p><a href="https://redirect.github.com/pytest-dev/pytest/issues/13530">#13530</a>: Fixed a crash when using <code>pytest.approx</code>{.interpreted-text role="func"} and <code>decimal.Decimal</code>{.interpreted-text role="class"} instances with the <code>decimal.FloatOperation</code>{.interpreted-text role="class"} trap set.</p> </li> <li> <p><a href="https://redirect.github.com/pytest-dev/pytest/issues/13549">#13549</a>: No longer evaluate type annotations in Python <code>3.14</code> when inspecting function signatures.</p> <p>This prevents crashes during module collection when modules do not explicitly use <code>from __future__ import annotations</code> and import types for annotations within a <code>if TYPE_CHECKING:</code> block.</p> </li> <li> <p><a href="https://redirect.github.com/pytest-dev/pytest/issues/13559">#13559</a>: Added missing [int]{.title-ref} and [float]{.title-ref} variants to the [Literal]{.title-ref} type annotation of the [type]{.title-ref} parameter in <code>pytest.Parser.addini</code>{.interpreted-text role="meth"}.</p> </li> <li> <p><a href="https://redirect.github.com/pytest-dev/pytest/issues/13563">#13563</a>: <code>pytest.approx</code>{.interpreted-text role="func"} now only imports <code>numpy</code> if NumPy is already in <code>sys.modules</code>. 
This fixes unconditional import behavior introduced in [8.4.0]{.title-ref}.</p> </li> </ul> <h2>Improved documentation</h2> <ul> <li><a href="https://redirect.github.com/pytest-dev/pytest/issues/13577">#13577</a>: Clarify that <code>pytest_generate_tests</code> is discovered in test modules/classes; other hooks must be in <code>conftest.py</code> or plugins.</li> </ul> <h2>Contributor-facing changes</h2> <ul> <li><a href="https://redirect.github.com/pytest-dev/pytest/issues/13480">#13480</a>: Self-testing: fixed a few test failures when run with <code>-Wdefault</code> or a similar override.</li> <li><a href="https://redirect.github.com/pytest-dev/pytest/issues/13547">#13547</a>: Self-testing: corrected expected message for <code>test_doctest_unexpected_exception</code> in Python <code>3.14</code>.</li> <li><a href="https://redirect.github.com/pytest-dev/pytest/issues/13684">#13684</a>: Make pytest's own testsuite insensitive to the presence of the <code>CI</code> environment variable -- by <code>ogrisel</code>{.interpreted-text role="user"}.</li> </ul> </blockquote> </details> <details> <summary>Commits</summary> <ul> <li><a href=" | ||
|  | 28696c3f30 | build: Bump version to 0.2.21 | ||
|  | c9268a7a8c | fix: pre-commit failing (#3381) # What does this PR do? Fix failing pre-commit, https://github.com/llamastack/llama-stack/actions/workflows/pre-commit.yml ## Test Plan CI | ||
|  | 5c873d53db | chore(python-deps): bump pymilvus from 2.6.0 to 2.6.1 (#3285) Bumps [pymilvus](https://github.com/milvus-io/pymilvus) from 2.6.0 to 2.6.1. 
<details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/milvus-io/pymilvus/releases">pymilvus's releases</a>.</em></p> <blockquote> <h2>PyMilvus v2.6.1 Release Notes</h2> <h2>What's Changed</h2> <ul> <li>Avoid describe_collection when query by ids by <a href="https://github.com/yhmo"><code>@yhmo</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2930">milvus-io/pymilvus#2930</a></li> <li>bulkImport add objectUrls/token paramster & add example use by <a href="https://github.com/lentitude2tk"><code>@lentitude2tk</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2934">milvus-io/pymilvus#2934</a></li> <li>support stageManager & stageFileManager by <a href="https://github.com/lentitude2tk"><code>@lentitude2tk</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2935">milvus-io/pymilvus#2935</a></li> <li>fix: Fix the existing version fmt by <a href="https://github.com/XuanYang-cn"><code>@XuanYang-cn</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2960">milvus-io/pymilvus#2960</a></li> <li>enhance: Add unixmsec in every RPC call by <a href="https://github.com/XuanYang-cn"><code>@XuanYang-cn</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2961">milvus-io/pymilvus#2961</a></li> <li>enhance: Multiple cherry picks from master branch by <a href="https://github.com/XuanYang-cn"><code>@XuanYang-cn</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2962">milvus-io/pymilvus#2962</a></li> <li>fix: Passing unknown req.is_refresh to wait by <a href="https://github.com/XuanYang-cn"><code>@XuanYang-cn</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2964">milvus-io/pymilvus#2964</a></li> </ul> <p><strong>Full Changelog</strong>: <a href="https://github.com/milvus-io/pymilvus/compare/v2.6.0...v2.6.1">https://github.com/milvus-io/pymilvus/compare/v2.6.0...v2.6.1</a></p> 
</blockquote> </details> <details> <summary>Commits</summary> <ul> <li><a href=" | ||
|  | 4a59961a6c | refactor: remove llama-api-client from pyproject.toml (#3299) # What does this PR do? This PR removes the `llama-api-client` dependency from `pyproject.toml` because it is not used in the Llama Stack codebase. ## Test Plan `./scripts/unit-tests.sh` | ||
|  | 9625ac6d02 | chore(python-deps): bump locust from 2.39.0 to 2.39.1 (#3284) Bumps [locust](https://github.com/locustio/locust) from 2.39.0 to 2.39.1. <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/locustio/locust/releases">locust's releases</a>.</em></p> <blockquote> <h2>2.39.1</h2> <h2>What's Changed</h2> <ul> <li>Avoid broken gevent version for now by <a href="https://github.com/cyberw"><code>@cyberw</code></a> in <a href="https://redirect.github.com/locustio/locust/pull/3196">locustio/locust#3196</a></li> </ul> <h2>New Contributors</h2> <ul> <li><a href="https://github.com/JumboBear"><code>@JumboBear</code></a> made their first contribution in <a href="https://redirect.github.com/locustio/locust/pull/3195">locustio/locust#3195</a></li> </ul> <p><strong>Full Changelog</strong>: <a href="https://github.com/locustio/locust/compare/2.39.0...2.39.1">https://github.com/locustio/locust/compare/2.39.0...2.39.1</a></p> </blockquote> </details> <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/locustio/locust/blob/master/CHANGELOG.md">locust's changelog</a>.</em></p> <blockquote> <h1>Detailed changelog</h1> <p>The most important changes can also be found in <a href="https://docs.locust.io/en/latest/changelog.html">the documentation</a>.</p> </blockquote> </details> <details> <summary>Commits</summary> <ul> <li><a href=" | ||
|  | 3370d8e557 | feat(files, s3, expiration): add expires_after support to S3 files provider (#3283) | ||
|  | 78a78264a7 | build: Bump version to 0.2.20 | ||
|  | 3130ca0a78 | feat: implement keyword, vector and hybrid search inside vector stores for PGVector provider (#3064) # What does this PR do?
This PR implements
`openai/v1/vector_stores/{vector_store_id}/search` for the PGVector
provider. It adds vector similarity search, keyword search, and
hybrid search to `PGVectorIndex`.
Closes #3006 
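To make the idea of hybrid search concrete, here is a minimal sketch that merges a vector-similarity ranking and a keyword ranking using reciprocal rank fusion (RRF). RRF is a generic fusion technique chosen for illustration; the PR description does not say how `PGVectorIndex` combines the two result sets. The function name `rrf_fuse`, the constant `k=60`, and the chunk ids are all hypothetical.

```python
from collections import defaultdict


def rrf_fuse(rankings: list[list[str]], k: int = 60) -> list[tuple[str, float]]:
    """Fuse several ranked id lists with reciprocal rank fusion (illustrative only)."""
    scores: dict[str, float] = defaultdict(float)
    for ranking in rankings:
        # Each list contributes 1 / (k + rank) for every id it contains.
        for rank, chunk_id in enumerate(ranking, start=1):
            scores[chunk_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by id so the output is deterministic.
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


# Hypothetical results from the two individual searches, best match first.
vector_hits = ["c2", "c1", "c3"]
keyword_hits = ["c1", "c4", "c2"]

fused = rrf_fuse([vector_hits, keyword_hits])
fused_ids = [chunk_id for chunk_id, _ in fused]
print(fused_ids)
```

Chunks that appear in both lists accumulate score from each, so they tend to rise above chunks that only one search mode found.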
## Test Plan
Run unit tests:
` ./scripts/unit-tests.sh `
Run integration tests for openai vector stores:
1. Export env vars:
```
export ENABLE_PGVECTOR=true
export PGVECTOR_HOST=localhost
export PGVECTOR_PORT=5432
export PGVECTOR_DB=llamastack
export PGVECTOR_USER=llamastack
export PGVECTOR_PASSWORD=llamastack
```
2. Create DB:
```
psql -h localhost -U postgres -c "CREATE ROLE llamastack LOGIN PASSWORD 'llamastack';"
psql -h localhost -U postgres -c "CREATE DATABASE llamastack OWNER llamastack;"
psql -h localhost -U llamastack -d llamastack -c "CREATE EXTENSION IF NOT EXISTS vector;"
```
3. Install sentence-transformers:
` uv pip install sentence-transformers  `
4. Run:
```
uv run --group test pytest -s -v --stack-config="inference=inline::sentence-transformers,vector_io=remote::pgvector" --embedding-model sentence-transformers/all-MiniLM-L6-v2 tests/integration/vector_io/test_openai_vector_stores.py
```
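Before running the integration tests, it can help to confirm that the environment variables from step 1 are all set. The sketch below is a hypothetical pre-flight helper, not part of Llama Stack: it reads the `PGVECTOR_*` variables and assembles a standard `postgresql://` connection URI. Whether the provider builds its connection this way is an assumption; the helper name `pgvector_dsn` is made up for this example.

```python
import os
from collections.abc import Mapping
from urllib.parse import quote

# Variable names taken from step 1 of the test plan.
REQUIRED = (
    "PGVECTOR_HOST",
    "PGVECTOR_PORT",
    "PGVECTOR_DB",
    "PGVECTOR_USER",
    "PGVECTOR_PASSWORD",
)


def pgvector_dsn(env: Mapping[str, str] = os.environ) -> str:
    """Build a postgresql:// URI from PGVECTOR_* variables (hypothetical helper)."""
    missing = [name for name in REQUIRED if not env.get(name)]
    if missing:
        raise KeyError(f"missing environment variables: {', '.join(missing)}")
    # Percent-encode credentials so special characters do not break the URI.
    user = quote(env["PGVECTOR_USER"], safe="")
    password = quote(env["PGVECTOR_PASSWORD"], safe="")
    host, port, db = env["PGVECTOR_HOST"], env["PGVECTOR_PORT"], env["PGVECTOR_DB"]
    return f"postgresql://{user}:{password}@{host}:{port}/{db}"


# Same values as the exports in step 1, passed explicitly for the example.
example_env = {
    "PGVECTOR_HOST": "localhost",
    "PGVECTOR_PORT": "5432",
    "PGVECTOR_DB": "llamastack",
    "PGVECTOR_USER": "llamastack",
    "PGVECTOR_PASSWORD": "llamastack",
}
dsn = pgvector_dsn(example_env)
print(dsn)
```

The resulting URI can be passed to `psql` directly to check connectivity by hand.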
Inspect PGVector vector stores (optional):
```
psql llamastack                                                                                                         
psql (14.18 (Homebrew))
Type "help" for help.
llamastack=# \z
                                                    Access privileges
 Schema |                         Name                         | Type  | Access privileges | Column privileges | Policies 
--------+------------------------------------------------------+-------+-------------------+-------------------+----------
 public | llamastack_kvstore                                   | table |                   |                   | 
 public | metadata_store                                       | table |                   |                   | 
 public | vector_store_pgvector_main                           | table |                   |                   | 
 public | vector_store_vs_1dfbc061_1f4d_4497_9165_ecba2622ba3a | table |                   |                   | 
 public | vector_store_vs_2085a9fb_1822_4e42_a277_c6a685843fa7 | table |                   |                   | 
 public | vector_store_vs_2b3dae46_38be_462a_afd6_37ee5fe661b1 | table |                   |                   | 
 public | vector_store_vs_2f438de6_f606_4561_9d50_ef9160eb9060 | table |                   |                   | 
 public | vector_store_vs_3eeca564_2580_4c68_bfea_83dc57e31214 | table |                   |                   | 
 public | vector_store_vs_53942163_05f3_40e0_83c0_0997c64613da | table |                   |                   | 
 public | vector_store_vs_545bac75_8950_4ff1_b084_e221192d4709 | table |                   |                   | 
 public | vector_store_vs_688a37d8_35b2_4298_a035_bfedf5b21f86 | table |                   |                   | 
 public | vector_store_vs_70624d9a_f6ac_4c42_b8ab_0649473c6600 | table |                   |                   | 
 public | vector_store_vs_73fc1dd2_e942_4972_afb1_1e177b591ac2 | table |                   |                   | 
 public | vector_store_vs_9d464949_d51f_49db_9f87_e033b8b84ac9 | table |                   |                   | 
 public | vector_store_vs_a1e4d724_5162_4d6d_a6c0_bdafaf6b76ec | table |                   |                   | 
 public | vector_store_vs_a328fb1b_1a21_480f_9624_ffaa60fb6672 | table |                   |                   | 
 public | vector_store_vs_a8981bf0_2e66_4445_a267_a8fff442db53 | table |                   |                   | 
 public | vector_store_vs_ccd4b6a4_1efd_4984_ad03_e7ff8eadb296 | table |                   |                   | 
 public | vector_store_vs_cd6420a4_a1fc_4cec_948c_1413a26281c9 | table |                   |                   | 
 public | vector_store_vs_cd709284_e5cf_4a88_aba5_dc76a35364bd | table |                   |                   | 
 public | vector_store_vs_d7a4548e_fbc1_44d7_b2ec_b664417f2a46 | table |                   |                   | 
 public | vector_store_vs_e7f73231_414c_4523_886c_d1174eee836e | table |                   |                   | 
 public | vector_store_vs_ffd53588_819f_47e8_bb9d_954af6f7833d | table |                   |                   | 
(23 rows)
llamastack=# 
```
Co-authored-by: Francisco Arceo <arceofrancisco@gmail.com> | ||
|  | 963305c84d | build: Bump version to 0.2.19 | ||
|  | f520e244d9 | feat: Add S3 Files Provider (#3202) Implements a complete S3-based file storage provider for Llama Stack
with:
    
    Core Implementation:
    - S3FilesImpl class with full OpenAI Files API compatibility
    - Support for file upload, download, listing, deletion operations
    - Sqlite-based metadata storage for fast queries and API compliance
    - Configurable S3 endpoints (AWS, MinIO, LocalStack support)
    
    Key Features:
    - Automatic S3 bucket creation and management
    - Metadata persistence
    - Proper error handling for S3 connectivity and permissions
    
    Dependencies:
    - Adds boto3 for AWS S3 integration
    - Adds moto[s3] for testing infrastructure
    
    Testing:
    
Unit: `./scripts/unit-tests.sh tests/unit/files tests/unit/providers/files`
    
    Integration:
    
Start MinIO: `podman run --rm -it -p 9000:9000 minio/minio server /data`
    
Start stack w/ S3 provider: `S3_ENDPOINT_URL=http://localhost:9000 AWS_ACCESS_KEY_ID=minioadmin AWS_SECRET_ACCESS_KEY=minioadmin S3_BUCKET_NAME=llama-stack-files uv run llama stack build --image-type venv --providers files=remote::s3 --run`
    
Run integration tests: `./scripts/integration-tests.sh --stack-config http://localhost:8321 --provider ollama --test-subdirs files` | ||
|  | bd1a794add | chore(python-deps): bump llama-api-client from 0.1.2 to 0.2.0 (#3173) Bumps [llama-api-client](https://github.com/meta-llama/llama-api-python) from 0.1.2 to 0.2.0. <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/meta-llama/llama-api-python/releases">llama-api-client's releases</a>.</em></p> <blockquote> <h2>v0.2.0</h2> <h2>0.2.0 (2025-08-07)</h2> <p>Full Changelog: <a href="https://github.com/meta-llama/llama-api-python/compare/v0.1.2...v0.2.0">v0.1.2...v0.2.0</a></p> <h3>Features</h3> <ul> <li>clean up environment call outs (<a href=" | ||
|  | bf3b201d61 | chore(python-deps): bump chromadb from 1.0.16 to 1.0.20 (#3217) Bumps [chromadb](https://github.com/chroma-core/chroma) from 1.0.16 to 1.0.20. <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/chroma-core/chroma/releases">chromadb's releases</a>.</em></p> <blockquote> <h2>1.0.20</h2> <p>Version: <code>1.0.20</code> Git ref: <code>refs/tags/1.0.20</code> Build Date: <code>2025-08-18T17:04</code> PIP Package: <code>chroma-1.0.20.tar.gz</code> Github Container Registry Image: <code>:1.0.20</code> DockerHub Image: <code>:1.0.20</code></p> <h2>What's Changed</h2> <ul> <li>[RELEASE] 1.0.20 by <a href="https://github.com/itaismith"><code>@itaismith</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5303">chroma-core/chroma#5303</a></li> </ul> <p><strong>Full Changelog</strong>: <a href="https://github.com/chroma-core/chroma/compare/1.0.19...1.0.20">https://github.com/chroma-core/chroma/compare/1.0.19...1.0.20</a></p> <h2>1.0.18</h2> <p>Version: <code>1.0.18</code> Git ref: <code>refs/tags/1.0.18</code> Build Date: <code>2025-08-18T08:09</code> PIP Package: <code>chroma-1.0.18.tar.gz</code> Github Container Registry Image: <code>:1.0.18</code> DockerHub Image: <code>:1.0.18</code></p> <h2>What's Changed</h2> <ul> <li>[CHORE]: Added short descriptions to CLI commands by <a href="https://github.com/tazarov"><code>@tazarov</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5217">chroma-core/chroma#5217</a></li> <li>[ENH] Use AVX in distance calculations by <a href="https://github.com/jairad26"><code>@jairad26</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5258">chroma-core/chroma#5258</a></li> <li>[ENH] Auto-set tenant, scoped database in python CloudClient by <a href="https://github.com/jairad26"><code>@jairad26</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5026">chroma-core/chroma#5026</a></li> <li>[PERF]: Modify 
get_range to return an iterator by <a href="https://github.com/sanketkedia"><code>@sanketkedia</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5256">chroma-core/chroma#5256</a></li> <li>[BUG] Mark dirty on rollback of cursor to guarantee compaction picks it up. by <a href="https://github.com/rescrv"><code>@rescrv</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5265">chroma-core/chroma#5265</a></li> <li>[ENH]: add metric for component queue depth & change dispatcher queue depth metric buckets by <a href="https://github.com/codetheweb"><code>@codetheweb</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5261">chroma-core/chroma#5261</a></li> <li>[ENH]: add garbage collection CLI for manual garbage collection by <a href="https://github.com/codetheweb"><code>@codetheweb</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5250">chroma-core/chroma#5250</a></li> <li>[DOC] Clean up DEVELOP.md by <a href="https://github.com/kylediaz"><code>@kylediaz</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5270">chroma-core/chroma#5270</a></li> <li>[ENH]: Further optimize query on getCollections when databases pkey is fully specified by <a href="https://github.com/tanujnay112"><code>@tanujnay112</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5268">chroma-core/chroma#5268</a></li> <li>[ENH] Update Rust to allow build with AVX when flag is set by <a href="https://github.com/jairad26"><code>@jairad26</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5269">chroma-core/chroma#5269</a></li> <li>[ENH]: Fix test_add flake by <a href="https://github.com/sanketkedia"><code>@sanketkedia</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5272">chroma-core/chroma#5272</a></li> <li>[BUG]: Revert "[ENH]: Further optimize query on getCollections when databases pkey is fully specified (<a 
href="https://redirect.github.com/chroma-core/chroma/issues/5268">#5268</a>)" by <a href="https://github.com/tanujnay112"><code>@tanujnay112</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5273">chroma-core/chroma#5273</a></li> <li>[BLD] Add maturin to dev dependencies by <a href="https://github.com/kylediaz"><code>@kylediaz</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5271">chroma-core/chroma#5271</a></li> <li>[ENH]: Optimize GetCollections and remove usage of raw gorm by <a href="https://github.com/tanujnay112"><code>@tanujnay112</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5274">chroma-core/chroma#5274</a></li> <li>[ENH]: add config param to garbage collector to control how many collections are fetched from SysDb by <a href="https://github.com/codetheweb"><code>@codetheweb</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5275">chroma-core/chroma#5275</a></li> <li>[ENH] Reject version files without paths. by <a href="https://github.com/rescrv"><code>@rescrv</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5267">chroma-core/chroma#5267</a></li> <li>[ENH] Enable getting a collection by CRN by <a href="https://github.com/drewkim"><code>@drewkim</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5244">chroma-core/chroma#5244</a></li> <li>[BUG] CompactionError did not proxy should_trace_error by <a href="https://github.com/rescrv"><code>@rescrv</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5282">chroma-core/chroma#5282</a></li> <li>[BUG] Resolve deadlock in system crate? by <a href="https://github.com/rescrv"><code>@rescrv</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5283">chroma-core/chroma#5283</a></li> <li>[ENH] Complete the NAC metrics for the write half. 
by <a href="https://github.com/rescrv"><code>@rescrv</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5278">chroma-core/chroma#5278</a></li> <li>[BUG]: fix missing node in constructed version graph for garbage collection by <a href="https://github.com/codetheweb"><code>@codetheweb</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5284">chroma-core/chroma#5284</a></li> <li>[BUG] Fix test flake from 5283. by <a href="https://github.com/rescrv"><code>@rescrv</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5287">chroma-core/chroma#5287</a></li> <li>[BUG]: Don't GC hnsw if it is empty by <a href="https://github.com/sanketkedia"><code>@sanketkedia</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5295">chroma-core/chroma#5295</a></li> <li>[ENH] Sync before flushing by <a href="https://github.com/HammadB"><code>@HammadB</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5296">chroma-core/chroma#5296</a></li> <li>[DOC] update quota limits by <a href="https://github.com/philipithomas"><code>@philipithomas</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5297">chroma-core/chroma#5297</a></li> <li>[BUG] Fix CLI copy offset by <a href="https://github.com/itaismith"><code>@itaismith</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5288">chroma-core/chroma#5288</a></li> <li>[ENH] Add support for default space in create coll config by <a href="https://github.com/jairad26"><code>@jairad26</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5293">chroma-core/chroma#5293</a></li> </ul> <!-- raw HTML omitted --> </blockquote> <p>... (truncated)</p> </details> <details> <summary>Commits</summary> <ul> <li><a href=" | ||
|  | 09bee51d6b | chore(python-deps): bump locust from 2.38.0 to 2.39.0 (#3221) Bumps [locust](https://github.com/locustio/locust) from 2.38.0 to 2.39.0. <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/locustio/locust/releases">locust's releases</a>.</em></p> <blockquote> <h2>2.39.0</h2> <h2>What's Changed</h2> <ul> <li>Add MilvusUser and example by <a href="https://github.com/zhuwenxing"><code>@zhuwenxing</code></a> in <a href="https://redirect.github.com/locustio/locust/pull/3168">locustio/locust#3168</a></li> <li>Add SocketIOUser by <a href="https://github.com/cyberw"><code>@cyberw</code></a> in <a href="https://redirect.github.com/locustio/locust/pull/3189">locustio/locust#3189</a></li> </ul> <h2>New Contributors</h2> <ul> <li><a href="https://github.com/zhuwenxing"><code>@zhuwenxing</code></a> made their first contribution in <a href="https://redirect.github.com/locustio/locust/pull/3168">locustio/locust#3168</a></li> </ul> <p><strong>Full Changelog</strong>: <a href="https://github.com/locustio/locust/compare/2.38.1...2.39.0">https://github.com/locustio/locust/compare/2.38.1...2.39.0</a></p> <h2>2.38.1</h2> <h2>What's Changed</h2> <ul> <li>Fix test flakyness and update error message by <a href="https://github.com/amadeuppereira"><code>@amadeuppereira</code></a> in <a href="https://redirect.github.com/locustio/locust/pull/3187">locustio/locust#3187</a></li> <li>FastHttpUser: Dont send zstd in Accept-Encoding header by <a href="https://github.com/cyberw"><code>@cyberw</code></a> in <a href="https://redirect.github.com/locustio/locust/pull/3188">locustio/locust#3188</a></li> </ul> <p><strong>Full Changelog</strong>: <a href="https://github.com/locustio/locust/compare/2.38.0...2.38.1">https://github.com/locustio/locust/compare/2.38.0...2.38.1</a></p> </blockquote> </details> <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/locustio/locust/blob/master/CHANGELOG.md">locust's 
changelog</a>.</em></p> <blockquote> <h1>Detailed changelog</h1> <p>The most important changes can also be found in <a href="https://docs.locust.io/en/latest/changelog.html">the documentation</a>.</p> </blockquote> </details> <details> <summary>Commits</summary> <ul> <li><a href=" | ||
|  | eff97f122b | chore(python-deps): bump weaviate-client from 4.16.5 to 4.16.9 (#3219) Bumps [weaviate-client](https://github.com/weaviate/weaviate-python-client) from 4.16.5 to 4.16.9. <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/weaviate/weaviate-python-client/releases">weaviate-client's releases</a>.</em></p> <blockquote> <h2>v4.16.9</h2> <h2>What's Changed</h2> <ul> <li>Deprecate broken method by <a href="https://github.com/dirkkul"><code>@dirkkul</code></a> in <a href="https://redirect.github.com/weaviate/weaviate-python-client/pull/1795">weaviate/weaviate-python-client#1795</a></li> <li>Improve user create docstring by <a href="https://github.com/dirkkul"><code>@dirkkul</code></a> in <a href="https://redirect.github.com/weaviate/weaviate-python-client/pull/1796">weaviate/weaviate-python-client#1796</a></li> <li>Fixup dependencies for package test by <a href="https://github.com/dirkkul"><code>@dirkkul</code></a> in <a href="https://redirect.github.com/weaviate/weaviate-python-client/pull/1791">weaviate/weaviate-python-client#1791</a></li> </ul> <p><strong>Full Changelog</strong>: <a href="https://github.com/weaviate/weaviate-python-client/compare/v4.16.8...v4.16.9">https://github.com/weaviate/weaviate-python-client/compare/v4.16.8...v4.16.9</a></p> <h2>v4.16.8</h2> <h2>What's Changed</h2> <ul> <li>Add backup list endpoint by <a href="https://github.com/dirkkul"><code>@dirkkul</code></a> in <a href="https://redirect.github.com/weaviate/weaviate-python-client/pull/1785">weaviate/weaviate-python-client#1785</a></li> <li>Attempt further fix of protobuf runtime stub incompatibilities by <a href="https://github.com/tsmith023"><code>@tsmith023</code></a> in <a href="https://redirect.github.com/weaviate/weaviate-python-client/pull/1788">weaviate/weaviate-python-client#1788</a></li> </ul> <p><strong>Full Changelog</strong>: <a 
href="https://github.com/weaviate/weaviate-python-client/compare/v4.16.7...v4.16.8">https://github.com/weaviate/weaviate-python-client/compare/v4.16.7...v4.16.8</a></p> <h2>v4.16.6</h2> <h2>What's Changed</h2> <ul> <li>rq: Add bits to the update method by <a href="https://github.com/rlmanrique"><code>@rlmanrique</code></a> in <a href="https://redirect.github.com/weaviate/weaviate-python-client/pull/1766">weaviate/weaviate-python-client#1766</a></li> <li>Deprecate contextionar, add model2vec and dimension parameter for transformers by <a href="https://github.com/dirkkul"><code>@dirkkul</code></a> in <a href="https://redirect.github.com/weaviate/weaviate-python-client/pull/1773">weaviate/weaviate-python-client#1773</a></li> </ul> <p><strong>Full Changelog</strong>: <a href="https://github.com/weaviate/weaviate-python-client/compare/v4.16.5...v4.16.6">https://github.com/weaviate/weaviate-python-client/compare/v4.16.5...v4.16.6</a></p> </blockquote> </details> <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/weaviate/weaviate-python-client/blob/main/docs/changelog.rst">weaviate-client's changelog</a>.</em></p> <blockquote> <h2>Version 4.16.9</h2> <p>This patch version includes: - Explicitly depend on protobuf package</p> <h2>Version 4.16.8</h2> <p>This patch version includes: - Further attempted fixes for <code>protobuf</code> compatability issues - Introduction of the <code>backups.list()</code> method</p> <h2>Version 4.16.7</h2> <p>This patch version includes: - Fixes compatability issues between the built gRPC stubs and differing protobuf versions depending on the version of <code>grpcio</code> used to build the stubs - Add <code>text2vec-model2vec</code> module to <code>Configure.NamedVectors</code> - Deprecated <code>min_occurrences</code> in <code>Metrics.text</code> in favour of <code>limit</code></p> <h2>Version 4.16.6</h2> <p>This patch version includes: - Add <code>dimensions</code> property to 
<code>text2vec-transformers</code> vectorizers in <code>Configure.Vectors</code> - Add <code>text2vec-model2vec</code> vectorizer in <code>Configure.Vectors</code> - Deprecate <code>text2vec-contextionary</code> vectorizer</p> </blockquote> </details> <details> <summary>Commits</summary> <ul> <li><a href=" | ||
|  | 7f0b2a8764 | build: Bump version to 0.2.18 | ||
|  | 8cc4925f7d | chore: Enable keyword search for Milvus inline (#3073) # What does this PR do? With https://github.com/milvus-io/milvus-lite/pull/294, Milvus Lite supports keyword search using BM25. When keyword search was introduced, it was explicitly disabled for inline Milvus. This PR removes that check and enables `inline::milvus` for tests. ## Test Plan Run llama stack with `inline::milvus` enabled: ``` pytest tests/integration/vector_io/test_openai_vector_stores.py::test_openai_vector_store_search_modes --stack-config=http://localhost:8321 --embedding-model=all-MiniLM-L6-v2 -v ``` ``` INFO 2025-08-07 17:06:20,932 tests.integration.conftest:64 tests: Setting DISABLE_CODE_SANDBOX=1 for macOS =========================================================================================== test session starts ============================================================================================ platform darwin -- Python 3.12.11, pytest-7.4.4, pluggy-1.5.0 -- /Users/vnarsing/miniconda3/envs/stack-client/bin/python cachedir: .pytest_cache metadata: {'Python': '3.12.11', 'Platform': 'macOS-14.7.6-arm64-arm-64bit', 'Packages': {'pytest': '7.4.4', 'pluggy': '1.5.0'}, 'Plugins': {'asyncio': '0.23.8', 'cov': '6.0.0', 'timeout': '2.2.0', 'socket': '0.7.0', 'html': '3.1.1', 'langsmith': '0.3.39', 'anyio': '4.8.0', 'metadata': '3.0.0'}} rootdir: /Users/vnarsing/go/src/github/meta-llama/llama-stack configfile: pyproject.toml plugins: asyncio-0.23.8, cov-6.0.0, timeout-2.2.0, socket-0.7.0, html-3.1.1, langsmith-0.3.39, anyio-4.8.0, metadata-3.0.0 asyncio: mode=Mode.AUTO collected 3 items tests/integration/vector_io/test_openai_vector_stores.py::test_openai_vector_store_search_modes[None-None-all-MiniLM-L6-v2-None-384-vector] PASSED [ 33%] 
tests/integration/vector_io/test_openai_vector_stores.py::test_openai_vector_store_search_modes[None-None-all-MiniLM-L6-v2-None-384-keyword] PASSED [ 66%] tests/integration/vector_io/test_openai_vector_stores.py::test_openai_vector_store_search_modes[None-None-all-MiniLM-L6-v2-None-384-hybrid] PASSED [100%] ============================================================================================ 3 passed in 4.75s ============================================================================================= ``` Signed-off-by: Varsha Prasad Narsing <varshaprasad96@gmail.com> Co-authored-by: Francisco Arceo <arceofrancisco@gmail.com> | ||
|  | 27d6becfd0 | fix(misc): pin openai dependency to < 1.100.0 (#3192) This OpenAI client release | ||
|  | 2114214fe3 | chore(python-deps): bump huggingface-hub from 0.34.3 to 0.34.4 (#3084) Bumps [huggingface-hub](https://github.com/huggingface/huggingface_hub) from 0.34.3 to 0.34.4. <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/huggingface/huggingface_hub/releases">huggingface-hub's releases</a>.</em></p> <blockquote> <h2>[v0.34.4] Support Image to Video inference + QoL in jobs API, auth and utilities</h2> <p>Biggest update is the support of Image-To-Video task with inference provider Fal AI</p> <ul> <li>[Inference] Support image to video task <a href="https://redirect.github.com/huggingface/huggingface_hub/issues/3289">#3289</a> by <a href="https://github.com/hanouticelina"><code>@hanouticelina</code></a></li> </ul> <pre lang="py"><code>>>> from huggingface_hub import InferenceClient >>> client = InferenceClient() >>> video = client.image_to_video("cat.jpg", model="Wan-AI/Wan2.2-I2V-A14B", prompt="turn the cat into a tiger") >>> with open("tiger.mp4", "wb") as f: ... 
f.write(video) </code></pre> <p>And some quality of life improvements:</p> <ul> <li>Add type to job owner <a href="https://redirect.github.com/huggingface/huggingface_hub/issues/3291">#3291</a> by <a href="https://github.com/drbh"><code>@drbh</code></a></li> <li>Include HF_HUB_DISABLE_XET in the environment dump <a href="https://redirect.github.com/huggingface/huggingface_hub/issues/3290">#3290</a> by <a href="https://github.com/hanouticelina"><code>@hanouticelina</code></a></li> <li>Whoami: custom message only on unauthorized <a href="https://redirect.github.com/huggingface/huggingface_hub/issues/3288">#3288</a> by <a href="https://github.com/Wauplin"><code>@Wauplin</code></a></li> <li>Add validation warnings for repository limits in upload_large_folder <a href="https://redirect.github.com/huggingface/huggingface_hub/issues/3280">#3280</a> by <a href="https://github.com/davanstrien"><code>@davanstrien</code></a></li> <li>Add timeout info to Jobs guide docs <a href="https://redirect.github.com/huggingface/huggingface_hub/issues/3281">#3281</a> by <a href="https://github.com/davanstrien"><code>@davanstrien</code></a></li> <li>[Jobs] Use current or stored token in a Job secrets <a href="https://redirect.github.com/huggingface/huggingface_hub/issues/3272">#3272</a> by <a href="https://github.com/lhoestq"><code>@lhoestq</code></a></li> <li>Fix bash history expansion in hf jobs example <a href="https://redirect.github.com/huggingface/huggingface_hub/issues/3277">#3277</a> by <a href="https://github.com/nyuuzyou"><code>@nyuuzyou</code></a></li> </ul> <p><strong>Full Changelog</strong>: <a href="https://github.com/huggingface/huggingface_hub/compare/v0.34.3...v0.34.4">https://github.com/huggingface/huggingface_hub/compare/v0.34.3...v0.34.4</a></p> </blockquote> </details> <details> <summary>Commits</summary> <ul> <li><a href=" | ||
|  | a275282685 | chore(python-deps): bump pymilvus from 2.5.14 to 2.6.0 (#3086) Bumps [pymilvus](https://github.com/milvus-io/pymilvus) from 2.5.14 to 2.6.0. <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/milvus-io/pymilvus/releases">pymilvus's releases</a>.</em></p> <blockquote> <h2>PyMilvus v2.6.0 Release Notes</h2> <h2>New Features</h2> <ol> <li>Add APIs in MilvusClient</li> </ol> <ul> <li>enhance: add describe and alter database in MilvusClient by <a href="https://github.com/smellthemoon"><code>@smellthemoon</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2433">milvus-io/pymilvus#2433</a></li> <li>enhance: support milvus-client iterator by <a href="https://github.com/MrPresent-Han"><code>@MrPresent-Han</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2461">milvus-io/pymilvus#2461</a></li> <li>enhance: Enable resource group api in milvus client by <a href="https://github.com/weiliu1031"><code>@weiliu1031</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2513">milvus-io/pymilvus#2513</a></li> <li>enhance: add release_collection, drop_index, create_partition, drop_partition, load_partition and release_partition by <a href="https://github.com/brcarry"><code>@brcarry</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2525">milvus-io/pymilvus#2525</a></li> <li>enhance: enable describe_replica api in milvus client by <a href="https://github.com/weiliu1031"><code>@weiliu1031</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2541">milvus-io/pymilvus#2541</a></li> <li>enhance: support recalls for milvus_client by <a href="https://github.com/chasingegg"><code>@chasingegg</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2552">milvus-io/pymilvus#2552</a></li> <li>enhance: add use_database by <a href="https://github.com/czs007"><code>@czs007</code></a> in <a 
href="https://redirect.github.com/milvus-io/pymilvus/pull/2491">milvus-io/pymilvus#2491</a></li> </ul> <ol start="2"> <li>Add AsyncMilvusClient</li> </ol> <ul> <li>[FEAT] Asyncio support by <a href="https://github.com/brcarry"><code>@brcarry</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2411">milvus-io/pymilvus#2411</a></li> <li>Add async DDL funcs & DDL examples by <a href="https://github.com/Shawnzheng011019"><code>@Shawnzheng011019</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2852">milvus-io/pymilvus#2852</a></li> </ul> <ol start="3"> <li>Other features</li> </ol> <ul> <li>enhance: support Int8Vector by <a href="https://github.com/cydrain"><code>@cydrain</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2611">milvus-io/pymilvus#2611</a></li> <li>feat: support recalls field in SearchResult by <a href="https://github.com/chasingegg"><code>@chasingegg</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2390">milvus-io/pymilvus#2390</a></li> <li>enhance: Support Python3.13 and upgrade grpcio range by <a href="https://github.com/XuanYang-cn"><code>@XuanYang-cn</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2684">milvus-io/pymilvus#2684</a></li> <li>enhance: support run analyzer return detail token by <a href="https://github.com/aoiasd"><code>@aoiasd</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2679">milvus-io/pymilvus#2679</a></li> <li>enhance: Add force_drop parameter to drop_role method for role deletion by <a href="https://github.com/SimFG"><code>@SimFG</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2705">milvus-io/pymilvus#2705</a></li> <li>enhance: add property func for AnalyzeToken by <a href="https://github.com/aoiasd"><code>@aoiasd</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2704">milvus-io/pymilvus#2704</a></li> <li>enhance: grant/revoke 
v2 optional db and collection params by <a href="https://github.com/shaoting-huang"><code>@shaoting-huang</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2386">milvus-io/pymilvus#2386</a></li> <li>extend unlimted offset for query iterator(<a href="https://redirect.github.com/milvus-io/pymilvus/issues/2418">#2418</a>) by <a href="https://github.com/MrPresent-Han"><code>@MrPresent-Han</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2419">milvus-io/pymilvus#2419</a></li> <li>enhance: alterindex & altercollection supports altering properties by <a href="https://github.com/JsDove"><code>@JsDove</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2406">milvus-io/pymilvus#2406</a></li> <li>enhance: alterdatabase support delete property by <a href="https://github.com/JsDove"><code>@JsDove</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2435">milvus-io/pymilvus#2435</a></li> <li>enhance: support hints param by <a href="https://github.com/chasingegg"><code>@chasingegg</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2408">milvus-io/pymilvus#2408</a></li> <li>enhance: create database support properties by <a href="https://github.com/JsDove"><code>@JsDove</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2448">milvus-io/pymilvus#2448</a></li> <li>enhance: Add <code>db_name</code> parameter at <code>bulk_import</code> by <a href="https://github.com/counter2015"><code>@counter2015</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2446">milvus-io/pymilvus#2446</a></li> <li>enhance: add search iterator v2 by <a href="https://github.com/PwzXxm"><code>@PwzXxm</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2395">milvus-io/pymilvus#2395</a></li> <li>enhance: simplify the structure of search_params by <a href="https://github.com/smellthemoon"><code>@smellthemoon</code></a> in <a 
href="https://redirect.github.com/milvus-io/pymilvus/pull/2507">milvus-io/pymilvus#2507</a></li> <li>enhance: Remove long deprecated Milvus class by <a href="https://github.com/XuanYang-cn"><code>@XuanYang-cn</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2544">milvus-io/pymilvus#2544</a></li> <li>enhance: Use new model pkg by <a href="https://github.com/junjiejiangjjj"><code>@junjiejiangjjj</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2595">milvus-io/pymilvus#2595</a></li> <li>enhance: Add schema update time verification to insert and upsert to use cache by <a href="https://github.com/JsDove"><code>@JsDove</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2551">milvus-io/pymilvus#2551</a></li> <li>enhance: describecollection output add created_timestamp by <a href="https://github.com/JsDove"><code>@JsDove</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2618">milvus-io/pymilvus#2618</a></li> <li>feat: add external filter func for search iterator v2 by <a href="https://github.com/PwzXxm"><code>@PwzXxm</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2639">milvus-io/pymilvus#2639</a></li> <li>enhance: support run analyzer by <a href="https://github.com/aoiasd"><code>@aoiasd</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2622">milvus-io/pymilvus#2622</a></li> <li>weighted reranker to allow skip score normalization by <a href="https://github.com/zhengbuqian"><code>@zhengbuqian</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2708">milvus-io/pymilvus#2708</a></li> <li>enhance: Support AddCollectionField API by <a href="https://github.com/congqixia"><code>@congqixia</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2722">milvus-io/pymilvus#2722</a></li> <li>Add 1-Way and 2-Way TLS Support to Bulk Import Functions by <a 
href="https://github.com/abd-770"><code>@abd-770</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2672">milvus-io/pymilvus#2672</a></li> <li>enhance: Use SearchResult in MilvusClient by <a href="https://github.com/XuanYang-cn"><code>@XuanYang-cn</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2735">milvus-io/pymilvus#2735</a></li> <li>Support rerank by <a href="https://github.com/junjiejiangjjj"><code>@junjiejiangjjj</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2729">milvus-io/pymilvus#2729</a></li> <li>feat: suppoprt multi analyzer params by <a href="https://github.com/aoiasd"><code>@aoiasd</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2747">milvus-io/pymilvus#2747</a></li> <li>Add funciton checker by <a href="https://github.com/junjiejiangjjj"><code>@junjiejiangjjj</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2760">milvus-io/pymilvus#2760</a></li> <li>enhance: Support run analyzer by collection and field by <a href="https://github.com/aoiasd"><code>@aoiasd</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2822">milvus-io/pymilvus#2822</a></li> <li>feat: support load collection/partition with priority(<a href="https://redirect.github.com/milvus-io/pymilvus/issues/2835">#2835</a>) by <a href="https://github.com/MrPresent-Han"><code>@MrPresent-Han</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2836">milvus-io/pymilvus#2836</a></li> <li>enhance: optimize perf for large topk(<a href="https://redirect.github.com/milvus-io/pymilvus/issues/2848">#2848</a>) by <a href="https://github.com/MrPresent-Han"><code>@MrPresent-Han</code></a> in <a href="https://redirect.github.com/milvus-io/pymilvus/pull/2849">milvus-io/pymilvus#2849</a></li> <li>enhance: Add usage guide to manage MilvusClient by <a href="https://github.com/XuanYang-cn"><code>@XuanYang-cn</code></a> in <a 
href="https://redirect.github.com/milvus-io/pymilvus/pull/2907">milvus-io/pymilvus#2907</a></li> </ul> <!-- raw HTML omitted --> </blockquote> <p>... (truncated)</p> </details> <details> <summary>Commits</summary> <ul> <li><a href=" | ||
|  | d6ae54723d | chore: setup for performance benchmarking (#3096) # What does this PR do? 1. Added a simple mock OpenAI-compatible server that serves chat/completion 2. Added a benchmark server in EKS that includes the mock inference server 3. Added a locust (https://locust.io/) file for load testing ## Test Plan Run `bash apply.sh`, then `kubectl port-forward service/locust-web-ui 8089:8089`, and go to localhost:8089 to start a load test. <img width="1392" height="334" alt="image" src="https://github.com/user-attachments/assets/d6aa3deb-583a-42ed-889b-751262b8e91c" /> <img width="1362" height="881" alt="image" src="https://github.com/user-attachments/assets/6a28b9b4-05e6-44e2-b504-07e60c12d35e" /> | ||
|  | 88c4fdc5d7 | chore(python-deps): bump chromadb from 1.0.15 to 1.0.16 (#3083) Bumps [chromadb](https://github.com/chroma-core/chroma) from 1.0.15 to 1.0.16. <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/chroma-core/chroma/releases">chromadb's releases</a>.</em></p> <blockquote> <h2>1.0.16</h2> <p>Version: <code>1.0.16</code> Git ref: <code>refs/tags/1.0.16</code> Build Date: <code>2025-08-08T00:26</code> PIP Package: <code>chroma-1.0.16.tar.gz</code> Github Container Registry Image: <code>:1.0.16</code> DockerHub Image: <code>:1.0.16</code></p> <h2>What's Changed</h2> <ul> <li>[ENH]: add cache mount & tolerations to garbage collector template in Helm chart by <a href="https://github.com/codetheweb"><code>@codetheweb</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5016">chroma-core/chroma#5016</a></li> <li>[DOC] Fix docs typo by <a href="https://github.com/itaismith"><code>@itaismith</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5018">chroma-core/chroma#5018</a></li> <li>[CLN] Change GenericQuotaError from 429 to 422 by <a href="https://github.com/drewkim"><code>@drewkim</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5022">chroma-core/chroma#5022</a></li> <li>[CHORE] Fix type error in batch_utils by <a href="https://github.com/jairad26"><code>@jairad26</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5024">chroma-core/chroma#5024</a></li> <li>[ENH] Add block-level metrics by <a href="https://github.com/tanujnay112"><code>@tanujnay112</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/4801">chroma-core/chroma#4801</a></li> <li>[ENH]: return error on /add if embeddings are not provided by <a href="https://github.com/codetheweb"><code>@codetheweb</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5033">chroma-core/chroma#5033</a></li> <li>[DOC] Docs Polish 07/2025 
by <a href="https://github.com/itaismith"><code>@itaismith</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5032">chroma-core/chroma#5032</a></li> <li>[DOC] Flatten public txt files by <a href="https://github.com/itaismith"><code>@itaismith</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5040">chroma-core/chroma#5040</a></li> <li>[ENH]: require embeddings & require min embedding dimension on /add by <a href="https://github.com/codetheweb"><code>@codetheweb</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5037">chroma-core/chroma#5037</a></li> <li>[ENH] - Adds in dark mode support for hero image by <a href="https://github.com/tjkrusinskichroma"><code>@tjkrusinskichroma</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5042">chroma-core/chroma#5042</a></li> <li>[BLD] Use 8core runners for all our windows jobs by <a href="https://github.com/eculver"><code>@eculver</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5027">chroma-core/chroma#5027</a></li> <li>[TST] More benchmark queries for regex by <a href="https://github.com/Sicheng-Pan"><code>@Sicheng-Pan</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/4910">chroma-core/chroma#4910</a></li> <li>[BUG]: refactor otel/tracing initialization in the frontend to be independent of hosted entry point by <a href="https://github.com/c-gamble"><code>@c-gamble</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5028">chroma-core/chroma#5028</a></li> <li>[BUG] js client: handle 422 billing errors as QuotaExceeded instead of ChromaConnectionError by <a href="https://github.com/philipithomas"><code>@philipithomas</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5049">chroma-core/chroma#5049</a></li> <li>[BUG] RLS should use 32MB GRPC payload size limit by <a href="https://github.com/Sicheng-Pan"><code>@Sicheng-Pan</code></a> in <a 
href="https://redirect.github.com/chroma-core/chroma/pull/5044">chroma-core/chroma#5044</a></li> <li>[BUG] Sync protoc arch and version in dockerfile by <a href="https://github.com/Sicheng-Pan"><code>@Sicheng-Pan</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5045">chroma-core/chroma#5045</a></li> <li>[BLD] Fix windows runner label by <a href="https://github.com/eculver"><code>@eculver</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5052">chroma-core/chroma#5052</a></li> <li>[PERF]: Prefetch segments in get and query by <a href="https://github.com/sanketkedia"><code>@sanketkedia</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5053">chroma-core/chroma#5053</a></li> <li>[PERF]: Parallelize fetching blocks for brute force regex by <a href="https://github.com/sanketkedia"><code>@sanketkedia</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5051">chroma-core/chroma#5051</a></li> <li>[RELEASE] JS 3.0.7 by <a href="https://github.com/itaismith"><code>@itaismith</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5059">chroma-core/chroma#5059</a></li> <li>[ENH] Add a delete_many call to the storage API. by <a href="https://github.com/rescrv"><code>@rescrv</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5020">chroma-core/chroma#5020</a></li> <li>[ENH] Consume delete_many from the wal3 garbage collector. 
by <a href="https://github.com/rescrv"><code>@rescrv</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5021">chroma-core/chroma#5021</a></li> <li>[ENH]: limit number of concurrent get_all_block_ids() when using buffer_unordered() by <a href="https://github.com/codetheweb"><code>@codetheweb</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5062">chroma-core/chroma#5062</a></li> <li>[ENH]: use new <code>delete_many()</code> storage method in DeleteUnusedFiles operator by <a href="https://github.com/codetheweb"><code>@codetheweb</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5061">chroma-core/chroma#5061</a></li> <li>[BUG]: Disable aws stalled stream protection by <a href="https://github.com/tanujnay112"><code>@tanujnay112</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5063">chroma-core/chroma#5063</a></li> <li>[DOC] Update manage collections docs with correct delete collection info by <a href="https://github.com/jairad26"><code>@jairad26</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5066">chroma-core/chroma#5066</a></li> <li>[BUG] Improve wal3 robustness with better shutdown handling and error recovery by <a href="https://github.com/rescrv"><code>@rescrv</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5046">chroma-core/chroma#5046</a></li> <li>[ENH] Do not do any mutations of the manifest from within GC. 
by <a href="https://github.com/rescrv"><code>@rescrv</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5050">chroma-core/chroma#5050</a></li> <li>[CHORE]: enable change notifier otel/tracing by <a href="https://github.com/c-gamble"><code>@c-gamble</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5073">chroma-core/chroma#5073</a></li> <li>[CHORE] Add pprof server to query service by <a href="https://github.com/eculver"><code>@eculver</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5072">chroma-core/chroma#5072</a></li> <li>[ENH]: Dedup inserts to the same key in foyer by <a href="https://github.com/sanketkedia"><code>@sanketkedia</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5074">chroma-core/chroma#5074</a></li> <li>[ENH] "Failed to fetch: status: NotFound" be gone. by <a href="https://github.com/rescrv"><code>@rescrv</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5064">chroma-core/chroma#5064</a></li> <li>[CLN] Remove the the top most spammy log lines from rls/wal3. by <a href="https://github.com/rescrv"><code>@rescrv</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5071">chroma-core/chroma#5071</a></li> <li>[DOC] Fix badge in readme by <a href="https://github.com/kylediaz"><code>@kylediaz</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5025">chroma-core/chroma#5025</a></li> <li>[ENH] A tool for patching logs that were deleted before a new manifest was installed. 
by <a href="https://github.com/rescrv"><code>@rescrv</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5083">chroma-core/chroma#5083</a></li> <li>[BUG] Add billing errors to JS client by <a href="https://github.com/itaismith"><code>@itaismith</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5084">chroma-core/chroma#5084</a></li> <li>[CHORE]: Add s3 get metrics and pod name to tracing spans by <a href="https://github.com/tanujnay112"><code>@tanujnay112</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5086">chroma-core/chroma#5086</a></li> <li>[RELEASE] JS 3.0.8 by <a href="https://github.com/itaismith"><code>@itaismith</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5087">chroma-core/chroma#5087</a></li> <li>[ENH] A tool to purge the cache. by <a href="https://github.com/rescrv"><code>@rescrv</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5085">chroma-core/chroma#5085</a></li> <li>[DOC] Update PR template for migration and observability by <a href="https://github.com/HammadB"><code>@HammadB</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5089">chroma-core/chroma#5089</a></li> <li>[CHORE]: Fix s3 get metric name by <a href="https://github.com/tanujnay112"><code>@tanujnay112</code></a> in <a href="https://redirect.github.com/chroma-core/chroma/pull/5091">chroma-core/chroma#5091</a></li> </ul> <!-- raw HTML omitted --> </blockquote> <p>... (truncated)</p> </details> <details> <summary>Commits</summary> <ul> <li><a href=" | ||
|  | 393f3714b0 | chore(python-deps): bump torch from 2.7.1 to 2.8.0 (#3082) Bumps [torch](https://github.com/pytorch/pytorch) from 2.7.1 to 2.8.0. <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/pytorch/pytorch/releases">torch's releases</a>.</em></p> <blockquote> <h1>PyTorch 2.8.0 Release Notes</h1> <ul> <li><a href="https://github.com/pytorch/pytorch/blob/HEAD/#highlights">Highlights</a></li> <li><a href="https://github.com/pytorch/pytorch/blob/HEAD/#backwards-incompatible-changes">Backwards Incompatible Changes</a></li> <li><a href="https://github.com/pytorch/pytorch/blob/HEAD/#deprecations">Deprecations</a></li> <li><a href="https://github.com/pytorch/pytorch/blob/HEAD/#new-features">New Features</a></li> <li><a href="https://github.com/pytorch/pytorch/blob/HEAD/#improvements">Improvements</a></li> <li><a href="https://github.com/pytorch/pytorch/blob/HEAD/#bug-fixes">Bug fixes</a></li> <li><a href="https://github.com/pytorch/pytorch/blob/HEAD/#performance">Performance</a></li> <li><a href="https://github.com/pytorch/pytorch/blob/HEAD/#documentation">Documentation</a></li> <li><a href="https://github.com/pytorch/pytorch/blob/HEAD/#developers">Developers</a></li> </ul> <h1>Highlights</h1> <!-- raw HTML omitted --> </blockquote> <p>... (truncated)</p> </details> <details> <summary>Commits</summary> <ul> <li><a href=" | ||
|  | b70e2f1f09 | fix(dep): update to openai >= 1.99.6 and use new Function location (#3087) # What does this PR do? Closes #3072 ## Test Plan CI | ||
|  | e565b91182 | build: Bump version to 0.2.17 | ||
|  | 55a2694c80 | chore(python-deps): bump openai from 1.97.1 to 1.98.0 (#3025) Bumps [openai](https://github.com/openai/openai-python) from 1.97.1 to 1.98.0. <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/openai/openai-python/releases">openai's releases</a>.</em></p> <blockquote> <h2>v1.98.0</h2> <h2>1.98.0 (2025-07-30)</h2> <p>Full Changelog: <a href="https://github.com/openai/openai-python/compare/v1.97.2...v1.98.0">v1.97.2...v1.98.0</a></p> <h3>Features</h3> <ul> <li><strong>api:</strong> manual updates (<a href=" | ||
|  | 12f964437a | chore(python-deps): bump opentelemetry-exporter-otlp-proto-http from 1.35.0 to 1.36.0 (#3027) 
		
Bumps [opentelemetry-exporter-otlp-proto-http](https://github.com/open-telemetry/opentelemetry-python) from 1.35.0 to 1.36.0. <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/open-telemetry/opentelemetry-python/blob/main/CHANGELOG.md">opentelemetry-exporter-otlp-proto-http's changelog</a>.</em></p> <blockquote> <h2>Version 1.36.0/0.57b0 (2025-07-29)</h2> <ul> <li> <p>Add missing Prometheus exporter documentation (<a href="https://redirect.github.com/open-telemetry/opentelemetry-python/pull/4485">#4485</a>)</p> </li> <li> <p>Overwrite logging.config.fileConfig and logging.config.dictConfig to ensure the OTLP <code>LogHandler</code> remains attached to the root logger. Fix a bug that can cause a deadlock to occur over <code>logging._lock</code> in some cases (<a href="https://redirect.github.com/open-telemetry/opentelemetry-python/pull/4636">#4636</a>).</p> </li> <li> <p>otlp-http-exporter: set default value for param <code>timeout_sec</code> in <code>_export</code> method (<a href="https://redirect.github.com/open-telemetry/opentelemetry-python/pull/4691">#4691</a>)</p> </li> <li> <p>Update OTLP gRPC/HTTP exporters: calling shutdown will now interrupt exporters that are sleeping before a retry attempt, and cause them to return failure immediately. Update BatchSpan/LogRecordProcessors: shutdown will now complete after 30 seconds of trying to finish exporting any buffered telemetry, instead of continuing to export until all telemetry was exported. 
(<a href="https://redirect.github.com/open-telemetry/opentelemetry-python/pull/4638">#4638</a>).</p> </li> </ul> </blockquote> </details> <details> <summary>Commits</summary> <ul> <li><a href=" | ||
|  | 48b49e318f | chore(python-deps): bump weaviate-client from 4.16.4 to 4.16.5 (#3026) [//]: # (dependabot-start) ⚠️ **Dependabot is rebasing this PR** ⚠️ Rebasing might not happen immediately, so don't worry if this takes some time. Note: if you make any changes to this PR yourself, they will take precedence over the rebase. --- [//]: # (dependabot-end) Bumps [weaviate-client](https://github.com/weaviate/weaviate-python-client) from 4.16.4 to 4.16.5. <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/weaviate/weaviate-python-client/releases">weaviate-client's releases</a>.</em></p> <blockquote> <h2>v3.13.0 - Support for Weaviate v1.18</h2> <h2>What's Changed</h2> <ul> <li>Extend CRUD operations for single data objects and reference with consistency level by <a href="https://github.com/redouan-rhazouani"><code>@redouan-rhazouani</code></a> in <a href="https://redirect.github.com/weaviate/weaviate-python-client/pull/234">weaviate/weaviate-python-client#234</a></li> <li>Extend batch operations with consistency level by <a href="https://github.com/redouan-rhazouani"><code>@redouan-rhazouani</code></a> in <a href="https://redirect.github.com/weaviate/weaviate-python-client/pull/240">weaviate/weaviate-python-client#240</a></li> <li>Add Cursor api by <a href="https://github.com/dirkkul"><code>@dirkkul</code></a> in <a href="https://redirect.github.com/weaviate/weaviate-python-client/pull/241">weaviate/weaviate-python-client#241</a></li> <li>Add support for backup Azure module by <a href="https://github.com/antas-marcin"><code>@antas-marcin</code></a> in <a href="https://redirect.github.com/weaviate/weaviate-python-client/pull/246">weaviate/weaviate-python-client#246</a></li> </ul> <h2>New Contributors</h2> <ul> <li><a href="https://github.com/redouan-rhazouani"><code>@redouan-rhazouani</code></a> made their first contribution in <a 
href="https://redirect.github.com/weaviate/weaviate-python-client/pull/234">weaviate/weaviate-python-client#234</a></li> <li><a href="https://github.com/antas-marcin"><code>@antas-marcin</code></a> made their first contribution in <a href="https://redirect.github.com/weaviate/weaviate-python-client/pull/246">weaviate/weaviate-python-client#246</a></li> </ul> <p><strong>Full Changelog</strong>: <a href="https://github.com/weaviate/weaviate-python-client/compare/v3.12.0...v3.13.0">https://github.com/weaviate/weaviate-python-client/compare/v3.12.0...v3.13.0</a></p> <h2>v3.12.1b - Support for weaviate v1.18</h2> <h2>What's Changed</h2> <ul> <li>Extend CRUD operations for single data objects and reference with consistency level by <a href="https://github.com/redouan-rhazouani"><code>@redouan-rhazouani</code></a> in <a href="https://redirect.github.com/weaviate/weaviate-python-client/pull/234">weaviate/weaviate-python-client#234</a></li> <li>Extend batch operations with consistency level by <a href="https://github.com/redouan-rhazouani"><code>@redouan-rhazouani</code></a> in <a href="https://redirect.github.com/weaviate/weaviate-python-client/pull/240">weaviate/weaviate-python-client#240</a></li> <li>Add Cursor api by <a href="https://github.com/dirkkul"><code>@dirkkul</code></a> in <a href="https://redirect.github.com/weaviate/weaviate-python-client/pull/241">weaviate/weaviate-python-client#241</a></li> </ul> <h2>New Contributors</h2> <ul> <li><a href="https://github.com/redouan-rhazouani"><code>@redouan-rhazouani</code></a> made their first contribution in <a href="https://redirect.github.com/weaviate/weaviate-python-client/pull/234">weaviate/weaviate-python-client#234</a></li> </ul> <p><strong>Full Changelog</strong>: <a href="https://github.com/weaviate/weaviate-python-client/compare/v3.12.0...v3.12.1b">https://github.com/weaviate/weaviate-python-client/compare/v3.12.0...v3.12.1b</a></p> </blockquote> </details> <details> <summary>Changelog</summary> <p><em>Sourced 
from <a href="https://github.com/weaviate/weaviate-python-client/blob/main/docs/changelog.rst">weaviate-client's changelog</a>.</em></p> <blockquote> <h2>Version 4.16.5</h2> <p>This patch version includes: - Add <code>dimensions</code> property to Google vectorizers in <code>Configure.Vectors</code></p> </blockquote> </details> <details> <summary>Commits</summary> <ul> <li><a href=" | ||
|  | 4411e6e362 | chore(ci): remove reportlab dep (#3033) # What does this PR do? Removes the reportlab dependency by replacing dynamic PDF generation with a pre-computed PDF. ## Test Plan CI | ||
|  | edc19698fb | chore(python-deps): bump huggingface-hub from 0.34.2 to 0.34.3 (#3028) Bumps [huggingface-hub](https://github.com/huggingface/huggingface_hub) from 0.34.2 to 0.34.3. <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/huggingface/huggingface_hub/releases">huggingface-hub's releases</a>.</em></p> <blockquote> <h2>[v0.34.3] Jobs improvements and <code>whoami</code> user prefix</h2> <ul> <li>[Jobs] Update uv image <a href="https://redirect.github.com/huggingface/huggingface_hub/issues/3270">#3270</a> by <a href="https://github.com/lhoestq"><code>@lhoestq</code></a></li> <li>[Update] HF Jobs Documentation <a href="https://redirect.github.com/huggingface/huggingface_hub/issues/3268">#3268</a> by <a href="https://github.com/ariG23498"><code>@ariG23498</code></a></li> <li>Add 'user:' prefix to whoami command output <a href="https://redirect.github.com/huggingface/huggingface_hub/issues/3267">#3267</a> by <a href="https://github.com/gary149"><code>@gary149</code></a></li> </ul> <p>Full Changelog: <a href="https://github.com/huggingface/huggingface_hub/compare/v0.34.2...v0.34.3">https://github.com/huggingface/huggingface_hub/compare/v0.34.2...v0.34.3</a></p> </blockquote> </details> <details> <summary>Commits</summary> <ul> <li><a href=" | ||
|  | 33cca26154 | chore: Enabling Integration tests for Weaviate (#2882) # What does this PR do? This PR (1) enables the files API for Weaviate and (2) enables integration tests for Weaviate, which adds a Docker container to the GitHub Action. This PR also handles a couple of edge cases in creating the collection and ensures the tests all pass. ## Test Plan CI enabled --------- Signed-off-by: Francisco Javier Arceo <farceo@redhat.com> | ||
|  | 25d3dfa30f | fix: fix No module named 'ollama' in test_inference_recordings.py (#2967) # What does this PR do? This PR fixes the following errors in unit tests run against an up-to-date main branch: ``` FAILED tests/unit/distribution/test_inference_recordings.py::TestInferenceRecording::test_recording_mode - ModuleNotFoundError: No module named 'ollama' FAILED tests/unit/distribution/test_inference_recordings.py::TestInferenceRecording::test_replay_mode - ModuleNotFoundError: No module named 'ollama' FAILED tests/unit/distribution/test_inference_recordings.py::TestInferenceRecording::test_replay_missing_recording - ModuleNotFoundError: No module named 'ollama' FAILED tests/unit/distribution/test_inference_recordings.py::TestInferenceRecording::test_embeddings_recording - ModuleNotFoundError: No module named 'ollama' =============================== 4 failed, 499 passed, 198 warnings in 34.50s ================================ ``` ## Test Plan Run `./scripts/unit-tests.sh` |