mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-06 02:30:58 +00:00

History

IAN MILLER 007efa6eb5 refactor: replace default all-MiniLM-L6-v2 embedding model by nomic-embed-text-v1.5 in Llama Stack (#3183 ) # What does this PR do? <!-- Provide a short summary of what this PR does and why. Link to relevant issues if applicable. --> The purpose of this PR is to replace the Llama Stack's default embedding model by nomic-embed-text-v1.5. These are the key reasons why Llama Stack community decided to switch from all-MiniLM-L6-v2 to nomic-embed-text-v1.5: 1. The training data for [all-MiniLM-L6-v2](https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2#training-data) includes a lot of data sets with various licensing terms, so it is tricky to know when/whether it is appropriate to use this model for commercial applications. 2. The model is not particularly competitive on major benchmarks. For example, if you look at the [MTEB Leaderboard](https://huggingface.co/spaces/mteb/leaderboard) and click on Miscellaneous/BEIR to see English information retrieval accuracy, you see that the top of the leaderboard is dominated by enormous models but also that there are many, many models of relatively modest size whith much higher Retrieval scores. If you want to look closely at the data, I recommend clicking "Download Table" because it is easier to browse that way. More discussion info can be founded [here](https://github.com/llamastack/llama-stack/issues/2418) <!-- If resolving an issue, uncomment and update the line below --> <!-- Closes #[issue-number] --> Closes #2418 ## Test Plan <!-- Describe the tests you ran to verify your changes with result summaries. Provide clear instructions so the plan can be easily re-executed. --> 1. Run `./scripts/unit-tests.sh` 2. Integration tests via CI wokrflow --------- Signed-off-by: Sébastien Han <seb@redhat.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: Francisco Arceo <arceofrancisco@gmail.com> Co-authored-by: Sébastien Han <seb@redhat.com>		2025-10-14 10:44:20 -04:00
..
changelog.yml	chore(github-deps): bump actions/checkout from 4.2.2 to 5.0.0 (#3178 )	2025-08-20 16:51:40 -07:00
conformance.yml	feat(api)!: BREAKING CHANGE: support passing `extra_body` through to providers (#3777 )	2025-10-10 16:21:44 -07:00
install-script-ci.yml	chore(github-deps): bump actions/checkout from 4.2.2 to 5.0.0 (#3178 )	2025-08-20 16:51:40 -07:00
integration-auth-tests.yml	fix(auth): allow unauthenticated access to health and version endpoints (#3736 )	2025-10-10 13:41:43 -07:00
integration-sql-store-tests.yml	fix(ci): make all CI workflows have the correct concurrency defn	2025-08-21 16:05:25 -07:00
integration-tests.yml	fix(ci): remove responses from CI for now (#3773 )	2025-10-10 11:52:17 -07:00
integration-vector-io-tests.yml	refactor: replace default all-MiniLM-L6-v2 embedding model by nomic-embed-text-v1.5 in Llama Stack (#3183 )	2025-10-14 10:44:20 -04:00
pre-commit.yml	fix: Improve pre-commit workflow error handling and feedback (#3400 )	2025-09-12 11:10:59 +02:00
precommit-trigger.yml	chore(github-deps): bump actions/github-script from 7.0.1 to 8.0.0 (#3685 )	2025-10-05 21:20:00 -07:00
providers-build.yml	chore: use uvicorn to start llama stack server everywhere (#3625 )	2025-10-06 14:27:40 +02:00
python-build-test.yml	chore(github-deps): bump astral-sh/setup-uv from 6.8.0 to 7.0.0 (#3782 )	2025-10-11 14:14:43 -07:00
README.md	fix: merge workflows to avoid GITHUB_TOKEN limitation	2025-10-03 12:04:02 -07:00
record-integration-tests.yml	feat(tests): make inference_recorder into api_recorder (include tool_invoke) (#3403 )	2025-10-09 14:27:51 -07:00
semantic-pr.yml	chore(github-deps): bump amannn/action-semantic-pull-request from 6.1.0 to 6.1.1 (#3248 )	2025-08-25 17:34:17 +02:00
stale_bot.yml	chore(github-deps): bump actions/stale from 10.0.0 to 10.1.0 (#3684 )	2025-10-08 12:16:54 +02:00
test-external-provider-module.yml	chore!: remove --env from `llama stack run` (#3711 )	2025-10-07 20:58:15 -07:00
test-external.yml	chore!: remove --env from `llama stack run` (#3711 )	2025-10-07 20:58:15 -07:00
ui-unit-tests.yml	chore(github-deps): bump actions/setup-node from 4.4.0 to 5.0.0 (#3353 )	2025-09-08 10:05:00 +02:00
unit-tests.yml	fix(ci): make all CI workflows have the correct concurrency defn	2025-08-21 16:05:25 -07:00

README.md

Llama Stack CI

Llama Stack uses GitHub Actions for Continuous Integration (CI). Below is a table detailing what CI the project includes and the purpose.

Name	File	Purpose
Update Changelog	changelog.yml	Creates PR for updating the CHANGELOG.md
API Conformance Tests	conformance.yml	Run the API Conformance test suite on the changes.
Installer CI	install-script-ci.yml	Test the installation script
Integration Auth Tests	integration-auth-tests.yml	Run the integration test suite with Kubernetes authentication
SqlStore Integration Tests	integration-sql-store-tests.yml	Run the integration test suite with SqlStore
Integration Tests (Replay)	integration-tests.yml	Run the integration test suites from tests/integration in replay mode
Vector IO Integration Tests	integration-vector-io-tests.yml	Run the integration test suite with various VectorIO providers
Pre-commit	pre-commit.yml	Run pre-commit checks
Pre-commit Bot	precommit-trigger.yml	Pre-commit bot for PR
Test Llama Stack Build	providers-build.yml	Test llama stack build
Python Package Build Test	python-build-test.yml	Test building the llama-stack PyPI project
Integration Tests (Record)	record-integration-tests.yml	Run the integration test suite from tests/integration
Check semantic PR titles	semantic-pr.yml	Ensure that PR titles follow the conventional commit spec
Close stale issues and PRs	stale_bot.yml	Run the Stale Bot action
Test External Providers Installed via Module	test-external-provider-module.yml	Test External Provider installation via Python module
Test External API and Providers	test-external.yml	Test the External API and Provider mechanisms
UI Tests	ui-unit-tests.yml	Run the UI test suite
Unit Tests	unit-tests.yml	Run the unit test suite