Commit graph

  • f6afb3c26b
    feat(ci): keep only one re-recording job because independent recordings will conflict (#2956) Ashwin Bharambe 2025-07-29 17:48:04 -07:00
  • b237df8f18
    feat(ci): use replay mode, setup ollama if specific label exists on PR (#2955) Ashwin Bharambe 2025-07-29 16:50:26 -07:00
  • 0ac503ec0d
    feat(tests): record responses for evals and telemetry tests (#2954) Ashwin Bharambe 2025-07-29 15:46:21 -07:00
  • 81c7d6fa2e
    chore(ci): disable post training tests (#2953) Ashwin Bharambe 2025-07-29 14:20:09 -07:00
  • 072d20a124
    feat(test): record agents, safety and vector_io integration tests (#2952) Ashwin Bharambe 2025-07-29 14:02:14 -07:00
  • 2d1ab3ca55
    fix: use same image_name logic for build & run config (#2949) Matthew Farrellee 2025-07-29 15:54:21 -04:00
  • 6ac973ec80
    chore: Delete coverage-badge (#2950) Francisco Arceo 2025-07-29 15:53:25 -04:00
  • 2e5ca3f15c
    chore: move recordings one directory upwards Ashwin Bharambe 2025-07-29 12:46:19 -07:00
  • 08b4a1deb3
    feat(tests): introduce inference record/replay to increase test reliability (#2941) Ashwin Bharambe 2025-07-29 12:41:31 -07:00
  • abf1d6a703
    fix: random breakage in llama_stack/ui/package.json Ashwin Bharambe 2025-07-29 12:31:29 -07:00
  • fee365b71e
    fix: delete requirements.txt which crept back in Ashwin Bharambe 2025-07-29 11:30:25 -07:00
  • 58ffd82853
    fix: Update SFTConfig parameter to fix CI and Post Training Workflow (#2948) Nehanth Narendrula 2025-07-29 14:14:04 -04:00
  • c7dc0f21b4
    fix: error on failed job, do not wait for timeout (#2945) Matthew Farrellee 2025-07-29 14:07:51 -04:00
  • 870a37ff4b
    feat: add base64 encoded PDF support for OpenAI Chat Completions (#2881) Nathan Weinberg 2025-07-29 06:23:41 -04:00
  • cf8722079c
    build: Bump version to 0.2.16 github-actions[bot] 2025-07-28 23:13:50 +00:00
  • 19c90d9bfc
    docs: update using llama stack as library docs (#2931) Mark Campbell 2025-07-28 23:35:26 +01:00
  • 4019027070
    chore: revert #2855 (#2939) ehhuang 2025-07-28 15:30:25 -07:00
  • e189f65548
    chore(python-deps): bump pydantic from 2.10.6 to 2.11.7 (#2925) dependabot[bot] 2025-07-28 15:11:54 -07:00
  • 70469c84e9
    chore(packaging): remove requirements.txt (#2938) Ashwin Bharambe 2025-07-28 14:52:24 -07:00
  • cd24aaf3aa
    fix(pre-commit): push properly version 4 Ashwin Bharambe 2025-07-28 13:11:56 -07:00
  • 8fa77bc93e
    fix(pre-commit): push properly version 3 Ashwin Bharambe 2025-07-28 13:02:04 -07:00
  • 3058060e2b
    fix(pre-commit): push properly version 2 Ashwin Bharambe 2025-07-28 12:50:50 -07:00
  • 607574c26a
    fix(pre-commit): push properly Ashwin Bharambe 2025-07-28 12:43:49 -07:00
  • 8961706dea
    fix(pre-commit): dont error if pre-commit itself errors Ashwin Bharambe 2025-07-28 12:35:22 -07:00
  • dd4ea28b49
    fix(dependabot): run pre-commit on dependabot PRs (#2935) Ashwin Bharambe 2025-07-28 12:25:06 -07:00
  • 968fc132d3
    fix(openai-compat): restrict developer/assistant/system/tool messages to text-only content (#2932) Matthew Farrellee 2025-07-28 13:36:34 -04:00
  • 60bb5e307e
    feat(openai): add configurable base_url support with OPENAI_BASE_URL env var (#2919) Matthew Farrellee 2025-07-28 13:16:02 -04:00
  • b1c21a25ec
    docs: remove provider_id from external docs (#2922) Charlie Doern 2025-07-28 13:14:39 -04:00
  • 86fe2b8475
    fix: adjust provider type used in external provider test (#2921) Charlie Doern 2025-07-28 13:14:16 -04:00
  • 47c078fcef
    feat: implement dynamic model detection support for inference providers using litellm (#2886) Matthew Farrellee 2025-07-28 13:13:54 -04:00
  • c48dcafc77
    fix: Fix unit tests CI and failing tests (#2928) Christian Zaccaria 2025-07-28 18:07:26 +01:00
  • 46e2989312
    fix: switch refresh to debug log (#2933) Charlie Doern 2025-07-28 13:02:54 -04:00
  • 3c40c8e583
    fix: litellm_provider_name for llama-api (#2934) Matthew Farrellee 2025-07-28 13:02:16 -04:00
  • 09abdb0a37
    test: upload logs for external provider tests (#2914) Charlie Doern 2025-07-25 18:03:15 -04:00
  • 9583f468f8
    feat(starter)!: simplify starter distro; litellm model registry changes (#2916) Ashwin Bharambe 2025-07-25 15:02:04 -07:00
  • 3344d8a9e5
    fix: separate build and run provider types (#2917) Charlie Doern 2025-07-25 15:39:26 -04:00
  • 025163d8e6
    feat: add auto-generated CI documentation pre-commit hook (#2890) Nathan Weinberg 2025-07-25 11:57:01 -04:00
  • 52201612de
    feat: implement chunk deletion for vector stores (#2701) Derek Higgins 2025-07-25 15:30:30 +01:00
  • 9e77be1f72
    chore: Fix chroma unit tests (#2896) Francisco Arceo 2025-07-25 10:12:14 -04:00
  • ed07a58b50
    fix(registry): ensure clean shutdown (#2901) Ashwin Bharambe 2025-07-25 06:44:31 -07:00
  • de6919ecdd
    refactor: install external providers from module (#2637) Charlie Doern 2025-07-25 09:41:26 -04:00
  • 85223ccc4d
    chore(github-deps): bump astral-sh/setup-uv from 6.4.1 to 6.4.3 (#2902) dependabot[bot] 2025-07-25 10:08:24 +02:00
  • 34093fecd1
    ci: Remove open-pull-requests-limit: 0 from dependabot.yml (#2900) Yuan Tang 2025-07-25 03:49:18 -04:00
  • 3216765c26
    chore(deps): bump form-data from 4.0.2 to 4.0.4 in /llama_stack/ui (#2898) dependabot[bot] 2025-07-24 21:24:56 -04:00
  • 21bae296f2
    feat(auth): API access control (#2822) ehhuang 2025-07-24 15:30:48 -07:00
  • 7cc4819e90
    feat: add MCP Streamable HTTP support (#2554) Calum Murray 2025-07-24 18:04:27 -04:00
  • 632cf9eb72
    feat: Bring Your Own API (BYOA) (#2228) Sébastien Han 2025-07-24 22:41:14 +02:00
  • 341504869e
    fix: use logger for console telemetry (#2844) Charlie Doern 2025-07-24 16:26:59 -04:00
  • abade761e0
    docs: Update nvidia docs template (#2893) Kelly Brown 2025-07-24 16:11:34 -04:00
  • 226b877ca6
    chore: install script should use starter (#2891) Sébastien Han 2025-07-24 21:18:02 +02:00
  • cbe89d2bdd
    chore: return webmethod from find_matching_route (#2883) ehhuang 2025-07-24 11:37:21 -07:00
  • 1463b79218
    feat(registry): make the Stack query providers for model listing (#2862) Ashwin Bharambe 2025-07-24 10:39:53 -07:00
  • 537dc693ee
    chore: add mypy coverage to inspect.py and library_client.py in /distribution (#2707) Stefan Thaler 2025-07-24 17:51:46 +01:00
  • d4f0b430e2
    docs: update list of apis (#2697) Charlie Doern 2025-07-24 12:50:14 -04:00
  • af9c707eaf
    fix: various improvements on install.sh (#2724) Sébastien Han 2025-07-24 18:43:51 +02:00
  • 4ea1f2aa9f
    test: Add VLLM provider support to integration tests (#2757) Derek Higgins 2025-07-24 17:42:26 +01:00
  • 6ab5760a1b
    chore(test): migrate unit tests from unittest to pytest nvidia test safety (#2793) Mustafa Elbehery 2025-07-24 18:41:07 +02:00
  • 9069d878ef
    docs: Update CHANGELOG.md (#2874) Yuan Tang 2025-07-24 12:36:28 -04:00
  • 7f7b990b80
    docs: Document use cases for Responses and Agents APIs (#2756) Christian Zaccaria 2025-07-24 17:20:04 +01:00
  • 5ef2baacdc
    fix: update check-workflows-use-hashes to use github error format (#2875) Mohit Gaur 2025-07-24 21:11:17 +05:30
  • e33a50480d
    fix: starter template and litellm backward compat conflict for openai (#2885) Matthew Farrellee 2025-07-24 11:28:37 -04:00
  • cd8715d327
    chore: Added openai compatible vector io endpoints for chromadb (#2489) Sarthak Deshpande 2025-07-24 02:21:58 +05:30
  • fd2aab8582
    fix: prevent shell redirection issues with pip dependencies (#2867) Derek Higgins 2025-07-23 20:43:33 +01:00
  • 427136bb63
    fix: cleanup after build_container.sh (#2869) Derek Higgins 2025-07-23 19:54:54 +01:00
  • 51affe5783
    fix: fixed test_access_control.py unit test (#2876) IAN MILLER 2025-07-23 19:50:20 +01:00
  • 2fcfb0f0b5
    fix: bring back dell template (#2880) Ashwin Bharambe 2025-07-23 11:40:59 -07:00
  • 8353ad4981
    fix: search mode validation for rag query (#2857) Mark Campbell 2025-07-23 19:25:12 +01:00
  • 2aba2c1236
    chore: Moving vector store and vector store files helper methods to openai_vector_store_mixin (#2863) Francisco Arceo 2025-07-23 13:35:48 -04:00
  • e1ed152779
    chore: create OpenAIMixin for inference providers with an OpenAI-compat API that need to implement openai_* methods (#2835) Matthew Farrellee 2025-07-23 06:49:40 -04:00
  • fc67ad408a
    chore: add some documentation for access policy rules (#2785) grs 2025-07-23 09:27:27 +01:00
  • c0563c0560
    fix: honour deprecation of --config and --template (#2856) Sébastien Han 2025-07-23 05:48:23 +02:00
  • 340448e0aa
    fix: optimize container build by enabling uv cache (#2855) Derek Higgins 2025-07-23 00:51:52 +01:00
  • 3b83032555
    feat(registry): more flexible model lookup (#2859) Ashwin Bharambe 2025-07-22 15:22:48 -07:00
  • 9736f096f6
    chore(test): fix flaky telemetry tests (#2815) Mustafa Elbehery 2025-07-22 21:30:14 +02:00
  • c1a63fcd87
    fix(install): explicit docker.io usage (#2850) Omer Tuchfeld 2025-07-22 20:36:48 +02:00
  • 20c3197952
    chore: Making name optional in openai_create_vector_store (#2858) Francisco Arceo 2025-07-22 13:31:31 -04:00
  • 8e1a2b4703
    chore: remove *_openai_compat providers (#2849) ehhuang 2025-07-22 10:25:36 -07:00
  • 5e18d4d097
    fix(agent): ensure turns are sorted (#2854) Omer Tuchfeld 2025-07-22 19:24:51 +02:00
  • b5a6ecc331
    docs: minor fix of the pgvector provider spec description (#2847) Jeremy Bonghwan Choi 2025-07-22 15:10:35 +10:00
  • 2bc96613f9
    chore: Adding demo script and importing it into the docs (#2848) Francisco Arceo 2025-07-21 22:53:32 -04:00
  • c8f274347d
    chore: Adding Access Control for OpenAI Vector Stores methods (#2772) Francisco Arceo 2025-07-21 16:22:44 -04:00
  • 0d7a90b8bc
    chore: merge --config and --template in server.py (#2716) ehhuang 2025-07-21 13:19:27 -07:00
  • 9a03526672
    fix: uvicorn respect log_config (#2842) Charlie Doern 2025-07-21 15:50:39 -04:00
  • 019ddda138
    fix: graceful SIGINT on server (#2831) Sébastien Han 2025-07-21 20:35:15 +02:00
  • d0208df286
    test: skip flaky telemetry tests (#2814) ehhuang 2025-07-21 10:01:40 -07:00
  • 9e6860b9cf
    fix: remove @pytest.mark.asyncio from test_get_raw_document_text.py (#2840) IAN MILLER 2025-07-21 17:14:34 +01:00
  • 89c49eb003
    feat: Allow application/yaml as mime_type (#2575) Ondrej Metelka 2025-07-21 15:43:32 +02:00
  • b2c7543af7
    fix(vectordb): VectorDBInput has no provider_id (#2830) Mustafa Elbehery 2025-07-21 14:03:40 +02:00
  • ecd28f0085
    chore: add contribution guideline around PRs (#2811) Sébastien Han 2025-07-21 11:47:17 +02:00
  • 56269245c2
    fix: Add permissions for pull request creation in coverage-badge workflow (#2832) Christian Zaccaria 2025-07-21 10:40:00 +01:00
  • 28956f9447
    chore(github-deps): bump astral-sh/setup-uv from 6.3.1 to 6.4.1 (#2827) dependabot[bot] 2025-07-19 21:10:35 -05:00
  • 0a6e588f68
    feat: enable auth for LocalFS Files Provider (#2773) ehhuang 2025-07-18 19:11:01 -07:00
  • dd303327f3
    feat(ci): add a ci-tests distro (#2826) Ashwin Bharambe 2025-07-18 17:11:06 -07:00
  • 199f859eec
    feat(vllm): periodically refresh models (#2823) Ashwin Bharambe 2025-07-18 15:53:09 -07:00
  • ade075152e
    chore: kill inline::vllm (#2824) Ashwin Bharambe 2025-07-18 15:52:18 -07:00
  • 68a2dfbad7
    feat(ollama): periodically refresh models (#2805) Ashwin Bharambe 2025-07-18 12:20:36 -07:00
  • 6d55f2f137
    feat: enable ls client for files tests (#2769) ehhuang 2025-07-18 12:10:30 -07:00
  • 874b1cb00f
    fix: DPOAlignmentConfig schema to use correct DPO parameters (#2804) Nehanth Narendrula 2025-07-18 14:56:00 -04:00
  • d994305f0a
    fix: remove disabled providers from model dump (#2784) Charlie Doern 2025-07-18 13:44:35 -04:00
  • 15916852e8
    chore: Add slekkala1 to codeowners (#2817) slekkala1 2025-07-18 10:33:30 -07:00