Commit graph

  • ae975f96ce
    Fix pre-commit Tami Takamiya 2025-09-11 13:23:38 -04:00
  • b973c43c2e
    Unit test Tami Takamiya 2025-08-01 08:08:59 -04:00
  • 30768cc8c6
    Add items and title to ToolParameter/ToolParamDefinition Tami Takamiya 2025-07-31 14:03:04 -04:00
  • 6b855af96f
    feat: introduce api leveling proposal (#3317) Charlie Doern 2025-09-16 12:18:36 -04:00
  • 3e02dc5c2f feat: add embedding and dynamic model support to Together inference adapter Matthew Farrellee 2025-09-16 12:13:14 -04:00
  • 9fc0696052 fix: Fixing prompts import warning Francisco Javier Arceo 2025-09-16 09:11:17 -04:00
  • 046d85fa79
    feat: ability to use postgres as store for starter distro Sébastien Han 2025-07-22 11:31:30 +02:00
  • 86a85da877
    chore: re-add files in the package Sébastien Han 2025-09-16 14:45:37 +02:00
  • b4789c5941
    chore: exclude ci-test distro from the package Sébastien Han 2025-09-16 14:34:06 +02:00
  • 65d45c7318
    chore: various watsonx fixes (#3428) Sébastien Han 2025-09-16 13:55:10 +02:00
  • a1527b3528
    chore: rm tgi Sébastien Han 2025-09-16 13:21:25 +02:00
  • ed8b884a71
    chore: various watsonx fixes Sébastien Han 2025-09-12 16:57:29 +02:00
  • 6502779f09 Add duplicate comment in test_registry.py Omar Abdelwahab 2025-09-15 18:42:05 -07:00
  • 58a03caa31 Pre-commit Omar Abdelwahab 2025-09-15 18:37:05 -07:00
  • 87f1b6994c Fix duplicate comment in test_registry.py Omar Abdelwahab 2025-09-15 18:21:53 -07:00
  • 26bd6be4f2 updated the warning comment Omar Abdelwahab 2025-09-15 17:09:59 -07:00
  • 5ce41fa85e updated the warning message Omar Abdelwahab 2025-09-15 17:07:42 -07:00
  • a67b9d7523 minor linting change Omar Abdelwahab 2025-09-15 17:01:02 -07:00
  • b0cc00e490 ran pre-commit Omar Abdelwahab 2025-09-15 16:55:59 -07:00
  • 3492c9c8d1 Added a bug fix when registering new models Omar Abdelwahab 2025-09-15 16:47:35 -07:00
  • 921f1817ed Release candidate 0.2.21-dev.20250915 v0.2.21-dev.20250915 rc-0.2.21-dev.20250915 github-actions[bot] 2025-09-15 21:14:30 +00:00
  • aa9a572272 pr fix for #3300 Kai Wu 2025-09-15 13:18:37 -07:00
  • f4ab154ade
    feat: add dynamic model registration support to TGI inference (#3417) Matthew Farrellee 2025-09-15 15:52:40 -04:00
  • ab321739f2
    feat: create HTTP DELETE API endpoints to unregister ScoringFn and Benchmark resources in Llama Stack (#3371) IAN MILLER 2025-09-15 20:43:38 +01:00
  • 13d00b0376 fix issue 3330 Kai Wu 2025-09-15 12:32:34 -07:00
  • 91fb6f42cb Merge branch 'main' into use-openai-for-ollama Matthew Farrellee 2025-09-15 15:31:03 -04:00
  • 01bdcce4d2
    chore(recorder): update mocks to be closer to non-mock environment (#3442) Matthew Farrellee 2025-09-15 15:25:53 -04:00
  • 5900bce4a5 Added initial documentation for how to configure and launch llama stack Omar Abdelwahab 2025-09-15 12:10:42 -07:00
  • 5cc605deb5 Set mock_client_instance to async Akram Ben Aissi 2025-09-15 19:23:37 +02:00
  • 67728bfccf Update vLLM health check to use /health endpoint Akram Ben Aissi 2025-09-15 12:57:02 +02:00
  • 5e74bc7fcf Add dynamic authentication token forwarding support for vLLM provider Akram Ben Aissi 2025-09-12 20:21:53 +02:00
  • 0591451ed6
    feat: create HTTP DELETE API endpoints to allow users to free up ScoringFn and Benchmark resources in LS r3v5 2025-09-08 14:28:12 +01:00
  • 2aea2e6cd7
    chore!: remove old ui implementation Sébastien Han 2025-09-15 10:57:29 +02:00
  • 040ff976a3
    Merge 352d990176 into b6cb817897 Mustafa Elbehery 2025-09-15 09:52:04 +02:00
  • b6cb817897
    chore(ui-deps): bump @radix-ui/react-select from 2.2.5 to 2.2.6 in /llama_stack/ui (#3437) dependabot[bot] 2025-09-15 09:46:14 +02:00
  • 36fd97e306
    chore(ui-deps): bump next from 15.3.3 to 15.5.3 in /llama_stack/ui (#3438) dependabot[bot] 2025-09-15 09:46:05 +02:00
  • 64c65fd22c Release candidate 0.0.0.dev20250915001600 rc-0.0.0.dev20250915001600 github-actions[bot] 2025-09-15 00:16:41 +00:00
  • 1039c8e7cf docs: Fix incorrect vector_db_id usage in RAG tutorial Adam Young 2025-09-14 15:10:01 -05:00
  • 7b5685b1d9 update recordings Matthew Farrellee 2025-09-14 09:41:05 -04:00
  • ee79812da2 chore(recorder): update mocks to be closer to non-mock environment Matthew Farrellee 2025-09-14 07:04:44 -04:00
  • b9e09a3787 Release candidate 0.0.0.dev20250914001548 rc-0.0.0.dev20250914001548 github-actions[bot] 2025-09-14 00:16:28 +00:00
  • be81af962a
    chore(ui-deps): bump jest and @types/jest in /llama_stack/ui dependabot[bot] 2025-09-13 20:07:59 +00:00
  • 85a4708f62
    chore(ui-deps): bump @types/node in /llama_stack/ui dependabot[bot] 2025-09-13 20:07:40 +00:00
  • f1dfd4ba70
    chore(ui-deps): bump eslint from 9.26.0 to 9.35.0 in /llama_stack/ui dependabot[bot] 2025-09-13 20:07:32 +00:00
  • 88bd7f1891
    chore(ui-deps): bump next from 15.3.3 to 15.5.3 in /llama_stack/ui dependabot[bot] 2025-09-13 20:07:23 +00:00
  • 3a39c61dd8
    chore(ui-deps): bump @radix-ui/react-select in /llama_stack/ui dependabot[bot] 2025-09-13 20:07:04 +00:00
  • 55470432c4
    chore(python-deps): bump chromadb from 1.0.20 to 1.0.21 dependabot[bot] 2025-09-13 20:05:47 +00:00
  • 93d6c89701
    chore(python-deps): bump openai from 1.107.0 to 1.107.2 dependabot[bot] 2025-09-13 20:05:39 +00:00
  • 7224bfe744
    chore(python-deps): bump opentelemetry-exporter-otlp-proto-http dependabot[bot] 2025-09-13 20:05:32 +00:00
  • 08234a51e6
    chore(python-deps): bump locust from 2.40.1 to 2.40.4 dependabot[bot] 2025-09-13 20:05:26 +00:00
  • 12a2ad1fd1
    chore(python-deps): bump pydantic from 2.11.7 to 2.11.9 dependabot[bot] 2025-09-13 20:05:19 +00:00
  • 6787755c0c
    chore(recorder): add support for NOT_GIVEN (#3430) Matthew Farrellee 2025-09-13 14:11:38 -04:00
  • 8cf2128b40
    chore(tests): always show slowest tests (#3431) Matthew Farrellee 2025-09-13 12:28:04 -04:00
  • c83d649f27 chore(tests): always show slowest tests Matthew Farrellee 2025-09-13 06:40:01 -04:00
  • d37978508f chore(recorder): add support for NOT_GIVEN Matthew Farrellee 2025-09-13 05:06:07 -04:00
  • 1c42816b41 Release candidate 0.0.0.dev20250913001348 rc-0.0.0.dev20250913001348 github-actions[bot] 2025-09-13 00:14:33 +00:00
  • 3de9ad0a87
    chore(recorder, tests): add test for openai /v1/models (#3426) Matthew Farrellee 2025-09-12 17:59:56 -04:00
  • a7caacf1bf further simplify the async management Matthew Farrellee 2025-09-12 17:44:48 -04:00
  • c6403706b4 use task and async generator instead of a thread Matthew Farrellee 2025-09-12 16:41:48 -04:00
  • a673484e21 make sure the mock method identifies as a coroutine Matthew Farrellee 2025-09-12 13:13:13 -04:00
  • a14f42f1b8 chore(recorder, tests): add support for openai /v1/models Matthew Farrellee 2025-09-12 12:12:12 -04:00
  • 4e096bab96 feat: introduce api leveling documentation Charlie Doern 2025-09-03 10:25:01 -04:00
  • fa7144163e refactor(vector-db): remove redundant fallback assignment for provider_vector_db_id Habeb Nawatha 2025-09-12 12:26:22 +03:00
  • f67081d2d6
    feat: migrate to FIPS-validated cryptographic algorithms (#3423) Doug Edgar 2025-09-12 02:18:19 -07:00
  • d31e641d69
    fix: Improve pre-commit workflow error handling and feedback (#3400) Akram Ben Aissi 2025-09-12 10:10:59 +01:00
  • 48dda8bed8 chore(recorder, tests): add test for openai /v1/models Matthew Farrellee 2025-09-12 05:05:05 -04:00
  • f9b423b607
    Update chat_format.py reisnfz 2025-09-12 09:39:50 +08:00
  • c8b008c5cf Release candidate 0.0.0.dev20250912001446 rc-0.0.0.dev20250912001446 github-actions[bot] 2025-09-12 00:15:29 +00:00
  • aaea9fed12
    Merge branch 'main' into chroma Bwook (Byoungwook) Kim 2025-09-12 08:56:46 +09:00
  • f7e4395380 feat: migrate to FIPS-validated cryptographic algorithms Doug Edgar 2025-09-11 15:11:00 -07:00
  • 4271f7352e minor fix fix-fireworks-provider Swapna Lekkala 2025-09-11 16:10:23 -07:00
  • f9348a6bdf test-fireworks-fix Swapna Lekkala 2025-09-11 15:58:38 -07:00
  • 69a52213a1
    fix: oasdiff enhancements and stability (#3419) Charlie Doern 2025-09-11 16:30:09 -04:00
  • a14a164585 fix: Improve pre-commit workflow error handling and feedback Akram Ben Aissi 2025-09-10 13:09:43 +02:00
  • 4532a2f639 fix: oasdiff enhancements and stability Charlie Doern 2025-09-11 11:52:15 -04:00
  • c7ef1f13df
    feat: Add langchain llamastack Integration example notebook (#3314) slekkala1 2025-09-11 11:10:41 -07:00
  • 4375764074
    Merge branch 'main' into crewai Kai Wu 2025-09-11 09:27:27 -07:00
  • 72387b4bd2
    chore(unit tests): remove network use, update async test (#3418) Matthew Farrellee 2025-09-11 11:45:16 -04:00
  • 0afc4d10fa
    Update llama_stack/providers/utils/inference/openai_mixin.py Matthew Farrellee 2025-09-11 11:14:22 -04:00
  • fcda5e976c chore(unit tests): remove network use, update async test Matthew Farrellee 2025-09-11 10:32:04 -04:00
  • 571f998c78
    delete pre-commit in pyproject.toml kimbwook 2025-09-11 23:13:14 +09:00
  • f3bd532461
    delete blank line in vector_utils.py kimbwook 2025-09-11 23:11:24 +09:00
  • bfc8a3b99d
    change exception log parse to chunk kimbwook 2025-09-11 23:09:31 +09:00
  • 729e0f3fcb
    Merge branch 'main' into chroma Bwook (Byoungwook) Kim 2025-09-11 22:59:26 +09:00
  • 8ef1189be7
    chore: update the vLLM inference impl to use OpenAIMixin for openai-compat functions (#3404) Matthew Farrellee 2025-09-11 09:04:38 -04:00
  • 8ed3527a64 revert from Qwen/Qwen3-0.6B Matthew Farrellee 2025-09-11 08:57:43 -04:00
  • 897be1376e
    change Reranker to WeightedInMemoryAggregator kimbwook 2025-09-11 21:40:21 +09:00
  • 60318b659d
    Merge branch 'main' into chroma Bwook (Byoungwook) Kim 2025-09-11 21:30:50 +09:00
  • 6bdcfc2627
    Merge branch 'main' into chroma Bwook (Byoungwook) Kim 2025-09-11 20:51:31 +09:00
  • 11c71c958e
    Merge branch 'main' into chroma Bwook (Byoungwook) Kim 2025-09-11 20:46:53 +09:00
  • ee3df99de4
    feat: add Azure OpenAI inference provider support Sébastien Han 2025-09-01 16:41:30 +02:00
  • 4084158faa
    Merge branch 'main' into fix/vector-db-mandatory-provider-id Habeb Nawatha 2025-09-11 13:09:58 +03:00
  • 4374da02f3
    Merge branch 'main' into fix/vector-db-mandatory-provider-id Habeb Nawatha 2025-09-11 12:02:37 +03:00
  • e6a5ad5e35 chore(pre-commit): apply codegen and permissions fixes Habeb Nawatha 2025-09-11 11:57:07 +03:00
  • c3fc859257 feat: add dynamic model registration support to TGI inference Matthew Farrellee 2025-09-11 02:02:02 -04:00
  • d15368a302
    chore: Updating documentation, adding exception handling for Vector Stores in RAG Tool, more tests on migration, and migrate off of inference_api for context_retriever for RAG (#3367) Francisco Arceo 2025-09-11 06:20:11 -06:00
  • f31bcc11bc
    feat: add Azure OpenAI inference provider support (#3396) Sébastien Han 2025-09-11 13:48:38 +02:00
  • c2d281e01b
    chore(replay): improve replay robustness with un-validated construction (#3414) Matthew Farrellee 2025-09-11 07:48:19 -04:00
  • 2838d5a20f
    fix: AWS Bedrock inference profile ID conversion for region-specific endpoints (#3386) Sumanth Kamenani 2025-09-11 05:41:53 -04:00
  • 8e05c68d15
    chore: remove openai dependency from providers (#3398) Sébastien Han 2025-09-11 10:19:59 +02:00