Commit graph

  • 2eae8568e1
    chore: collapse all local hook under the same repo (#2217) Sébastien Han 2025-05-20 18:51:09 +02:00
  • 3f6368d56c
    ci: enable ruff output format for github (#2214) Sébastien Han 2025-05-20 18:04:03 +02:00
  • 90d7612f5f
    chore: Updated readme (#2219) Francisco Arceo 2025-05-20 09:06:20 -06:00
  • 55d88ac194 chore: Updated readme Francisco Javier Arceo 2025-05-20 10:40:38 -04:00
  • 189b6deb89 fix: unit tests Jash Gulabrai 2025-05-20 09:58:07 -04:00
  • 1d94f3617a fix: Pass model param as configuration name to NeMo Customizer Jash Gulabrai 2025-05-20 09:43:51 -04:00
  • dacd522f57 feat(quota): support per‑client and anonymous server‑side request quotas Wen Liang 2025-05-02 16:58:20 -04:00
  • 602e4a90c1
    chore: collapse all local hook under the same repo Sébastien Han 2025-05-20 14:58:43 +02:00
  • b1ab9dce81 fix: synchronize concurrent coroutines checking key set Gordon Sim 2025-05-20 13:02:31 +01:00
  • ed7b4731aa
    fix: Setting default value for metadata_token_count in case the key is not found (#2199) Francisco Arceo 2025-05-20 06:03:22 -06:00
  • c482dfb5f7 feat: add llama stack rm and llama stack list commands Abhishek koserwal 2025-05-13 13:41:33 +05:30
  • f0a142f5a8
    Merge branch 'main' into patch-metadata Francisco Arceo 2025-05-20 03:08:53 -06:00
  • 9d28b731e3
    ci: enable ruff output format for github Sébastien Han 2025-05-20 10:53:20 +02:00
  • 6d20b720b8
    feat: Propagate W3C trace context headers from clients (#2153) Ben Browning 2025-05-19 21:56:54 -04:00
  • 82778ecbb0
    fix: remove wrong deprecated warning (#2202) Sébastien Han 2025-05-19 22:02:23 +02:00
  • 490e77bffa feat: allow access attributes for resources to be configured Gordon Sim 2025-05-06 18:54:58 +01:00
  • 9c8167edd5 feat: Add "instructions" support to responses API Derek Higgins 2025-05-19 17:21:47 +01:00
  • 51b68b4be6 Merge branch 'main' into nvidia-e2e-notebook Jash Gulabrai 2025-05-19 09:23:07 -04:00
  • 0cc0731189
    fix: Pass external_config_dir to BuildConfig (#2190) Michael Anstis 2025-05-19 13:01:28 +01:00
  • e9bcb0e827
    feat: export distribution container build artifacts Sébastien Han 2025-05-16 11:37:56 +02:00
  • 856e27c6df Update BuildConfig.external_providers_dir datatype plus fallout. Michael Anstis 2025-05-19 09:42:01 +01:00
  • de8105e7bf
    fix: remove wrong deprecated warning Sébastien Han 2025-05-19 10:34:02 +02:00
  • e89e1d0cc2 Pass external_config_dir to BuildConfig Michael Anstis 2025-05-16 14:07:09 +01:00
  • 047303e339
    feat: introduce APIs for retrieving chat completion requests (#2145) ehhuang 2025-05-18 21:43:19 -07:00
  • 3bc175320b apis, alt Eric Huang 2025-05-18 21:20:00 -07:00
  • 5a807da6af
    Merge branch 'main' into patch-metadata Francisco Arceo 2025-05-18 19:34:14 -06:00
  • 50bc9053f8 fix: Setting default value for metadata_token_count in case the key is not found Francisco Javier Arceo 2025-05-18 21:29:55 -04:00
  • c7015d3d60
    feat: introduce OAuth2TokenAuthProvider and notion of "principal" (#2185) Ashwin Bharambe 2025-05-18 17:54:19 -07:00
  • 35dcfff203 fix(tests): enable post-training tests Ihar Hrachyshka 2025-03-25 21:42:12 +00:00
  • 736c41332f add invalid token test Ashwin Bharambe 2025-05-18 08:08:44 -07:00
  • cc77a1b4c8 more fixes Ashwin Bharambe 2025-05-18 07:40:11 -07:00
  • b5d5d1fba0 fix deps and tests Ashwin Bharambe 2025-05-15 17:26:42 -07:00
  • b20cce5c43 minor Ashwin Bharambe 2025-05-15 17:16:14 -07:00
  • 529b12dc5e updates Ashwin Bharambe 2025-05-15 17:12:56 -07:00
  • fd86961c88 feat: introduce JWKSAuthProvider Ashwin Bharambe 2025-05-15 17:02:23 -07:00
  • 1341916caf
    chore(github-deps): bump astral-sh/setup-uv from 5.4.1 to 6.0.1 (#2197) dependabot[bot] 2025-05-18 02:09:56 -04:00
  • 2f16e7b0f1
    chore(github-deps): bump astral-sh/setup-uv from 5.4.1 to 6.0.1 dependabot[bot] 2025-05-17 20:38:19 +00:00
  • f40693e720
    feat: --image-type argument overrides value in --config build.yaml (#2179) Matthew Farrellee 2025-05-16 17:45:41 -04:00
  • f02f7b28c1
    feat: add huggingface post_training impl (#2132) Charlie Doern 2025-05-16 17:41:28 -04:00
  • 8f9964f46b
    fix: update llama stack build --run to use new start_stack.sh signature (#2191) Matthew Farrellee 2025-05-16 17:32:02 -04:00
  • 1ae61e8d5f
    fix: replace all instances of --yaml-config with --config (#2196) Charlie Doern 2025-05-16 17:31:12 -04:00
  • 46c5b14a22 feat: handle graceful shutdown Charlie Doern 2025-05-14 15:43:41 -04:00
  • ff246d890a feat: add integration tests for post_training Charlie Doern 2025-05-13 17:21:30 -04:00
  • 7dcb997f17 feat: add huggingface post_training and dataset provider to template Charlie Doern 2025-05-11 21:24:08 -04:00
  • 6c3a40e3d2 feat: add huggingface post_training impl Charlie Doern 2025-05-11 21:23:59 -04:00
  • 65cf076f13 build: Bump version to 0.2.7 github-actions[bot] 2025-05-16 20:32:06 +00:00
  • c281a1a909 build: Bump version to 0.2.7 v0.2.7 release-0.2.7 github-actions[bot] 2025-05-16 20:31:24 +00:00
  • b8f7e1504d
    feat: allow the interface on which the server will listen to be configured (#2015) grs 2025-05-16 15:59:31 -04:00
  • 64f8d4c3ad
    feat: use openai-python for openai inference provider (#2193) Matthew Farrellee 2025-05-16 15:57:56 -04:00
  • 8bc3d83bb1 Release candidate 0.2.7rc1 v0.2.7rc1 github-actions[bot] 2025-05-16 19:30:00 +00:00
  • 953ccffca2
    test: catch BadRequestError for non-library client (#2195) ehhuang 2025-05-16 12:26:59 -07:00
  • 0cf156f27c fix: replace all instances of --yaml-config with --config Charlie Doern 2025-05-16 15:20:59 -04:00
  • 9a835c1452 test: catch BadRequestError for non-library client Eric Huang 2025-05-16 12:19:59 -07:00
  • 966b482b2e feat: allow the interface on which the server will listen to be configured Gordon Sim 2025-05-08 14:21:18 +01:00
  • f316dffe80 feat: add cpu/cuda config for prompt guard Michael Dawson 2025-05-16 14:26:44 -04:00
  • 16efeeb487 Merge branch 'main' into use-openai-for-openai Matthew Farrellee 2025-05-16 14:09:21 -04:00
  • 7f1f21fd6c
    feat: Adding dark mode, cleaning the UI a small bit, adding a link to the API documentation, and linting the code. (#2182) Francisco Arceo 2025-05-16 11:48:26 -06:00
  • 99bd39cc30 feat: use openai-python for openai inference provider Matthew Farrellee 2025-05-10 07:05:30 -04:00
  • 71cfda6a04 fix: update llama stack build --run to use new start_stack.sh signature Matthew Farrellee 2025-05-16 08:58:01 -04:00
  • 7071932212
    Merge branch 'main' into small-ui-patches Francisco Arceo 2025-05-16 04:34:09 -06:00
  • b4f6a6e011 Merge branch 'main' into issue-2163 Matthew Farrellee 2025-05-16 06:09:50 -04:00
  • 7aae8fadbf
    fix: dev -> starter rename in ci (#2183) Matthew Farrellee 2025-05-16 03:41:53 -04:00
  • 8438e9f5f0 fix: dev -> starter rename in ci Matthew Farrellee 2025-05-15 19:00:00 -04:00
  • 52edbe7088 Merge branch 'main' into issue-2163 Matthew Farrellee 2025-05-15 18:51:34 -04:00
  • e9cce9ed38
    Merge branch 'main' into small-ui-patches Francisco Arceo 2025-05-15 14:10:12 -06:00
  • 3cc15f7d15
    fix: misc UI changes (#2175) Sébastien Han 2025-05-15 22:03:05 +02:00
  • 9ee60ab341 updated readme Francisco Javier Arceo 2025-05-15 16:00:15 -04:00
  • 045f8e4a23 feat: Adding dark mode, cleaning the UI a small bit, adding a link to the API documentation, and linting the code Francisco Javier Arceo 2025-05-15 15:54:40 -04:00
  • 1a6d4af5e9
    refactor: rename dev distro as starter (#2181) Ashwin Bharambe 2025-05-15 12:52:34 -07:00
  • 87e284f1a0 chore: update CODEOWNERS Ashwin Bharambe 2025-05-15 12:31:12 -07:00
  • 20a9e30592 fix name Ashwin Bharambe 2025-05-15 12:27:34 -07:00
  • d080b42a9b refactor: rename dev distro as starter Ashwin Bharambe 2025-05-15 12:16:38 -07:00
  • 10b1056dea
    fix: multiple tool calls in remote-vllm chat_completion (#2161) Ben Browning 2025-05-15 14:23:29 -04:00
  • bb5fca9521
    chore: more API validators (#2165) Sébastien Han 2025-05-15 20:22:51 +02:00
  • e46de23be6
    feat: refactor external providers dir (#2049) Charlie Doern 2025-05-15 14:17:03 -04:00
  • 3aba169dd4 remove default for --image-type so we can detect user intent Matthew Farrellee 2025-05-15 12:36:58 -04:00
  • 7e25c8df28
    fix: ReadTheDocs should display all versions (#2172) Yuan Tang 2025-05-15 11:41:15 -04:00
  • c3f27de3ea
    chore: Update triagers list with new additions (#2180) Ihar Hrachyshka 2025-05-15 11:39:25 -04:00
  • 65b7f869f7 chore: Update triagers list with new additions Ihar Hrachyshka 2025-05-15 11:35:39 -04:00
  • 354faa15ce
    feat: Allow to print usage information for install script (#2171) Yuan Tang 2025-05-15 10:50:56 -04:00
  • d52d40dafc feat: refactor external providers dir Charlie Doern 2025-04-28 10:53:17 -04:00
  • 2329bf1004
    address feedback Yuan Tang 2025-05-15 10:38:15 -04:00
  • ce93acdbdf
    Update install.sh Yuan Tang 2025-05-15 10:33:33 -04:00
  • 8a18d5ecbf feat: --image-type argument overrides value in --config build.yaml Matthew Farrellee 2025-05-15 10:26:42 -04:00
  • f9095ce3df
    wip: experiment running the UI from fastapi Sébastien Han 2025-05-15 12:06:17 +02:00
  • 8154dc7500
    Merge e6c9aebe47 into 8e7ab146f8 Roland Huß 2025-05-15 11:50:47 +02:00
  • 93c35ff9ea
    Update install.sh Yuan Tang 2025-05-15 05:47:02 -04:00
  • 9f16693c71
    fix: misc UI changes Sébastien Han 2025-05-15 11:15:00 +02:00
  • bc6bdb7574 remove yq edits in CI test Michele Dolfi 2025-05-15 10:22:28 +02:00
  • 8e7ab146f8
    feat: Adding support for customizing chunk context in RAG insertion and querying (#2134) Francisco Arceo 2025-05-14 19:56:20 -06:00
  • 38f6b6ff9e adding comment into docstring with default value example Francisco Javier Arceo 2025-05-14 21:45:24 -04:00
  • 4a8d738e48
    fix lint Yuan Tang 2025-05-14 21:28:38 -04:00
  • ed9ac853cd
    fix: ReadTheDocs should display all versions Yuan Tang 2025-05-14 21:25:14 -04:00
  • 14497edf89
    Fix lint Yuan Tang 2025-05-14 21:14:07 -04:00
  • 4df07a55f3
    Fix URL Yuan Tang 2025-05-14 21:11:00 -04:00
  • d2cd1d669f
    Update Yuan Tang 2025-05-14 21:09:03 -04:00
  • 18c2494259 chore: simplify true/false evaluation of input/output shields Ben Browning 2025-05-14 16:41:21 -04:00
  • b3493ee94f Update test_agents.py for Llama 4 models and remote-vllm Ben Browning 2025-05-14 10:41:30 -04:00
  • 9f2a7e6a74 fix: multiple tool calls in remote-vllm chat_completion Ben Browning 2025-05-14 07:00:53 -04:00
  • 2bcfbb34ea
    feat: Allow to print usage information for install script Yuan Tang 2025-05-14 20:44:43 -04:00