Commit graph

  • 248a4a3f72 feat: add function tools to openai responses Ashwin Bharambe 2025-04-30 13:06:33 -07:00
  • 84adbff5fe chore(changelog): add missing newline reluctantfuturist 2025-04-30 11:50:01 -07:00
  • 7d7b919d85
    Merge branch 'main' into install-script-podman-fix Alexey Rybak 2025-04-30 11:46:59 -07:00
  • dc94433072
    feat(pre-commit): enhance pre-commit hooks with additional checks (#2014) Sébastien Han 2025-04-30 20:35:49 +02:00
  • 17992330a2 chore(installer): harden install.sh for Podman macOS & ARM64 reluctantfuturist 2025-04-30 11:35:30 -07:00
  • d897313e0b
    feat: add additional logging to llama stack build (#1689) Nathan Weinberg 2025-04-30 14:06:24 -04:00
  • e2960e9e44 fix: inference providers still using tools with tool_choice="none" Ben Browning 2025-04-28 08:43:32 -04:00
  • 0cd3962272 docs: edit expected output in zero to hero guide Nathan Weinberg 2025-03-18 22:49:51 -04:00
  • e91ee75497 feat: add additional logging to llama stack build Nathan Weinberg 2025-03-18 22:39:49 -04:00
  • 2c7aba4158
    fix: enforce stricter ASCII rules lint rules in Ruff (#2062) Sébastien Han 2025-04-30 18:05:27 +02:00
  • 012dd6891f Merge branch 'main' into nvidia-e2e-notebook Jash Gulabrai 2025-04-30 12:05:11 -04:00
  • eab550f7d2
    fix: Fix messages format in NVIDIA safety check request body (#2063) Jash Gulabrai 2025-04-30 12:01:28 -04:00
  • 4f5a1d5afe
    fix: enforce stricter ASCII rules lint rules in Ruff Sébastien Han 2025-04-30 14:33:46 +02:00
  • 4412694018
    chore: Remove zero-width space characters from OTEL service name env var defaults (#2060) Sébastien Han 2025-04-30 17:56:46 +02:00
  • bfbaf09fa8 fix unit tests Jash Gulabrai 2025-04-30 10:57:40 -04:00
  • f8f59c8335 fix: Update datasets metadata field from provider to provider_id Jash Gulabrai 2025-04-30 10:52:12 -04:00
  • 5fcf20d934 fix: Fix messages format in NVIDIA safety check request body Jash Gulabrai 2025-04-30 10:20:15 -04:00
  • 653e8526ec
    chore(ci): misc Ollama improvements (#2052) Sébastien Han 2025-04-30 16:05:28 +02:00
  • 78ef6a6099
    chore: Increase unit test coverage of routing_tables.py (#2057) Derek Higgins 2025-04-30 15:00:43 +01:00
  • 4799e09f4d
    chore: Remove zero-width space characters from OTEL service name env var defaults Sébastien Han 2025-04-30 12:07:39 +02:00
  • 7275bf7fda chore: Increase unit test coverage of routing_tables.py Derek Higgins 2025-04-29 12:14:19 +01:00
  • 17b5302543
    fix: Fix precommit-hook (#2059) Derek Higgins 2025-04-30 11:03:19 +01:00
  • 07688f5960 fix: Fix precommit-hook Derek Higgins 2025-04-30 10:52:43 +01:00
  • afd7e750d9
    ci: add UBI 9 container-build gate (#2039) Alexey Rybak 2025-04-30 00:52:57 -07:00
  • 5a2bfd6ad5
    refactor: Replace SQLITE_DB_PATH by SQLITE_STORE_DIR env in templates (#2055) Roland Huß 2025-04-30 00:28:10 +02:00
  • 2ada832e69 ci: validate UBI9 base reluctantfuturist 2025-04-29 13:54:58 -07:00
  • 7532f4cdb2
    chore(github-deps): bump astral-sh/setup-uv from 5 to 6 (#2051) Yuan Tang 2025-04-29 14:41:41 -04:00
  • 10ae03eb83 fix(ci): correctly override UBI9 image and switch to full UBI9 reluctantfuturist 2025-04-29 10:45:46 -07:00
  • 799286fe52 fix: Bump version to 0.2.4 Ashwin Bharambe 2025-04-29 10:34:17 -07:00
  • 5c0680cd3f build: Bump version to 0.2.4 v0.2.4 release-0.2.4 github-actions[bot] 2025-04-29 17:23:26 +00:00
  • 302b3050c2 Release candidate 0.2.4rc1 v0.2.4rc1 github-actions[bot] 2025-04-29 17:18:17 +00:00
  • 4d0bfbf984
    feat: add api.llama provider, llama-guard-4 model (#2058) Ashwin Bharambe 2025-04-29 10:07:41 -07:00
  • 96afc98b88 Add reference to notebook in docs Jash Gulabrai 2025-04-29 13:06:43 -04:00
  • 96d4e7241c
    Update config.py Ashwin Bharambe 2025-04-29 10:05:08 -07:00
  • ef2b686ff4
    Update safety_models.py Ashwin Bharambe 2025-04-29 10:03:37 -07:00
  • 2f60f3c347 fix: Consistently prefix customized models with the namespace Jash Gulabrai 2025-04-29 12:57:49 -04:00
  • 38b580db02 feat: add api.llama provider, llama-guard-4 model Ashwin Bharambe 2025-04-29 09:56:46 -07:00
  • ec9fa30d36
    Merge a083465ba4 into 934446ddb4 Neil Mehta 2025-04-29 08:22:55 -04:00
  • 36373e44f2
    refactor: Remove SQLITE_DB_PATH Roland Huß 2025-04-29 12:31:52 +02:00
  • 9f869df356
    chore(ci): misc Ollama improvements Sébastien Han 2025-04-29 10:13:22 +02:00
  • 934446ddb4
    fix: ollama still using tools with tool_choice="none" (#2047) Ben Browning 2025-04-29 04:45:28 -04:00
  • 2aca7265b3
    fix: add todo for schema validation (#1991) Kevin Postlethwait 2025-04-29 03:59:35 -04:00
  • d96a4bc9b2
    Update integration-tests.yml Yuan Tang 2025-04-28 20:43:57 -04:00
  • fe9b5ef08b
    fix: tools page on playground resets agent after every interaction (#2044) Michael Clifford 2025-04-28 17:13:27 -04:00
  • ef3009ca26
    chore(github-deps): bump astral-sh/setup-uv from 5 to 6 dependabot[bot] 2025-04-28 21:11:29 +00:00
  • 7807a86358
    ci: simplify external provider integration test (#2050) Sébastien Han 2025-04-28 23:10:27 +02:00
  • d58c2d157e
    ci: simplify external provider integration test Sébastien Han 2025-04-28 23:00:59 +02:00
  • 8dfce2f596
    feat: OpenAI Responses API (#1989) Ben Browning 2025-04-28 17:06:00 -04:00
  • 7323d8e86f fix tool calling by not relying on finish reason but tool_calls Ashwin Bharambe 2025-04-28 13:39:50 -07:00
  • 79851d93aa
    feat: Add Kubernetes authentication (#1778) Sébastien Han 2025-04-28 22:24:58 +02:00
  • a1524390b9 update the run.yaml Ashwin Bharambe 2025-04-28 12:51:07 -07:00
  • ae012bb857 rename response to responses in verifications, update provider Ashwin Bharambe 2025-04-28 10:46:09 -07:00
  • 78da66016f raise when you find a Literal type we dont support in openapi generator Ashwin Bharambe 2025-04-28 10:37:14 -07:00
  • abd6280cb8 fold openai responses into the Agents API Ashwin Bharambe 2025-04-28 10:27:28 -07:00
  • 207224a811 OpenAPI Responses - move tests under tests/verifications Ben Browning 2025-04-18 15:26:34 -04:00
  • 591e6a3972 OpenAI Responses - streaming handling for text chat responses Ben Browning 2025-04-18 09:45:41 -04:00
  • d523c8692a OpenAI Responses - image support and multi-turn tool calling Ben Browning 2025-04-18 09:13:48 -04:00
  • 35b2e2646f OpenAI Responses API: Stub in basic web_search tool Ben Browning 2025-04-17 20:25:36 -04:00
  • 52a69f0bf9 Extract some helper methods out in openai_responses impl Ben Browning 2025-04-17 15:10:22 -04:00
  • 70c088af3a Stub in an initial OpenAI Responses API Ben Browning 2025-04-17 14:47:24 -04:00
  • 29f57d528d Remove unused env vars; change the other tmp folder name; fix examples Jash Gulabrai 2025-04-28 13:08:36 -04:00
  • c3d8940c95 Merge branch 'main' into nvidia-e2e-notebook Jash Gulabrai 2025-04-28 12:52:46 -04:00
  • c7ab6eeedb Minor unit test updates Jash Gulabrai 2025-04-28 12:49:50 -04:00
  • e6bbf8d20b
    feat: Add NVIDIA NeMo datastore (#1852) Rashmi Pawar 2025-04-28 22:11:59 +05:30
  • e64961697a Rename tmp dir to sample_data; remove print statements Jash Gulabrai 2025-04-28 12:04:36 -04:00
  • 73275f07b7 Merge branch 'main' into nvidia-e2e-notebook Jash Gulabrai 2025-04-28 12:00:11 -04:00
  • a083465ba4 Add openai completion/chat completion Matt Clayton 2025-04-28 09:21:23 -04:00
  • ee1f06417d
    feat: Add Kubernetes authentication Sébastien Han 2025-03-25 18:27:33 +01:00
  • 53f474845b fix: ollama still using tools with tool_choice="none" Ben Browning 2025-04-28 07:55:55 -04:00
  • aa8b2aa31f fix template validation Rashmi Pawar 2025-04-28 16:04:57 +05:30
  • 00e57d693f update provider to provider_id Rashmi Pawar 2025-04-16 14:55:21 +05:30
  • 4491a51149 skip nvidia integration in github actions raspawar 2025-04-09 13:06:06 +00:00
  • 1381d3f3e8 linting fix raspawar 2025-04-09 12:34:36 +00:00
  • 60bf0eb532 datastore documentation raspawar 2025-04-09 12:31:46 +00:00
  • a3c07ac10a update tests raspawar 2025-04-09 12:25:13 +00:00
  • 234f4e4583 add integration test raspawar 2025-04-09 11:10:01 +00:00
  • c139f787c8 add unit tests raspawar 2025-04-02 15:18:15 +00:00
  • cf3f3ff130 linting fix raspawar 2025-04-02 14:26:48 +00:00
  • 2baf252f71 add code for register, unregister raspawar 2025-04-02 14:20:10 +00:00
  • 1e77873a02 add datasetio to distribution raspawar 2025-03-26 15:21:38 +05:30
  • ae973c9595 add datasetio code raspawar 2025-03-26 15:04:20 +05:30
  • c149cf2e0f
    chore(github-deps): bump actions/setup-python from 5.5.0 to 5.6.0 (#2038) dependabot[bot] 2025-04-28 11:46:29 +02:00
  • 1050837622
    feat: Llama Stack Meta Reference installation script (#1383) Alexey Rybak 2025-04-28 02:25:59 -07:00
  • 3b4024bdcc docs: update prompt in quickstart guide to reflect output Bobbins228 2025-04-28 10:25:22 +01:00
  • df2320d302
    chore(github-deps): bump astral-sh/setup-uv from 5 to 6 dependabot[bot] 2025-04-28 00:52:53 +00:00
  • 59e1c5f4a0 Pass 1 for pre-commit fixes Matt Clayton 2025-04-27 15:24:37 -04:00
  • 921ce36480
    docs: Add changelog for v0.2.2 and v0.2.3 (#2040) Yuan Tang 2025-04-27 14:46:13 -04:00
  • 28687b0e85
    fix: Bump h11 to 0.16.0 to fix cve-2025-43859 (#2041) Yuan Tang 2025-04-27 14:45:35 -04:00
  • fdb1109491 fix: tools page on playground resets agent after every interaction Michael Clifford 2025-04-27 13:54:44 -04:00
  • 40160719c8 address disagreement between ruff versions (again) Matthew Farrellee 2025-04-27 10:59:11 -04:00
  • 7fd8a61b4d Merge branch 'main' into test-modelregistryhelper Matthew Farrellee 2025-04-27 10:56:30 -04:00
  • c590674ee2 live listing overrides static listing for ollama & vllm model registration Matthew Farrellee 2025-04-27 10:44:45 -04:00
  • a4c8a849b6 Revert "vllm unit test, check for exception on error" Matthew Farrellee 2025-04-27 10:36:54 -04:00
  • e89fbb8213
    Lint fix Yuan Tang 2025-04-26 21:27:21 -04:00
  • a7fd3c8848
    fix: Bump h11 to 0.16.0 to fix cve-2025-43859 Yuan Tang 2025-04-26 21:23:20 -04:00
  • d840037a15
    docs: Add changelog for v0.2.2 and v0.2.3 Yuan Tang 2025-04-26 21:07:48 -04:00
  • fdaa7adbab ci: add UBI 9 container-build gate reluctantfuturist 2025-04-26 14:59:05 -07:00
  • 9132530ec6
    chore(github-deps): bump actions/setup-python from 5.5.0 to 5.6.0 dependabot[bot] 2025-04-26 20:53:48 +00:00
  • 6cf6791de1
    fix: updated watsonx inference chat apis with new repo changes (#2033) Sajikumar JS 2025-04-26 22:47:52 +05:30
  • 0ec5151ab5 feat: add post_training RuntimeConfig Charlie Doern 2025-04-26 10:40:58 -04:00