Commit graph

  • dd49ef31f1
    docs: Update changelog to include recent releases (#2108) Yuan Tang 2025-05-06 17:42:06 -04:00
  • a57985eeac
    fix: add check for interleavedContent (#1973) Kevin Postlethwait 2025-05-06 12:55:07 -04:00
  • 1a529705da
    chore: more mypy fixes (#2029) Sébastien Han 2025-05-06 18:52:31 +02:00
  • feb9eb8b0d
    docs: Remove datasets.rst and fix llama-stack build commands (#2061) Christian Zaccaria 2025-05-06 17:51:20 +01:00
  • c219a74fa0
    fix: Don't require efficiency_config for torchtune (#2104) Ihar Hrachyshka 2025-05-06 12:50:44 -04:00
  • 7377a5c83e
    docs: contrib add a note about unicode in code (#2106) Sébastien Han 2025-05-06 18:50:30 +02:00
  • b9b13a3670
    chore: factor kube auth test distro (#2105) Sébastien Han 2025-05-06 18:49:49 +02:00
  • 2413447467
    ci: add new action to install ollama, cache the model (#2054) Ignas Baranauskas 2025-05-06 13:56:20 +01:00
  • 3022f7b642
    feat: Adding TLS support for Remote::Milvus vector_io (#2011) Divya 2025-05-06 17:45:34 +05:30
  • 65cc971877
    docs: Add TrustyAI LM-Eval to list of known external providers (#2020) Christina Xu 2025-05-06 08:11:55 -04:00
  • 18d2312690
    fix: test_datasets HF scenario in CI (#2090) Christian Zaccaria 2025-05-06 13:09:15 +01:00
  • 2e807b38cc
    chore: Add fixtures to conftest.py (#2067) Derek Higgins 2025-05-06 12:57:48 +01:00
  • 4597145011
    chore: remove recordable mock (#2088) ehhuang 2025-05-05 10:08:55 -07:00
  • a5d151e912
    docs: fix typo mivus.md -> milvus.md (#2102) Sébastien Han 2025-05-05 18:48:38 +02:00
  • a4247ce0a8
    docs: expand contribution guidelines for linting exceptions (#2101) Sébastien Han 2025-05-05 11:36:30 +02:00
  • 1fbda6bfaa
    chore(github-deps): bump actions/setup-python from 5.5.0 to 5.6.0 (#2099) dependabot[bot] 2025-05-05 10:25:45 +02:00
  • 16e163da0e
    docs: List external kubeflow pipelines provider prototype (#2100) Ihar Hrachyshka 2025-05-05 04:24:52 -04:00
  • 15a1648be6
    fix(installer): harden install.sh for Podman macOS (#2068) Alexey Rybak 2025-05-05 00:31:58 -07:00
  • d27a0f276c fix: pytest.mark.skip, not pytest.skip Ashwin Bharambe 2025-05-04 13:21:06 -07:00
  • 6b4c218788 build: Bump version to 0.2.5 github-actions[bot] 2025-05-03 21:31:01 +00:00
  • c69f14bfaa fix: disable rag_and_code_agent test because no code interpreter anymore Ashwin Bharambe 2025-05-03 14:29:06 -07:00
  • 9f27578929
    fix: improve Mermaid diagram visibility in dark mode (#2092) Christian Zaccaria 2025-05-02 21:09:45 +01:00
  • f1b103e6c8
    fix: openai_compat messages system/assistant non-str content (#2095) Ben Browning 2025-05-02 16:09:27 -04:00
  • 272d3359ee
    fix: remove code interpeter implementation (#2087) Ashwin Bharambe 2025-05-01 14:35:08 -07:00
  • 9e6561a1ec
    chore: enable pyupgrade fixes (#1806) Ihar Hrachyshka 2025-05-01 17:23:50 -04:00
  • ffe3d0b2cd
    fix: nullable param type for function call (#2086) ehhuang 2025-05-01 13:17:36 -07:00
  • 88a796ca5a
    fix: allow use of models registered at runtime (#1980) Matthew Farrellee 2025-05-01 15:00:58 -04:00
  • 64829947d0
    feat: Add temperature support to responses API (#2065) Derek Higgins 2025-05-01 19:47:58 +01:00
  • f36f68c590
    ci: Disable no-commit-to-branch (#2084) Ihar Hrachyshka 2025-05-01 14:43:43 -04:00
  • 6378c2a2f3
    fix: resolve BuiltinTools to strings for vllm tool_call messages (#2071) Ben Browning 2025-05-01 08:47:29 -04:00
  • 293d95b955 fix: pre-commit cleanup Ashwin Bharambe 2025-04-30 15:08:14 -07:00
  • dc94433072
    feat(pre-commit): enhance pre-commit hooks with additional checks (#2014) Sébastien Han 2025-04-30 20:35:49 +02:00
  • d897313e0b
    feat: add additional logging to llama stack build (#1689) Nathan Weinberg 2025-04-30 14:06:24 -04:00
  • 2c7aba4158
    fix: enforce stricter ASCII rules lint rules in Ruff (#2062) Sébastien Han 2025-04-30 18:05:27 +02:00
  • eab550f7d2
    fix: Fix messages format in NVIDIA safety check request body (#2063) Jash Gulabrai 2025-04-30 12:01:28 -04:00
  • 4412694018
    chore: Remove zero-width space characters from OTEL service name env var defaults (#2060) Sébastien Han 2025-04-30 17:56:46 +02:00
  • 653e8526ec
    chore(ci): misc Ollama improvements (#2052) Sébastien Han 2025-04-30 16:05:28 +02:00
  • 78ef6a6099
    chore: Increase unit test coverage of routing_tables.py (#2057) Derek Higgins 2025-04-30 15:00:43 +01:00
  • 17b5302543
    fix: Fix precommit-hook (#2059) Derek Higgins 2025-04-30 11:03:19 +01:00
  • afd7e750d9
    ci: add UBI 9 container-build gate (#2039) Alexey Rybak 2025-04-30 00:52:57 -07:00
  • 5a2bfd6ad5
    refactor: Replace SQLITE_DB_PATH by SQLITE_STORE_DIR env in templates (#2055) Roland Huß 2025-04-30 00:28:10 +02:00
  • 7532f4cdb2
    chore(github-deps): bump astral-sh/setup-uv from 5 to 6 (#2051) Yuan Tang 2025-04-29 14:41:41 -04:00
  • 799286fe52 fix: Bump version to 0.2.4 Ashwin Bharambe 2025-04-29 10:34:17 -07:00
  • 4d0bfbf984
    feat: add api.llama provider, llama-guard-4 model (#2058) Ashwin Bharambe 2025-04-29 10:07:41 -07:00
  • 934446ddb4
    fix: ollama still using tools with tool_choice="none" (#2047) Ben Browning 2025-04-29 04:45:28 -04:00
  • 2aca7265b3
    fix: add todo for schema validation (#1991) Kevin Postlethwait 2025-04-29 03:59:35 -04:00
  • fe9b5ef08b
    fix: tools page on playground resets agent after every interaction (#2044) Michael Clifford 2025-04-28 17:13:27 -04:00
  • 7807a86358
    ci: simplify external provider integration test (#2050) Sébastien Han 2025-04-28 23:10:27 +02:00
  • 8dfce2f596
    feat: OpenAI Responses API (#1989) Ben Browning 2025-04-28 17:06:00 -04:00
  • 79851d93aa
    feat: Add Kubernetes authentication (#1778) Sébastien Han 2025-04-28 22:24:58 +02:00
  • e6bbf8d20b
    feat: Add NVIDIA NeMo datastore (#1852) Rashmi Pawar 2025-04-28 22:11:59 +05:30
  • c149cf2e0f
    chore(github-deps): bump actions/setup-python from 5.5.0 to 5.6.0 (#2038) dependabot[bot] 2025-04-28 11:46:29 +02:00
  • 1050837622
    feat: Llama Stack Meta Reference installation script (#1383) Alexey Rybak 2025-04-28 02:25:59 -07:00
  • 921ce36480
    docs: Add changelog for v0.2.2 and v0.2.3 (#2040) Yuan Tang 2025-04-27 14:46:13 -04:00
  • 28687b0e85
    fix: Bump h11 to 0.16.0 to fix cve-2025-43859 (#2041) Yuan Tang 2025-04-27 14:45:35 -04:00
  • 6cf6791de1
    fix: updated watsonx inference chat apis with new repo changes (#2033) Sajikumar JS 2025-04-26 22:47:52 +05:30
  • 0266b20535
    docs: update prompt_format.md for llama4 (#2035) ehhuang 2025-04-25 15:52:15 -07:00
  • bb1a85c9a0 fix: make sure test works equally well against llama stack as a server Ashwin Bharambe 2025-04-25 15:23:53 -07:00
  • 8713d67ce3
    fix: Correctly parse algorithm_config when launching NVIDIA customization job; fix internal request handler (#2025) Jash Gulabrai 2025-04-25 16:21:50 -04:00
  • b5d8e44e81 fix: only sleep for tests when they pass or fail Ashwin Bharambe 2025-04-25 13:15:52 -07:00
  • 1b2e116a2a
    fix: tool call encoded twice (#2034) ehhuang 2025-04-25 13:16:16 -07:00
  • 4fb583b407
    fix: check that llama stack client plain can be used as a subst for OpenAI client (#2032) Ashwin Bharambe 2025-04-25 12:23:33 -07:00
  • 0e4307de0f
    docs: Fix missing --gpu all flag in Docker run commands (#2026) Derek Higgins 2025-04-25 20:17:31 +01:00
  • 1deab94ea0
    chore: exclude test, provider, and template directories from coverage (#2028) Sébastien Han 2025-04-25 21:16:57 +02:00
  • 1bb1d9b2ba
    feat: Add watsonx inference adapter (#1895) Sajikumar JS 2025-04-25 23:59:21 +05:30
  • 29072f40ab
    feat: new system prompt for llama4 (#2031) ehhuang 2025-04-25 11:29:08 -07:00
  • 4bbd0c0693 fix: add endpoint route debugs Ashwin Bharambe 2025-04-25 10:39:30 -07:00
  • f5dae0517c
    feat: Support ReAct Agent on Tools Playground (#2012) Andy Xie 2025-04-25 11:01:51 -04:00
  • 121c73c2f5
    feat(cli): add interactive tab completion for image type selection (#2027) Roland Huß 2025-04-25 16:57:42 +02:00
  • 59b7593609
    feat: Enhance tool display in Tools sidebar by simplifying tool identifiers (#2024) Surya Prakash Pathak 2025-04-25 01:22:22 -07:00
  • d9e00fca66
    fix: specify nbformat version in nb (#2023) Kevin Postlethwait 2025-04-25 04:10:37 -04:00
  • ace82836c1
    feat: NVIDIA allow non-llama model registration (#1859) Rashmi Pawar 2025-04-25 05:43:33 +05:30
  • cc77f79f55
    feat: Add NVIDIA Eval integration (#1890) Jash Gulabrai 2025-04-24 20:12:42 -04:00
  • 0b6cd45950
    fix: Additional streaming error handling (#2007) Ben Browning 2025-04-24 20:01:45 -04:00
  • c8797f1125
    fix: Including tool call in chat (#1931) Derek Higgins 2025-04-25 00:59:10 +01:00
  • 7ed137e963
    fix: meta ref inference (#2022) ehhuang 2025-04-24 13:03:35 -07:00
  • a5d6ab16b2 fix: meta-reference parallel utils bug, use isinstance not equality Ashwin Bharambe 2025-04-24 11:27:49 -07:00
  • 70488abe9c
    chore: Remove distributions/** from integration, external provider, and unit tests (#2018) Francisco Arceo 2025-04-24 09:39:31 -06:00
  • dc0d4763a0
    chore: Update External Providers CI to not run on changes to docs, rfcs, and scripts (#2009) Francisco Arceo 2025-04-24 09:24:07 -06:00
  • e664ba91d8
    fix: prevent the knowledge search tool from confusing the model with long content (#1908) Ilya Kolchinsky 2025-04-24 16:38:38 +02:00
  • 14e60e3c02
    feat: include run.yaml in the container image (#2005) Sébastien Han 2025-04-24 11:29:53 +02:00
  • a673697858
    chore: rename ramalama provider (#2008) Charlie Doern 2025-04-24 03:34:15 -04:00
  • fa5dfee07b
    fix: Return HTTP 400 for OpenAI API validation errors (#2002) Ben Browning 2025-04-23 11:48:32 -04:00
  • 6a44e7ba20
    docs: add API to external providers table (#2006) Nathan Weinberg 2025-04-23 09:58:10 -04:00
  • 64f747fe09
    feat: add tool name to chat output in playground (#1996) Michael Clifford 2025-04-23 09:57:54 -04:00
  • dc46725f56
    fix: properly handle streaming client disconnects (#2000) Ben Browning 2025-04-23 09:44:28 -04:00
  • e0fa67c81c
    docs: add examples for how to define RAG docs (#1981) Kevin Postlethwait 2025-04-23 09:39:18 -04:00
  • deee355952
    fix: Added lazy initialization of the remote vLLM client to avoid issues with expired asyncio event loop (#1969) Ilya Kolchinsky 2025-04-23 15:33:19 +02:00
  • d39462d073
    feat: Hide tool output under an expander in Playground UI (#2003) Ilya Kolchinsky 2025-04-23 15:32:12 +02:00
  • d6e88e0bc6
    docs: add RamaLama to list of known external providers (#2004) Nathan Weinberg 2025-04-23 03:44:18 -04:00
  • 825ce39879
    fix: Together provider shutdown and default to non-streaming (#2001) Ben Browning 2025-04-22 11:47:53 -04:00
  • e4d001c4e4
    feat: cleanup sidebar formatting on tools playground (#1998) Michael Clifford 2025-04-22 04:40:37 -04:00
  • 3110ad1e7c
    fix: update ref to raw_errors due to new version of pydantic (#1995) Kevin Postlethwait 2025-04-21 14:50:12 -04:00
  • 602e949a46
    fix: OpenAI Completions API and Fireworks (#1997) Ben Browning 2025-04-21 14:49:12 -04:00
  • 0d06c654d0
    feat: Update NVIDIA to GA docs; remove notebook reference until ready (#1999) Jash Gulabrai 2025-04-18 19:13:18 -04:00
  • 94f83382eb
    feat: allow building distro with external providers (#1967) Sébastien Han 2025-04-18 17:18:28 +02:00
  • c4570bcb48
    docs: Add tips for debugging remote vLLM provider (#1992) Yuan Tang 2025-04-18 08:47:47 -04:00
  • 9845631d51
    feat: update nvidia inference provider to use model_store (#1988) Matthew Farrellee 2025-04-18 04:16:43 -04:00
  • e72b1076ca
    fix(build): add UBI 9 compiler tool‑chain (#1983) Alexey Rybak 2025-04-18 00:49:10 -07:00
  • 4c6b7005fa
    fix: Fix docs lint issues (#1993) Yuan Tang 2025-04-18 02:33:13 -04:00