Commit graph

  • ed7b4731aa
    fix: Setting default value for metadata_token_count in case the key is not found (#2199) Francisco Arceo 2025-05-20 06:03:22 -06:00
  • 6d20b720b8
    feat: Propagate W3C trace context headers from clients (#2153) Ben Browning 2025-05-19 21:56:54 -04:00
  • 82778ecbb0
    fix: remove wrong deprecated warning (#2202) Sébastien Han 2025-05-19 22:02:23 +02:00
  • 0cc0731189
    fix: Pass external_config_dir to BuildConfig (#2190) Michael Anstis 2025-05-19 13:01:28 +01:00
  • 047303e339
    feat: introduce APIs for retrieving chat completion requests (#2145) ehhuang 2025-05-18 21:43:19 -07:00
  • c7015d3d60
    feat: introduce OAuth2TokenAuthProvider and notion of "principal" (#2185) Ashwin Bharambe 2025-05-18 17:54:19 -07:00
  • 1341916caf
    chore(github-deps): bump astral-sh/setup-uv from 5.4.1 to 6.0.1 (#2197) dependabot[bot] 2025-05-18 02:09:56 -04:00
  • f40693e720
    feat: --image-type argument overrides value in --config build.yaml (#2179) Matthew Farrellee 2025-05-16 17:45:41 -04:00
  • f02f7b28c1
    feat: add huggingface post_training impl (#2132) Charlie Doern 2025-05-16 17:41:28 -04:00
  • 8f9964f46b
    fix: update llama stack build --run to use new start_stack.sh signature (#2191) Matthew Farrellee 2025-05-16 17:32:02 -04:00
  • 1ae61e8d5f
    fix: replace all instances of --yaml-config with --config (#2196) Charlie Doern 2025-05-16 17:31:12 -04:00
  • 075c5401f5
    docs: update CHANGELOG.md for v0.2.7 create-pull-request/changelog ashwinb 2025-05-16 21:31:11 +00:00
  • 65cf076f13
    build: Bump version to 0.2.7 github-actions[bot] 2025-05-16 20:32:06 +00:00
  • c281a1a909
    build: Bump version to 0.2.7 v0.2.7 release-0.2.7 github-actions[bot] 2025-05-16 20:31:24 +00:00
  • b8f7e1504d
    feat: allow the interface on which the server will listen to be configured (#2015) grs 2025-05-16 15:59:31 -04:00
  • 64f8d4c3ad
    feat: use openai-python for openai inference provider (#2193) Matthew Farrellee 2025-05-16 15:57:56 -04:00
  • 8bc3d83bb1
    Release candidate 0.2.7rc1 v0.2.7rc1 github-actions[bot] 2025-05-16 19:30:00 +00:00
  • 953ccffca2
    test: catch BadRequestError for non-library client (#2195) ehhuang 2025-05-16 12:26:59 -07:00
  • 7f1f21fd6c
    feat: Adding dark mode, cleaning the UI a small bit, adding a link to the API documentation, and linting the code. (#2182) Francisco Arceo 2025-05-16 11:48:26 -06:00
  • 7aae8fadbf
    fix: dev -> starter rename in ci (#2183) Matthew Farrellee 2025-05-16 03:41:53 -04:00
  • 3cc15f7d15
    fix: misc UI changes (#2175) Sébastien Han 2025-05-15 22:03:05 +02:00
  • 1a6d4af5e9
    refactor: rename dev distro as starter (#2181) Ashwin Bharambe 2025-05-15 12:52:34 -07:00
  • 87e284f1a0
    chore: update CODEOWNERS Ashwin Bharambe 2025-05-15 12:31:12 -07:00
  • 10b1056dea
    fix: multiple tool calls in remote-vllm chat_completion (#2161) Ben Browning 2025-05-15 14:23:29 -04:00
  • bb5fca9521
    chore: more API validators (#2165) Sébastien Han 2025-05-15 20:22:51 +02:00
  • e46de23be6
    feat: refactor external providers dir (#2049) Charlie Doern 2025-05-15 14:17:03 -04:00
  • 7e25c8df28
    fix: ReadTheDocs should display all versions (#2172) Yuan Tang 2025-05-15 11:41:15 -04:00
  • c3f27de3ea
    chore: Update triagers list with new additions (#2180) Ihar Hrachyshka 2025-05-15 11:39:25 -04:00
  • 354faa15ce
    feat: Allow to print usage information for install script (#2171) Yuan Tang 2025-05-15 10:50:56 -04:00
  • 8e7ab146f8
    feat: Adding support for customizing chunk context in RAG insertion and querying (#2134) Francisco Arceo 2025-05-14 19:56:20 -06:00
  • ff247e35be
    feat: scaffolding for Llama Stack UI (#2149) ehhuang 2025-05-14 17:22:46 -07:00
  • b42eb1ccbc
    fix: Responses API: handle type=None in streaming tool calls (#2166) Ben Browning 2025-05-14 17:16:33 -04:00
  • aa5bef8e05
    feat: expand set of known openai models, allow using openai canonical model names (#2164) Matthew Farrellee 2025-05-14 16:18:15 -04:00
  • 5052c3cbf3
    fix: Fixed an "out of token budget" error when attempting a tool call via remote vLLM provider (#2114) Ilya Kolchinsky 2025-05-14 22:11:02 +02:00
  • 268725868e
    chore: enforce no git tags or branches in external github actions (#2159) Ihar Hrachyshka 2025-05-14 14:40:06 -04:00
  • a1fbfb51e2
    ci(chore): use hashes for all version pinning (#2157) Nathan Weinberg 2025-05-14 08:59:58 -04:00
  • 43d4447ff0
    fix: remote vLLM tool execution now works when the last chunk contains the call arguments (#2112) Ilya Kolchinsky 2025-05-14 11:38:00 +02:00
  • 1de0dfaab5
    docs: Clarify kfp provider is both inline and remote (#2144) Ihar Hrachyshka 2025-05-14 03:37:07 -04:00
  • dd07c7a5b5
    fix: Make search tool talk about models (#2151) Derek Higgins 2025-05-14 06:41:51 +01:00
  • 26dffff92a
    chore: remove pytest reports (#2156) Sébastien Han 2025-05-14 07:40:15 +02:00
  • 8e316c9b1e
    feat: function tools in OpenAI Responses (#2094) Ben Browning 2025-05-13 14:29:15 -04:00
  • e0d10dd0b1
    docs: revamp testing documentation (#2155) Nathan Weinberg 2025-05-13 14:28:29 -04:00
  • 62476a5373
    fix: pytest reports (#2152) Sébastien Han 2025-05-13 20:27:29 +02:00
  • e3ad17ec5e
    feat: enable mutual tls (#2140) grs 2025-05-12 17:08:36 -04:00
  • a5d14749a5
    chore: rehydrate requirements.txt (#2146) Sébastien Han 2025-05-12 21:45:35 +02:00
  • 23d9f3b1fb
    build: Bump version to 0.2.6 github-actions[bot] 2025-05-12 18:02:05 +00:00
  • 2669cb5a33
    build: Bump version to 0.2.6 v0.2.6 release-0.2.6 github-actions[bot] 2025-05-12 18:01:15 +00:00
  • c985ea6326
    fix: Adding Embedding model to watsonx inference (#2118) Divya 2025-05-12 23:28:22 +05:30
  • 136e6b3cf7
    fix: ollama openai completion and chat completion params (#2125) Ben Browning 2025-05-12 13:57:53 -04:00
  • 80c349965f
    chore(refact): move paginate_records fn outside of datasetio (#2137) Sébastien Han 2025-05-12 19:56:14 +02:00
  • 53b7f50828
    chore: force ellipsis in API webmethods (#2141) Sébastien Han 2025-05-12 19:55:39 +02:00
  • 43e623eea6
    chore: remove last instances of code-interpreter provider (#2143) Sébastien Han 2025-05-12 19:54:43 +02:00
  • 8bdd0ef2c5
    Release candidate 0.2.6rc1 v0.2.6rc1 github-actions[bot] 2025-05-12 17:42:28 +00:00
  • 675f34e79d
    fix: Syntax error with missing stubs at the end of some function calls (#2116) Krzysztof Malczuk 2025-05-12 16:05:40 +01:00
  • 9a6e91cd93
    fix: chromadb type hint (#2136) Matthew Farrellee 2025-05-12 09:27:01 -04:00
  • db21eab713
    fix: catch TimeoutError in place of asyncio.TimeoutError (#2131) Ihar Hrachyshka 2025-05-12 05:49:59 -04:00
  • dd7be274b9
    fix: raise an error when no vector DB IDs are provided to the RAG tool (#1911) Ilya Kolchinsky 2025-05-12 11:25:13 +02:00
  • f2b83800cc
    docs: Add link to Discord to README (#2126) Yuan Tang 2025-05-10 21:32:44 -04:00
  • 473a07f624
    fix: revert "feat(provider): adding llama4 support in together inference provider (#2123)" (#2124) Ashwin Bharambe 2025-05-08 15:18:16 -07:00
  • 0f878ad87a
    feat(provider): adding llama4 support in together inference provider (#2123) Yogish Baliga 2025-05-08 14:27:56 -07:00
  • fe5f5e530c
    feat: add metrics query API (#1394) Dinesh Yeduguru 2025-05-07 10:11:26 -07:00
  • 6371bb1b33
    chore(refact)!: simplify config management (#1105) Sébastien Han 2025-05-07 18:18:12 +02:00
  • c91e3552a3
    feat: implementation for agent/session list and describe (#1606) Sébastien Han 2025-05-07 14:49:23 +02:00
  • 40e71758d9
    fix: inference providers still using tools with tool_choice="none" (#2048) Ben Browning 2025-05-07 08:34:47 -04:00
  • 6f1badc934
    test: Document how users can run a subset of tests (#2066) Derek Higgins 2025-05-07 13:05:36 +01:00
  • 664161c462
    fix: llama4 tool use prompt fix (#2103) ehhuang 2025-05-06 22:18:31 -07:00
  • b2b00a216b
    feat(providers): sambanova updated to use LiteLLM openai-compat (#1596) Jorge Piedrahita Ortiz 2025-05-06 18:50:22 -05:00
  • dd49ef31f1
    docs: Update changelog to include recent releases (#2108) Yuan Tang 2025-05-06 17:42:06 -04:00
  • a57985eeac
    fix: add check for interleavedContent (#1973) Kevin Postlethwait 2025-05-06 12:55:07 -04:00
  • 1a529705da
    chore: more mypy fixes (#2029) Sébastien Han 2025-05-06 18:52:31 +02:00
  • feb9eb8b0d
    docs: Remove datasets.rst and fix llama-stack build commands (#2061) Christian Zaccaria 2025-05-06 17:51:20 +01:00
  • c219a74fa0
    fix: Don't require efficiency_config for torchtune (#2104) Ihar Hrachyshka 2025-05-06 12:50:44 -04:00
  • 7377a5c83e
    docs: contrib add a note about unicode in code (#2106) Sébastien Han 2025-05-06 18:50:30 +02:00
  • b9b13a3670
    chore: factor kube auth test distro (#2105) Sébastien Han 2025-05-06 18:49:49 +02:00
  • 2413447467
    ci: add new action to install ollama, cache the model (#2054) Ignas Baranauskas 2025-05-06 13:56:20 +01:00
  • 3022f7b642
    feat: Adding TLS support for Remote::Milvus vector_io (#2011) Divya 2025-05-06 17:45:34 +05:30
  • 65cc971877
    docs: Add TrustyAI LM-Eval to list of known external providers (#2020) Christina Xu 2025-05-06 08:11:55 -04:00
  • 18d2312690
    fix: test_datasets HF scenario in CI (#2090) Christian Zaccaria 2025-05-06 13:09:15 +01:00
  • 2e807b38cc
    chore: Add fixtures to conftest.py (#2067) Derek Higgins 2025-05-06 12:57:48 +01:00
  • 4597145011
    chore: remove recordable mock (#2088) ehhuang 2025-05-05 10:08:55 -07:00
  • a5d151e912
    docs: fix typo mivus.md -> milvus.md (#2102) Sébastien Han 2025-05-05 18:48:38 +02:00
  • a4247ce0a8
    docs: expand contribution guidelines for linting exceptions (#2101) Sébastien Han 2025-05-05 11:36:30 +02:00
  • 1fbda6bfaa
    chore(github-deps): bump actions/setup-python from 5.5.0 to 5.6.0 (#2099) dependabot[bot] 2025-05-05 10:25:45 +02:00
  • 16e163da0e
    docs: List external kubeflow pipelines provider prototype (#2100) Ihar Hrachyshka 2025-05-05 04:24:52 -04:00
  • 15a1648be6
    fix(installer): harden install.sh for Podman macOS (#2068) Alexey Rybak 2025-05-05 00:31:58 -07:00
  • d27a0f276c
    fix: pytest.mark.skip, not pytest.skip Ashwin Bharambe 2025-05-04 13:21:06 -07:00
  • 6b4c218788
    build: Bump version to 0.2.5 github-actions[bot] 2025-05-03 21:31:01 +00:00
  • d9c5563c0e
    build: Bump version to 0.2.5 v0.2.5 release-0.2.5 github-actions[bot] 2025-05-03 21:30:14 +00:00
  • c69f14bfaa
    fix: disable rag_and_code_agent test because no code interpreter anymore Ashwin Bharambe 2025-05-03 14:29:06 -07:00
  • ee0be5fa1e
    Release candidate 0.2.5rc1 v0.2.5rc1 github-actions[bot] 2025-05-03 18:22:35 +00:00
  • 9f27578929
    fix: improve Mermaid diagram visibility in dark mode (#2092) Christian Zaccaria 2025-05-02 21:09:45 +01:00
  • f1b103e6c8
    fix: openai_compat messages system/assistant non-str content (#2095) Ben Browning 2025-05-02 16:09:27 -04:00
  • 272d3359ee
    fix: remove code interpeter implementation (#2087) Ashwin Bharambe 2025-05-01 14:35:08 -07:00
  • 9e6561a1ec
    chore: enable pyupgrade fixes (#1806) Ihar Hrachyshka 2025-05-01 17:23:50 -04:00
  • ffe3d0b2cd
    fix: nullable param type for function call (#2086) ehhuang 2025-05-01 13:17:36 -07:00
  • 88a796ca5a
    fix: allow use of models registered at runtime (#1980) Matthew Farrellee 2025-05-01 15:00:58 -04:00
  • 64829947d0
    feat: Add temperature support to responses API (#2065) Derek Higgins 2025-05-01 19:47:58 +01:00
  • f36f68c590
    ci: Disable no-commit-to-branch (#2084) Ihar Hrachyshka 2025-05-01 14:43:43 -04:00
  • 6378c2a2f3
    fix: resolve BuiltinTools to strings for vllm tool_call messages (#2071) Ben Browning 2025-05-01 08:47:29 -04:00
  • 293d95b955
    fix: pre-commit cleanup Ashwin Bharambe 2025-04-30 15:08:14 -07:00