Commit graph

  • 8747210470 fix: ollama openai completion and chat completion params Ben Browning 2025-05-08 18:12:51 -04:00
  • 21235f0045 Revert "feat(provider): adding llama4 support in together inference provider (#2123)" Ashwin Bharambe 2025-05-08 14:57:43 -07:00
  • af6f7bc535 adding llama4 support in together inference provider Yogish Baliga 2025-05-08 14:16:35 -07:00
  • 4df8caab41 Fixes for multi-turn tool calls in Responses API Ben Browning 2025-05-08 16:21:15 -04:00
  • 65c56d0ee8 chore: Address review feedback with minor code cleanups Ben Browning 2025-05-08 06:47:53 -04:00
  • 9166baa716 Load OpenAIChatCompletion directly from YAML Derek Higgins 2025-05-06 11:24:38 +01:00
  • dfb641a3de Handle more input content types in Responses API Ben Browning 2025-05-02 14:38:25 -04:00
  • 11081c2029 foo Ben Browning 2025-05-02 14:38:00 -04:00
  • a1955215c4 Update verification run.yaml based on latest main Ben Browning 2025-05-02 12:58:40 -04:00
  • 924213a689 Responses API: Finish wiring up function tool calls Ben Browning 2025-04-30 17:01:00 -04:00
  • 1990df2c50 feat: add function tools to openai responses Ashwin Bharambe 2025-04-30 13:06:33 -07:00
  • b90bb66f28 fix: Restore previous responses to input list, not messages Ben Browning 2025-05-02 11:08:09 -04:00
  • 5b2e850754 fix: Responses API previous_response input items Ben Browning 2025-04-30 12:25:38 -04:00
  • 150b9a0834 feat(openai-responses): Support multiple message roles in API inputs Derek Higgins 2025-05-02 11:10:07 +01:00
  • 1369b5858e chore: Refactor OpenAIChatCompletion's to be loaded from yaml Derek Higgins 2025-05-02 11:07:58 +01:00
  • 7a8e398f09 Removed an accidentally committed debug print. ilya-kolchinsky 2025-05-08 11:08:41 +02:00
  • 57e28b5b9b Reverted the change in handling the case of no chunks found. ilya-kolchinsky 2025-05-08 10:52:00 +02:00
  • 7784307a5f Fixed an "out of token budget" tool execution bug in the remote vLLM provider. ilya-kolchinsky 2025-05-08 10:42:26 +02:00
  • 55da406471 Fixed an error where a tool call fails when the arguments are passed in the chunk containing finish_reason. ilya-kolchinsky 2025-05-08 09:46:35 +02:00
  • 5d70a6baba fix Ashwin Bharambe 2025-05-07 10:00:22 -07:00
  • 98db4a226f
    chore(refact)!: simplify config management Sébastien Han 2025-05-07 16:37:19 +02:00
  • 43d9a20ee6 Update tools notebook to follow CustomTool->ClientTool change. Heiko W. Rupp 2025-05-07 10:13:18 +02:00
  • 72a3837274
    feat: implementation for agent/session list and describe Sébastien Han 2025-04-18 17:58:30 +02:00
  • eda8c272cf test: Document how users can run unit-tests.sh Derek Higgins 2025-04-30 17:21:56 +01:00
  • 471745fff3 get_metrics -> query_metrics Ashwin Bharambe 2025-05-06 17:10:52 -07:00
  • 7073efede5 address feedback Dinesh Yeduguru 2025-03-06 10:15:25 -08:00
  • 3a82b57a26 return metric labels as list Dinesh Yeduguru 2025-03-05 16:30:46 -08:00
  • 49e93085db generate openapi spec Dinesh Yeduguru 2025-02-28 12:25:41 -08:00
  • ba6334475f feat: add metrics query API Dinesh Yeduguru 2025-02-28 12:21:27 -08:00
  • 05e2fa0e4d
    Merge branch 'main' into feat/litellm_sambanova_usage Jorge Piedrahita Ortiz 2025-05-06 17:11:40 -05:00
  • 7aeecb3c5c
    docs: Update changelog to include recent releases Yuan Tang 2025-05-06 15:42:08 -04:00
  • c91c45756b
    Merge branch 'main' into feat/litellm_sambanova_usage Jorge Piedrahita Ortiz 2025-05-06 11:56:33 -05:00
  • f592408978 fix typo jhpiedrahitao 2025-05-06 11:55:53 -05:00
  • d42a9ea53d
    Merge branch 'main' into feat/litellm_sambanova_usage Jorge Piedrahita Ortiz 2025-05-06 11:50:55 -05:00
  • f149d6a8dd
    Merge branch 'main' into feat/litellm_sambanova_usage Jorge Piedrahita Ortiz 2025-05-06 11:50:38 -05:00
  • 2fa500ee68
    Create generator-generic-ossf-slsa3-publish.yml PATRICK Ufuoma 2025-05-06 17:09:36 +01:00
  • 4999c8f9cc fix: missing key Jash Gulabrai 2025-05-06 11:21:38 -04:00
  • b1d941e1f0 Merge branch 'main' into nvidia-e2e-notebook Jash Gulabrai 2025-05-06 11:12:34 -04:00
  • 21125f725f
    Merge branch 'main' into feat/litellm_sambanova_usage Jorge Piedrahita Ortiz 2025-05-06 09:56:22 -05:00
  • b252f32547 fix: test_datasets HF scenario in CI ChristianZaccaria 2025-05-02 13:10:04 +01:00
  • 093aa7ac64 chore: Add fixtures to conftest.py Derek Higgins 2025-04-30 09:49:21 +01:00
  • d0bd6ce2ce
    docs: contrib add a note about unicode in code Sébastien Han 2025-05-06 11:32:50 +02:00
  • 666f009836
    chore: factor kube auth test distro Sébastien Han 2025-05-06 11:14:37 +02:00
  • 54da74a6b0 fix: llama4 tool use prompt fix Eric Huang 2025-05-05 12:15:58 -07:00
  • 0f765f00c2 feat(llama_stack/apis/common/responses.py, llama_stack/distribution/server/server.py, tests/integration/datasets/test_datasets.py): enhance pagination response with next URL RepoBird.ai Agent 2025-04-27 23:50:39 +00:00
  • 75c54547eb fix: Don't require efficiency_config for torchtune Ihar Hrachyshka 2025-05-05 15:53:05 -04:00
  • eb59652e57 fix for pre-commit error Divya 2025-05-05 10:55:49 -07:00
  • f3a3461271 Fix formatting issues via pre-commit hooks Divya 2025-04-30 12:58:36 -07:00
  • 6f2e8fcb82 updated doc for TLS Divya 2025-04-30 12:33:12 -07:00
  • 2845e50cc9 Update docs Divya 2025-04-30 11:58:41 -07:00
  • 77eaffea59 Updated acc to review comments Divya 2025-04-30 09:05:55 -07:00
  • 67db749d15 Made Changes based on review comments Divya 2025-04-29 22:29:12 -07:00
  • 2c682b086a Adding TLS support one-way Divya 2025-04-23 04:09:49 -07:00
  • 2867983cb7
    Merge branch 'main' into feat/litellm_sambanova_usage Jorge Piedrahita Ortiz 2025-05-05 12:18:27 -05:00
  • c4add97fd6 rm inline code_tool jhpiedrahitao 2025-05-05 12:17:52 -05:00
  • 57b5449e85
    Merge branch 'main' into pr2088 Ashwin Bharambe 2025-05-05 09:54:05 -07:00
  • b7f16ac7a6 Merge branch 'main' into feat/litellm_sambanova_usage jhpiedrahitao 2025-05-05 11:49:58 -05:00
  • 1672e11f54
    docs: fix typo mivus.md -> milvus.md Sébastien Han 2025-05-05 11:37:25 +02:00
  • fb2c37b03a
    docs: expand contribution guidelines for linting exceptions Sébastien Han 2025-05-05 11:31:48 +02:00
  • cbb6992896
    chore: more mypy fixes Sébastien Han 2025-04-25 15:43:25 +02:00
  • 47835c8640 docs: List external kubeflow pipelines provider prototype Ihar Hrachyshka 2025-05-04 18:48:57 -04:00
  • d9c5563c0e build: Bump version to 0.2.5 v0.2.5 release-0.2.5 github-actions[bot] 2025-05-03 21:30:14 +00:00
  • 2b6e89cf7a
    Discard changes to CHANGELOG.md Alexey Rybak 2025-05-03 13:55:56 -07:00
  • 5ed34c5651
    chore(github-deps): bump actions/setup-python from 5.5.0 to 5.6.0 dependabot[bot] 2025-05-03 20:50:45 +00:00
  • ee0be5fa1e Release candidate 0.2.5rc1 v0.2.5rc1 github-actions[bot] 2025-05-03 18:22:35 +00:00
  • 49148bb26a fix: openai_compat messages system/assistant non-str content Ben Browning 2025-05-02 15:31:22 -04:00
  • 263eb6fd37 fix: Restore previous responses to input list, not messages Ben Browning 2025-05-02 11:08:09 -04:00
  • 467fb19345 fix: Responses API previous_response input items Ben Browning 2025-04-30 12:25:38 -04:00
  • 97742a1a10 fix: improve Mermaid diagram visibility in dark mode ChristianZaccaria 2025-05-02 15:15:40 +01:00
  • 8620739a17 docs: Remove datasets.rst and fix llama-stack build commands ChristianZaccaria 2025-04-30 12:24:50 +01:00
  • 4657077418 feat(openai-responses): Support multiple message roles in API inputs Derek Higgins 2025-05-02 11:10:07 +01:00
  • e4888b930b chore: Refactor OpenAIChatCompletion's to be loaded from yaml Derek Higgins 2025-05-02 11:07:58 +01:00
  • 9ee93c1477 fix(installer): add shared ‘llama-net’ CNI network for reliable connectivity reluctantfuturist 2025-05-01 16:56:15 -07:00
  • 2887adc33c chore: remove recordable mock Eric Huang 2025-05-01 15:13:27 -07:00
  • 96225e5d5e update docs Ashwin Bharambe 2025-05-01 14:21:23 -07:00
  • 432fd60fe8 skip code interp test Ashwin Bharambe 2025-05-01 14:03:18 -07:00
  • 0f5db647fe no-inline Ashwin Bharambe 2025-05-01 13:53:01 -07:00
  • 9bfb2b9b01 fix: remove code interpeter implementation Ashwin Bharambe 2025-05-01 13:43:01 -07:00
  • 1deb95f922 chore: enable pyupgrade fixes Ihar Hrachyshka 2025-03-26 18:33:23 -04:00
  • cb74e6d9c2 fix: nullable param type for function call Eric Huang 2025-05-01 12:35:55 -07:00
  • a363642734 chore(installer): document libkrun for GPU passthrough on macOS reluctantfuturist 2025-05-01 11:47:28 -07:00
  • 7a2bc1b728 fix(installer): enforce port-publishing and improve cross-platform stability - swap --network host for port-publishing on Podman to fix connection errors - replace GNU timeout with OS-agnostic wait_for_service wrapper - extra logging around Ollama model pull - fix Podman logging reluctantfuturist 2025-05-01 11:35:50 -07:00
  • 97627a5ac4 ci: Disable no-commit-to-branch Ihar Hrachyshka 2025-05-01 11:11:07 -04:00
  • e000ea88b6
    build: Remove no-commit-to-branch check from pre-commit hooks Yuan Tang 2025-05-01 08:50:36 -04:00
  • ddd0c3e435
    ci: add new action to install ollama, pull the image Ignas Baranauskas 2025-04-29 10:36:19 +01:00
  • 70dde487c3 feat: Add temperature support to responses API Derek Higgins 2025-04-30 12:35:09 +01:00
  • fd9d52564b fix: resolve BuiltinTools to strings for vllm tool_call messages Ben Browning 2025-04-30 20:10:33 -04:00
  • 248a4a3f72 feat: add function tools to openai responses Ashwin Bharambe 2025-04-30 13:06:33 -07:00
  • 84adbff5fe chore(changelog): add missing newline reluctantfuturist 2025-04-30 11:50:01 -07:00
  • 7d7b919d85
    Merge branch 'main' into install-script-podman-fix Alexey Rybak 2025-04-30 11:46:59 -07:00
  • 17992330a2 chore(installer): harden install.sh for Podman macOS & ARM64 reluctantfuturist 2025-04-30 11:35:30 -07:00
  • e2960e9e44 fix: inference providers still using tools with tool_choice="none" Ben Browning 2025-04-28 08:43:32 -04:00
  • 0cd3962272 docs: edit expected output in zero to hero guide Nathan Weinberg 2025-03-18 22:49:51 -04:00
  • e91ee75497 feat: add additional logging to llama stack build Nathan Weinberg 2025-03-18 22:39:49 -04:00
  • 012dd6891f Merge branch 'main' into nvidia-e2e-notebook Jash Gulabrai 2025-04-30 12:05:11 -04:00
  • 4f5a1d5afe
    fix: enforce stricter ASCII rules lint rules in Ruff Sébastien Han 2025-04-30 14:33:46 +02:00
  • bfbaf09fa8 fix unit tests Jash Gulabrai 2025-04-30 10:57:40 -04:00
  • f8f59c8335 fix: Update datasets metadata field from provider to provider_id Jash Gulabrai 2025-04-30 10:52:12 -04:00
  • 5fcf20d934 fix: Fix messages format in NVIDIA safety check request body Jash Gulabrai 2025-04-30 10:20:15 -04:00
  • 4799e09f4d
    chore: Remove zero-width space characters from OTEL service name env var defaults Sébastien Han 2025-04-30 12:07:39 +02:00