Commit graph

  • 72b72220ac Updated the llama-stack version in README.md Surya Prakash Pathak 2025-02-28 20:09:51 +00:00
  • 75cda30df7 fix: replace eval with json decoding for format_adapter (#1328) Xi Yan 2025-02-28 11:25:23 -08:00
  • 31c9c6c62f fix: replace eval with json decoding (#1327) Xi Yan 2025-02-28 11:10:45 -08:00
  • bdc1fb1618 eval in post training Xi Yan 2025-02-28 11:16:01 -08:00
  • 317bd21f81 eval to json decoding Xi Yan 2025-02-28 10:46:28 -08:00
  • df2d833e72 more concise Yuan Tang 2025-02-28 12:46:31 -05:00
  • 208f02eb95 docs: Add link to distributions guide in quick start guide Yuan Tang 2025-02-28 12:06:26 -05:00
  • d6204b072f fix: Make remote::vllm compatible with vLLM <= v0.6.3 Yuan Tang 2025-02-28 11:56:17 -05:00
  • d1f2d57768 chore(lint): update Ruff ignores for project conventions and maintainability Sébastien Han 2025-02-20 18:22:35 +01:00
  • 54264b781c cleanup embedding model test suite Matthew Farrellee 2025-02-28 09:04:39 -05:00
  • ff3384dad6 In case of missing provider_id, use the first one (if any) to register an ephemeral vector db Daniele Martinoli 2025-02-28 11:39:23 +01:00
  • 8336db4976 build(container): simply container build command Sébastien Han 2025-02-27 14:01:11 +01:00
  • 7db80df116 build(container): remove uv once done Sébastien Han 2025-02-27 14:00:07 +01:00
  • 44eb308d9c build(container): fix ubi9 container Sébastien Han 2025-02-27 13:59:28 +01:00
  • 41361719fa build(container): remove unused variables Sébastien Han 2025-02-27 13:56:56 +01:00
  • 9f9a140fbc build(container): avoid forcing shebang Sébastien Han 2025-02-27 13:41:59 +01:00
  • 2d0ad6ba3f fixed RAG doc (broken URL) Daniele Martinoli 2025-02-28 10:11:56 +01:00
  • 1181754c5b fixed test_chat_agent Daniele Martinoli 2025-02-28 10:10:11 +01:00
  • 5ca575eefe restored from upstream Daniele Martinoli 2025-02-28 09:47:47 +01:00
  • aa546de8d6 renamed insert_vector_db_id to documents_db_id, removed vector_db_id from session info Daniele Martinoli 2025-02-28 08:32:02 +01:00
  • 3076977937 adding the 1st configured vector_db_id, if any Daniele Martinoli 2025-02-26 12:05:08 +01:00
  • 9ac7f1c8da Bump version to 0.1.5 v0.1.5 github-actions[bot] 2025-02-28 08:14:18 +00:00
  • 56798fbdda Release candidate 0.1.5rc3 v0.1.5rc3 github-actions[bot] 2025-02-28 08:14:18 +00:00
  • a075dded3b minor Ashwin Bharambe 2025-02-27 23:52:13 -08:00
  • 67d34cfbe3 feat: allow conditionally enabling providers in run.yaml Ashwin Bharambe 2025-02-27 23:46:35 -08:00
  • daab1ca3b8 feat: enhance OpenAPI spec to include Error types Ashwin Bharambe 2025-02-27 22:08:57 -08:00
  • 834d117077 Release candidate 0.1.5rc2 v0.1.5rc2 github-actions[bot] 2025-02-28 05:04:13 +00:00
  • ea27ae56f8 send start event by default Hardik Shah 2025-02-27 20:50:17 -08:00
  • 0f5d989c06 nbeval Xi Yan 2025-02-27 20:09:48 -08:00
  • 2fc64d611f max infer iters Xi Yan 2025-02-27 20:08:07 -08:00
  • 52ed2a7c35 Release candidate 0.1.5rc1 v0.1.5rc1 github-actions[bot] 2025-02-28 03:38:04 +00:00
  • 33cfa2fb81 test: Only run embedding tests for remote::nvidia Yuan Tang 2025-02-27 22:31:10 -05:00
  • f55d812d8e do not swallow first token Hardik Shah 2025-02-27 19:17:44 -08:00
  • e157f0ac89 Merge branch 'main' into max_infer_iters Xi Yan 2025-02-27 19:07:12 -08:00
  • ef9233ecd6 fix: Incorrect import path for print_subcommand_description() Yuan Tang 2025-02-27 21:41:08 -05:00
  • 8e79898582 fix: Incorrect import path for print_subcommand_description() Yuan Tang 2025-02-27 21:30:25 -05:00
  • 805541cb7c fix: Incorrect import path for print_subcommand_description() Yuan Tang 2025-02-27 21:14:22 -05:00
  • e64fda8343 fix litellm Xi Yan 2025-02-27 17:44:19 -08:00
  • 9ba1000cf4 removed some filler comments Ashwin Bharambe 2025-02-27 17:31:04 -08:00
  • 17ef47e909 verify recursive nature in structured outputs Hardik Shah 2025-02-27 17:21:32 -08:00
  • be5cc85d6f default prompt Eric Huang 2025-02-27 17:03:55 -08:00
  • c5b6e3845f tmp Xi Yan 2025-02-27 16:47:08 -08:00
  • af254f8cf3 max infer iters Xi Yan 2025-02-27 16:30:45 -08:00
  • 5eceb79be4 fix: update notebooks to avoid using the nutsy --image-name __system__ thing Ashwin Bharambe 2025-02-27 16:06:14 -08:00
  • 1ba3ae03c1 include content, don't include tool args Eric Huang 2025-02-27 16:09:48 -08:00
  • 872f7eead0 Address comments Yuan Tang 2025-02-27 18:28:04 -05:00
  • ea29d0d3a8 chore: add container cmd check reidliu 2025-02-28 06:54:30 +08:00
  • 8ee626b81c rebase Ashwin Bharambe 2025-02-27 14:28:59 -08:00
  • d2b4c7041a alias groq models to their HF aliases Ashwin Bharambe 2025-02-27 13:31:46 -08:00
  • 9f9278f9a8 fix: register provider model name and HF alias in run.yaml Ashwin Bharambe 2025-02-27 12:03:43 -08:00
  • 1d15807f08 precommit Xi Yan 2025-02-27 14:20:48 -08:00
  • 8a0482d845 Merge branch 'main' into debug_duplicate_tools Xi Yan 2025-02-27 14:16:42 -08:00
  • 0560d1f4a2 better checking Xi Yan 2025-02-27 14:14:05 -08:00
  • e32ed65bef lint Xi Yan 2025-02-27 14:02:06 -08:00
  • a99706ec5e fix Xi Yan 2025-02-27 14:01:18 -08:00
  • 58f9fd135b fix Xi Yan 2025-02-27 13:55:46 -08:00
  • a52721948c feat(providers): Groq now uses LiteLLM openai-compat Ashwin Bharambe 2025-02-27 10:08:01 -08:00
  • 32b96d6d91 fix: check conda env name using basepath in exec.py Dinesh Yeduguru 2025-02-27 11:00:05 -08:00
  • 0b9b72dd2c update serialization of values to handle broader types Hardik Shah 2025-02-27 10:56:10 -08:00
  • 90c66c2743 Revert "chore: remove vector_db_id from AgentSessionInfo (#1296)" Xi Yan 2025-02-27 10:34:39 -08:00
  • 4728d9ff0d fix(test): no need to specify tool prompt format explicitly in tests Ashwin Bharambe 2025-02-27 09:33:08 -08:00
  • 3e6806c250 remove Xi Yan 2025-02-27 09:50:29 -08:00
  • 9be83920ef generate openapi spec Dinesh Yeduguru 2025-02-05 11:03:37 -08:00
  • 77c2418a9c metrics for completion API Dinesh Yeduguru 2025-02-05 10:54:08 -08:00
  • e9bb96334b create metric_store Dinesh Yeduguru 2025-02-05 10:29:48 -08:00
  • 23c1aa4504 throw exception when promethues is not enabled Dinesh Yeduguru 2025-02-05 10:11:13 -08:00
  • cce217bab8 add API to query metrics Dinesh Yeduguru 2025-02-05 10:06:59 -08:00
  • b180069def make the telemetry API dep optional in inference router Dinesh Yeduguru 2025-02-05 09:25:21 -08:00
  • 52e533dc89 check for text delta type Dinesh Yeduguru 2025-02-04 16:23:32 -08:00
  • 37b7390079 add metrics for streaming Dinesh Yeduguru 2025-02-04 16:20:59 -08:00
  • 38f1337afa make router call telemetry Dinesh Yeduguru 2025-02-04 15:42:33 -08:00
  • a72cdafac0 Add inference token usage metrics Dinesh Yeduguru 2025-02-04 10:45:16 -08:00
  • 61db5d5074 chore: use func for cmd check reidliu 2025-02-27 23:48:37 +08:00
  • fafab113b4 Create RFC-0002-preprocessing-endpoint.md Ilya Kolchinsky 2025-02-27 13:15:31 +01:00
  • be055717dd ci: add dynamic CI job to test templates Sébastien Han 2025-02-24 12:47:44 +01:00
  • 6079e727d2 fix returning an iterable for the content param Ashwin Bharambe 2025-02-26 21:08:49 -08:00
  • 1cff4c5907 fix openai-compat / litellm conversions Ashwin Bharambe 2025-02-26 20:56:54 -08:00
  • f389afe024 temp fix Eric Huang 2025-02-26 20:44:26 -08:00
  • 4c0f122a4b fix(test): update client-sdk tests to handle tool format parametrization better Ashwin Bharambe 2025-02-26 20:36:57 -08:00
  • b4225d1ed5 remove agents monitoring notebook Xi Yan 2025-02-26 15:50:37 -08:00
  • 843a93c0be update getting started Xi Yan 2025-02-26 15:50:04 -08:00
  • 6ff7ea127f fix sqlite_vec by using local thread Kai Wu 2025-02-26 15:46:19 -08:00
  • 0e40fe9f00 Add model context protocol tools with ollama provider Shreyanand 2025-02-26 17:24:29 -05:00
  • f42dc48986 Merge branch 'meta-llama:main' into fix-ollama-rag Kai Wu 2025-02-26 15:26:45 -08:00
  • 961e098bbb remove prints Xi Yan 2025-02-26 15:20:21 -08:00
  • ec62020a20 retrieve session / turn / steps Xi Yan 2025-02-26 15:16:31 -08:00
  • e8a2733e20 feat: don't silently ignore incorrect toolgroup Eric Huang 2025-02-26 14:57:36 -08:00
  • 48174b5422 Merge branch 'main' into export_agent_dataset Xi Yan 2025-02-26 14:55:50 -08:00
  • 36ff793ca3 tmp export dataset Xi Yan 2025-02-26 14:54:49 -08:00
  • 54a576e739 chore: upgrade uv pre-commit version, uv-sync -> uv-lock Ashwin Bharambe 2025-02-26 14:49:31 -08:00
  • 80fba6247d fix: sqlite conn Eric Huang 2025-02-26 14:08:17 -08:00
  • 5e3ee76acf feat: allow specifying specific tool within toolgroup Eric Huang 2025-02-26 13:51:49 -08:00
  • 6aacf4dd61 fix: time logging format Eric Huang 2025-02-26 13:41:53 -08:00
  • a33569e7e7 refine Botao Chen 2025-02-26 13:23:57 -08:00
  • 20a7af15a9 update notebook Botao Chen 2025-02-26 13:18:40 -08:00
  • 658465e088 always include the type field in requests Matthew Farrellee 2025-02-26 15:30:23 -05:00
  • 53a1698ec3 support nvidia hosted vision models Matthew Farrellee 2025-02-26 14:10:26 -05:00
  • 02a6376f13 Update CONTRIBUTING.md Yuan Tang 2025-02-26 12:31:25 -05:00
  • 73771a43da fix(agent): Raise exception when tool call has empty name Yuan Tang 2025-02-25 21:45:24 -05:00
  • 1de0d1906f docs: update the output of llama-stack-client models list reidliu 2025-02-26 16:48:55 +08:00