Commit graph

  • 6fa257b475
    chore(lint): update Ruff ignores for project conventions and maintainability (#1184) Sébastien Han 2025-02-28 18:36:49 +01:00
  • 3b57d8ee88
    feat: add prompt-format list (#1222) Reid 2025-03-01 01:27:22 +08:00
  • 234408f411
    docs: Add link to distributions guide in quick start guide (#1326) Yuan Tang 2025-02-28 12:18:02 -05:00
  • 208f02eb95
    docs: Add link to distributions guide in quick start guide Yuan Tang 2025-02-28 12:06:26 -05:00
  • d6204b072f
    fix: Make remote::vllm compatible with vLLM <= v0.6.3 Yuan Tang 2025-02-28 11:56:17 -05:00
  • d1f2d57768
    chore(lint): update Ruff ignores for project conventions and maintainability Sébastien Han 2025-02-20 18:22:35 +01:00
  • 54264b781c cleanup embedding model test suite Matthew Farrellee 2025-02-28 09:04:39 -05:00
  • ff3384dad6 In case of missing provider_id, use the first one (if any) to register an ephemeral vector db Daniele Martinoli 2025-02-28 11:39:23 +01:00
  • 8336db4976
    build(container): simply container build command Sébastien Han 2025-02-27 14:01:11 +01:00
  • 7db80df116
    build(container): remove uv once done Sébastien Han 2025-02-27 14:00:07 +01:00
  • 44eb308d9c
    build(container): fix ubi9 container Sébastien Han 2025-02-27 13:59:28 +01:00
  • 41361719fa
    build(container): remove unused variables Sébastien Han 2025-02-27 13:56:56 +01:00
  • 9f9a140fbc
    build(container): avoid forcing shebang Sébastien Han 2025-02-27 13:41:59 +01:00
  • 2d0ad6ba3f fixed RAG doc (broken URL) Daniele Martinoli 2025-02-28 10:11:56 +01:00
  • 1181754c5b fixed test_chat_agent Daniele Martinoli 2025-02-28 10:10:11 +01:00
  • 5ca575eefe restored from upstream Daniele Martinoli 2025-02-28 09:47:47 +01:00
  • aa546de8d6 renamed insert_vector_db_id to documents_db_id, removed vector_db_id from session info Daniele Martinoli 2025-02-28 08:32:02 +01:00
  • 3076977937 adding the 1st configured vector_db_id, if any Daniele Martinoli 2025-02-26 12:05:08 +01:00
  • 9ac7f1c8da Bump version to 0.1.5 v0.1.5 github-actions[bot] 2025-02-28 08:14:18 +00:00
  • 56798fbdda Release candidate 0.1.5rc3 v0.1.5rc3 github-actions[bot] 2025-02-28 08:14:18 +00:00
  • a075dded3b minor Ashwin Bharambe 2025-02-27 23:52:13 -08:00
  • 67d34cfbe3 feat: allow conditionally enabling providers in run.yaml Ashwin Bharambe 2025-02-27 23:46:35 -08:00
  • 7f9b767277
    fix: check conda env name using basepath in exec.py (#1301) Dinesh Yeduguru 2025-02-27 23:07:23 -08:00
  • 8efa53daf1
    fix: Agent telemetry inputs/outputs should be structured (#1302) Hardik Shah 2025-02-27 23:06:37 -08:00
  • caffafd101
    feat: update the default system prompt for 3.2/3.3 models (#1310) ehhuang 2025-02-27 23:05:42 -08:00
  • ece354eedd test: dont hardcode faiss as provider in the tests please Ashwin Bharambe 2025-02-27 22:54:34 -08:00
  • 4c8a0fa8dc fix: ensure ollama embedding model is registered properly in the template Ashwin Bharambe 2025-02-27 22:49:06 -08:00
  • daab1ca3b8 feat: enhance OpenAPI spec to include Error types Ashwin Bharambe 2025-02-27 22:08:57 -08:00
  • 834d117077 Release candidate 0.1.5rc2 v0.1.5rc2 github-actions[bot] 2025-02-28 05:04:13 +00:00
  • 999195fe5b
    fix: [Litellm]Do not swallow first token (#1316) Hardik Shah 2025-02-27 20:53:47 -08:00
  • ea27ae56f8 send start event by default Hardik Shah 2025-02-27 20:50:17 -08:00
  • 7780fc92d5
    fix: update getting_started notebook to pass nbeval (#1318) Xi Yan 2025-02-27 20:13:00 -08:00
  • 0f5d989c06 nbeval Xi Yan 2025-02-27 20:09:48 -08:00
  • 2fc64d611f max infer iters Xi Yan 2025-02-27 20:08:07 -08:00
  • 52ed2a7c35 Release candidate 0.1.5rc1 v0.1.5rc1 github-actions[bot] 2025-02-28 03:38:04 +00:00
  • 6824d23dc9
    test: Only run embedding tests for remote::nvidia (#1317) Yuan Tang 2025-02-27 22:35:52 -05:00
  • 33cfa2fb81
    test: Only run embedding tests for remote::nvidia Yuan Tang 2025-02-27 22:31:10 -05:00
  • f55d812d8e do not swallow first token Hardik Shah 2025-02-27 19:17:44 -08:00
  • e157f0ac89 Merge branch 'main' into max_infer_iters Xi Yan 2025-02-27 19:07:12 -08:00
  • a9f5c5bfca
    fix: Incorrect import path for print_subcommand_description() (#1315) Yuan Tang 2025-02-27 21:50:41 -05:00
  • ef9233ecd6
    fix: Incorrect import path for print_subcommand_description() Yuan Tang 2025-02-27 21:41:08 -05:00
  • f4df3a76d9
    fix: Incorrect import path for print_subcommand_description() (#1314) Yuan Tang 2025-02-27 21:35:49 -05:00
  • 8e79898582
    fix: Incorrect import path for print_subcommand_description() Yuan Tang 2025-02-27 21:30:25 -05:00
  • 3567274183
    fix: Incorrect import path for print_subcommand_description() (#1313) Yuan Tang 2025-02-27 21:24:01 -05:00
  • 805541cb7c
    fix: Incorrect import path for print_subcommand_description() Yuan Tang 2025-02-27 21:14:22 -05:00
  • 076d2f349d
    fix: litellm tool call parsing event type to in_progress (#1312) Xi Yan 2025-02-27 18:00:27 -08:00
  • e64fda8343 fix litellm Xi Yan 2025-02-27 17:44:19 -08:00
  • 2f7683bc5f
    fix: Structured outputs for recursive models (#1311) Hardik Shah 2025-02-27 17:31:53 -08:00
  • 9ba1000cf4
    removed some filler comments Ashwin Bharambe 2025-02-27 17:31:04 -08:00
  • 17ef47e909 verify recursive nature in structured outputs Hardik Shah 2025-02-27 17:21:32 -08:00
  • be5cc85d6f default prompt Eric Huang 2025-02-27 17:03:55 -08:00
  • 94e2186bb8
    chore: add subcommands description in help (#1219) Reid 2025-02-28 09:00:27 +08:00
  • e28cedd833
    feat: add nvidia embedding implementation for new signature, task_type, output_dimention, text_truncation (#1213) Matthew Farrellee 2025-02-27 18:58:11 -06:00
  • 73c6f6126f
    fix: Avoid unexpected keyword argument for sentence_transformers (#1269) Luis Tomas Bolivar 2025-02-28 01:47:26 +01:00
  • c5b6e3845f tmp Xi Yan 2025-02-27 16:47:08 -08:00
  • c2d2a80b0a
    docs: update the output of llama-stack-client models list (#1271) Reid 2025-02-28 08:46:38 +08:00
  • 264c2c46db
    build: Add dotenv file for running tests with uv (#1251) Yuan Tang 2025-02-27 19:42:55 -05:00
  • 04de2f84e9
    fix: register provider model name and HF alias in run.yaml (#1304) Ashwin Bharambe 2025-02-27 16:39:23 -08:00
  • c54164556a
    fix: update notebooks to avoid using the nutsy --image-name __system__ thing (#1308) Ashwin Bharambe 2025-02-27 16:39:04 -08:00
  • af254f8cf3 max infer iters Xi Yan 2025-02-27 16:30:45 -08:00
  • 5eceb79be4 fix: update notebooks to avoid using the nutsy --image-name __system__ thing Ashwin Bharambe 2025-02-27 16:06:14 -08:00
  • a34f3aafcf
    fix: don't include tool args not in the function definition (#1307) ehhuang 2025-02-27 16:25:30 -08:00
  • 1ba3ae03c1 include content, don't include tool args Eric Huang 2025-02-27 16:09:48 -08:00
  • 872f7eead0
    Address comments Yuan Tang 2025-02-27 18:28:04 -05:00
  • 663c6b0537
    fix: duplicate ToolResponseMessage in Turn message history (#1305) Xi Yan 2025-02-27 15:06:47 -08:00
  • ea29d0d3a8 chore: add container cmd check reidliu 2025-02-28 06:54:30 +08:00
  • 6e8dfa727d fix: precommits ugh why wont they run correctly because they dont have the right dependencies Ashwin Bharambe 2025-02-27 15:02:04 -08:00
  • 8ee626b81c rebase Ashwin Bharambe 2025-02-27 14:28:59 -08:00
  • d2b4c7041a alias groq models to their HF aliases Ashwin Bharambe 2025-02-27 13:31:46 -08:00
  • 9f9278f9a8 fix: register provider model name and HF alias in run.yaml Ashwin Bharambe 2025-02-27 12:03:43 -08:00
  • 1d15807f08 precommit Xi Yan 2025-02-27 14:20:48 -08:00
  • 8a0482d845 Merge branch 'main' into debug_duplicate_tools Xi Yan 2025-02-27 14:16:42 -08:00
  • 0560d1f4a2 better checking Xi Yan 2025-02-27 14:14:05 -08:00
  • 4780223544 fix: groq now depends on litellm Ashwin Bharambe 2025-02-27 14:07:12 -08:00
  • e32ed65bef lint Xi Yan 2025-02-27 14:02:06 -08:00
  • a99706ec5e fix Xi Yan 2025-02-27 14:01:18 -08:00
  • 58f9fd135b fix Xi Yan 2025-02-27 13:55:46 -08:00
  • 928a39d17b
    feat(providers): Groq now uses LiteLLM openai-compat (#1303) Ashwin Bharambe 2025-02-27 13:16:50 -08:00
  • a52721948c feat(providers): Groq now uses LiteLLM openai-compat Ashwin Bharambe 2025-02-27 10:08:01 -08:00
  • 32b96d6d91 fix: check conda env name using basepath in exec.py Dinesh Yeduguru 2025-02-27 11:00:05 -08:00
  • 0b9b72dd2c update serialization of values to handle broader types Hardik Shah 2025-02-27 10:56:10 -08:00
  • 564f0e5f93
    fix: Revert "chore: remove vector_db_id from AgentSessionInfo" (#1299) Xi Yan 2025-02-27 10:37:15 -08:00
  • 90c66c2743
    Revert "chore: remove vector_db_id from AgentSessionInfo (#1296)" Xi Yan 2025-02-27 10:34:39 -08:00
  • 200ef29233
    chore: remove vector_db_id from AgentSessionInfo (#1296) Xi Yan 2025-02-27 10:13:10 -08:00
  • 981fc3c93c
    fix(test): no need to specify tool prompt format explicitly in tests (#1295) Ashwin Bharambe 2025-02-27 10:09:57 -08:00
  • 4728d9ff0d fix(test): no need to specify tool prompt format explicitly in tests Ashwin Bharambe 2025-02-27 09:33:08 -08:00
  • 3e6806c250 remove Xi Yan 2025-02-27 09:50:29 -08:00
  • fc5aff3ccf
    feat: ability to retrieve agents session, turn, step by ids (#1286) Xi Yan 2025-02-27 09:45:14 -08:00
  • 9be83920ef generate openapi spec Dinesh Yeduguru 2025-02-05 11:03:37 -08:00
  • 77c2418a9c metrics for completion API Dinesh Yeduguru 2025-02-05 10:54:08 -08:00
  • e9bb96334b create metric_store Dinesh Yeduguru 2025-02-05 10:29:48 -08:00
  • 23c1aa4504 throw exception when promethues is not enabled Dinesh Yeduguru 2025-02-05 10:11:13 -08:00
  • cce217bab8 add API to query metrics Dinesh Yeduguru 2025-02-05 10:06:59 -08:00
  • b180069def make the telemetry API dep optional in inference router Dinesh Yeduguru 2025-02-05 09:25:21 -08:00
  • 52e533dc89 check for text delta type Dinesh Yeduguru 2025-02-04 16:23:32 -08:00
  • 37b7390079 add metrics for streaming Dinesh Yeduguru 2025-02-04 16:20:59 -08:00
  • 38f1337afa make router call telemetry Dinesh Yeduguru 2025-02-04 15:42:33 -08:00
  • a72cdafac0 Add inference token usage metrics Dinesh Yeduguru 2025-02-04 10:45:16 -08:00
  • 61db5d5074 chore: use func for cmd check reidliu 2025-02-27 23:48:37 +08:00
  • 0762c61402
    feat: don't silently ignore incorrect toolgroup (#1285) ehhuang 2025-02-27 05:11:09 -08:00