Commit graph

  • fafab113b4
    Create RFC-0002-preprocessing-endpoint.md Ilya Kolchinsky 2025-02-27 13:15:31 +01:00
  • be055717dd
    ci: add dynamic CI job to test templates Sébastien Han 2025-02-24 12:47:44 +01:00
  • 99b6925ad8
    feat: add nemo retriever text embedding models to nvidia inference provider (#1218) Matthew Farrellee 2025-02-26 23:18:34 -06:00
  • 23b65b6cee
    fix(test): update client-sdk tests to handle tool format parametrization better (#1287) Ashwin Bharambe 2025-02-26 21:16:00 -08:00
  • 6079e727d2 fix returning an iterable for the content param Ashwin Bharambe 2025-02-26 21:08:49 -08:00
  • 1cff4c5907 fix openai-compat / litellm conversions Ashwin Bharambe 2025-02-26 20:56:54 -08:00
  • f389afe024 temp fix Eric Huang 2025-02-26 20:44:26 -08:00
  • 4c0f122a4b fix(test): update client-sdk tests to handle tool format parametrization better Ashwin Bharambe 2025-02-26 20:36:57 -08:00
  • b4225d1ed5 remove agents monitoring notebook Xi Yan 2025-02-26 15:50:37 -08:00
  • 843a93c0be update getting started Xi Yan 2025-02-26 15:50:04 -08:00
  • 6ff7ea127f fix sqlite_vec by using local thread Kai Wu 2025-02-26 15:46:19 -08:00
  • 30ef1c3680
    feat: Add model context protocol tools with ollama provider (#1283) Shrey 2025-02-26 18:38:18 -05:00
  • 0e40fe9f00 Add model context protocol tools with ollama provider Shreyanand 2025-02-26 17:24:29 -05:00
  • f42dc48986
    Merge branch 'meta-llama:main' into fix-ollama-rag Kai Wu 2025-02-26 15:26:45 -08:00
  • 961e098bbb remove prints Xi Yan 2025-02-26 15:20:21 -08:00
  • ec62020a20 retrieve session / turn / steps Xi Yan 2025-02-26 15:16:31 -08:00
  • 2250ab7274
    fix: don't attempt to clean gpu memory up when device is cpu (#1191) Ihar Hrachyshka 2025-02-26 18:12:11 -05:00
  • 21c547aa21
    chore: upgrade uv pre-commit version, uv-sync -> uv-lock (#1284) Ashwin Bharambe 2025-02-26 14:57:48 -08:00
  • e8a2733e20 feat: don't silently ignore incorrect toolgroup Eric Huang 2025-02-26 14:57:36 -08:00
  • 48174b5422 Merge branch 'main' into export_agent_dataset Xi Yan 2025-02-26 14:55:50 -08:00
  • 36ff793ca3 tmp export dataset Xi Yan 2025-02-26 14:54:49 -08:00
  • 54a576e739 chore: upgrade uv pre-commit version, uv-sync -> uv-lock Ashwin Bharambe 2025-02-26 14:49:31 -08:00
  • 270d64007a
    fix: sqlite conn (#1282) ehhuang 2025-02-26 14:44:31 -08:00
  • 80fba6247d fix: sqlite conn Eric Huang 2025-02-26 14:08:17 -08:00
  • c8a20b8ed0
    feat: allow specifying specific tool within toolgroup (#1239) ehhuang 2025-02-26 14:07:05 -08:00
  • 657efc67bc fix: bump up registry key version to clear off stale entries in dbs Ashwin Bharambe 2025-02-26 13:58:03 -08:00
  • 3f0b8c25aa fix: run uv-sync manually. locally pre-commit is not triggering Ashwin Bharambe 2025-02-26 13:53:57 -08:00
  • 5e3ee76acf feat: allow specifying specific tool within toolgroup Eric Huang 2025-02-26 13:51:49 -08:00
  • fca84db5b0
    fix: time logging format (#1281) ehhuang 2025-02-26 13:51:33 -08:00
  • 6b075e5075 feat: automatically update documentation version based on pyproject.toml source of truth Ashwin Bharambe 2025-02-26 13:41:54 -08:00
  • 6aacf4dd61 fix: time logging format Eric Huang 2025-02-26 13:41:53 -08:00
  • 9a3db9a290
    feat: update the post training notebook (#1280) Botao Chen 2025-02-26 13:39:16 -08:00
  • a33569e7e7 refine Botao Chen 2025-02-26 13:23:57 -08:00
  • 20a7af15a9 update notebook Botao Chen 2025-02-26 13:18:40 -08:00
  • bb2690f176
    feat: remove special handling of builtin::rag tool (#1015) ehhuang 2025-02-26 13:04:52 -08:00
  • 658465e088 always include the type field in requests Matthew Farrellee 2025-02-26 15:30:23 -05:00
  • c64f0d5888
    fix: Get builtin tool calling working in remote-vllm (#1236) Ben Browning 2025-02-26 15:25:47 -05:00
  • 2ed2c0bd26
    fix(cli): Missing default for --image-type in stack run command (#1274) Yuan Tang 2025-02-26 15:23:44 -05:00
  • 4cf95475e5 fix: make vision and embedding tests pass with openai, anthropic and gemini Ashwin Bharambe 2025-02-26 10:52:33 -08:00
  • 53a1698ec3 support nvidia hosted vision models Matthew Farrellee 2025-02-26 14:10:26 -05:00
  • 02a6376f13
    Update CONTRIBUTING.md Yuan Tang 2025-02-26 12:31:25 -05:00
  • 73771a43da
    fix(agent): Raise exception when tool call has empty name Yuan Tang 2025-02-25 21:45:24 -05:00
  • 1de0d1906f docs: update the output of llama-stack-client models list reidliu 2025-02-26 16:48:55 +08:00
  • 86a3c6da88 update Models to models reidliu 2025-02-26 20:27:18 +08:00
  • 52efe45e9f chore: update model list reidliu 2025-02-26 18:48:54 +08:00
  • baa5193be8 fix: Avoid unexpected keyword argument for sentence_transformers Luis Tomas Bolivar 2025-02-26 09:14:52 +01:00
  • 64767578d6
    fix(CLI): Missing default for --image-type in stack run command Yuan Tang 2025-02-26 05:11:54 -05:00
  • abfc4b3bce
    fix: the pre-commit new line issue (#1272) Reid 2025-02-26 17:25:41 +08:00
  • 2f01bcdae2 update reidliu 2025-02-26 17:11:04 +08:00
  • bab75d7acb fix the pre-commit new line issue1 reidliu 2025-02-26 17:03:59 +08:00
  • 0f4f8abf8e fix the pre-commit new line issue reidliu 2025-02-26 17:01:32 +08:00
  • 123fb9eb24
    feat: [post training] support save hf safetensor format checkpoint (#845) Botao Chen 2025-02-25 23:29:08 -08:00
  • f227045b6b refine Botao Chen 2025-02-25 23:28:05 -08:00
  • 0da8974526 docs: update build doc reidliu 2025-02-26 09:46:26 +08:00
  • 87be396e47 docs: update the downloaded list doc reidliu 2025-02-26 11:52:52 +08:00
  • 63e6acd0c3
    feat: add (openai, anthropic, gemini) providers via litellm (#1267) Ashwin Bharambe 2025-02-25 22:07:33 -08:00
  • 3cd387aff6 fix ci-tests distro Ashwin Bharambe 2025-02-25 22:04:51 -08:00
  • bf8283a925 feat: add (openai, anthropic, gemini) providers via litellm Ashwin Bharambe 2025-02-25 12:13:58 -08:00
  • b0310af177
    refactor: move OpenAI compat utilities from nvidia to openai_compat (#1258) Ashwin Bharambe 2025-02-25 22:02:11 -08:00
  • 82799a55bb
    chore: removed executorch submodule (#1265) Jeff Tang 2025-02-25 21:57:21 -08:00
  • 3a002f6cf1
    chore: update download error message (#1217) Reid 2025-02-26 13:38:10 +08:00
  • 56c1a50b86
    fix: fix the describe table display issue (#1221) Reid 2025-02-26 13:34:53 +08:00
  • 929c5f0842
    refactor(server): replace print statements with logger (#1250) Sébastien Han 2025-02-26 06:31:37 +01:00
  • 88768a93eb small enhancement, immaterial mostly Ashwin Bharambe 2025-02-25 14:47:51 -08:00
  • fea9ef59b7 Move OpenAI compat utilities from nvidia to openai_compat Ashwin Bharambe 2025-02-25 13:21:45 -08:00
  • eb743a3b26
    build: Merge redundant "files" field for codegen check in .pre-commit-config.yaml (#1261) Yuan Tang 2025-02-25 23:56:22 -05:00
  • cc24967f8c removed executorch submodule Jeff Tang 2025-02-25 19:48:11 -08:00
  • 14822c4028 fix: fix the pre-commit issue reidliu 2025-02-26 10:10:58 +08:00
  • 81aed4c1e7 upload notebook Botao Chen 2025-02-25 17:35:23 -08:00
  • da5357f09c feat: remove special handling of builtin::rag tool Eric Huang 2025-02-25 17:24:36 -08:00
  • 55eb257459
    chore: update the zero_to_hero_guide doc link (#1220) Reid 2025-02-26 09:16:02 +08:00
  • 822ffe9f2e
    Fix diff Yuan Tang 2025-02-25 20:06:58 -05:00
  • de777be9ee
    build: Merge redundant files field in .pre-commit-config.yaml Yuan Tang 2025-02-25 16:41:09 -05:00
  • c0c7622295
    fix: dont assume SentenceTransformer is imported Hardik Shah 2025-02-25 16:53:01 -08:00
  • 32e89191c2 fix ollama.py bug Kai Wu 2025-02-25 15:08:48 -08:00
  • 967cff4533
    feat: Add Groq distribution template (#1173) Vladislav Bronzov 2025-02-25 23:16:56 +01:00
  • 99c1d4c456
    docs: Remove $ from client CLI ref to add valid copy and paste ability (#1260) Kelly Brown 2025-02-25 16:50:00 -05:00
  • 733b9c07b5 pre-commit Kai Wu 2025-02-25 13:42:02 -08:00
  • deae02f313 Docs: Remove $ from client CLI ref to add valid copy and paste ability Kelly Brown 2025-02-25 16:41:44 -05:00
  • dd9bb9300a
    Update based on feedback Yuan Tang 2025-02-25 16:34:45 -05:00
  • 0885f959f1
    fix: update index.md to include 0.1.4 (#1259) raghotham 2025-02-25 13:34:29 -08:00
  • 25c375471b
    Update index.md to include 0.1.4 raghotham 2025-02-25 13:32:49 -08:00
  • fdc620857c Update Ashwin Bharambe 2025-02-25 13:21:45 -08:00
  • de1e70f7d8 Update Ashwin Bharambe 2025-02-25 12:16:35 -08:00
  • 39fbe9c608 Update (base update) Ashwin Bharambe 2025-02-25 12:16:35 -08:00
  • 3a31611486
    feat: completing text /chat-completion and /completion tests (#1223) LESSuseLESS 2025-02-25 11:37:04 -08:00
  • 8a86a96786 Adding Containerfile for playground and GitHub workflow Jamie Land 2025-02-25 11:47:10 -05:00
  • ed2bd60bd9 add ollama embedding config and fix sqlite_vec db Kai Wu 2025-02-25 11:25:23 -08:00
  • 9b130f96a7
    fix: build_venv expects an extra argument (#1233) Charlie Doern 2025-02-25 14:08:50 -05:00
  • c223b1862b
    fix: resolve type hint issues and import dependencies (#1176) Sébastien Han 2025-02-25 20:06:47 +01:00
  • 056432fb14 "feat: completing text /chat-completion and /completion provider and e1e tests" Haiping Zhao 2025-02-23 10:15:08 -08:00
  • 1a044ef894
    fix: Raise exception when tool call result is None (#1253) Yuan Tang 2025-02-25 13:10:50 -05:00
  • 73a0c7a0e7
    LocalInferenceImpl update for LS013 (#1242) Jeff Tang 2025-02-25 09:58:34 -08:00
  • dc3c881ffe
    fix: include timezone in Agent steps' timestamps (#1247) ehhuang 2025-02-25 09:49:25 -08:00
  • 902eb9dae7 fix: include timezone in Agent steps' timestamps Eric Huang 2025-02-25 09:47:35 -08:00
  • 5a05553e93 feat: adding a Makefile and a test script for sqlite-vec Francisco Javier Arceo 2025-02-25 12:45:53 -05:00
  • f6a0c3e97d
    fix: Raise exception when tool call result is None Yuan Tang 2025-02-25 12:17:36 -05:00
  • 0942329ec6
    docs: move sections from README to docs Sébastien Han 2025-02-18 21:16:35 +01:00
  • 8eb4f7fcb6
    refactor(server): replace print statements with logger Sébastien Han 2025-02-25 15:00:55 +01:00
  • 8eefbfecdd skip -> xfail for image tests Matthew Farrellee 2025-02-25 11:03:38 -05:00