Commit graph

  • d18bd7187a
    Update index.md Yuan Tang 2025-02-23 22:18:22 -05:00
  • 8eb56f466d
    docs: Add vLLM to the list of inference providers in concepts page Yuan Tang 2025-02-23 22:09:45 -05:00
  • a3fa992af4 feat: add --run to llama stack build Charlie Doern 2025-02-19 14:37:55 -05:00
  • 0fb4dafa24 update ruff pre-commit Ashwin Bharambe 2025-02-23 16:49:22 -08:00
  • 0b37299af6 fix: update virtualenv building so llamastack- prefix is not added, make notebook experience easier Ashwin Bharambe 2025-02-23 16:45:54 -08:00
  • 4d8a96a961 fix the 404 reidliu 2025-02-24 07:26:02 +08:00
  • 8b3110295a docs: update the hyperlink name reidliu 2025-02-24 07:18:44 +08:00
  • 6608c7fed9 refactor: support downloading any model from HF Charlie Doern 2025-02-06 19:48:50 -05:00
  • 5a9cb0fd4f fix: fix the describe table display issue reidliu 2025-02-23 14:20:17 +08:00
  • 9867a4dfef chore: update the zero_to_hero_guide doc link reidliu 2025-02-23 07:58:00 +08:00
  • 4f8f0beae2
    Unregister for ollama remote provider Yuan Tang 2025-02-21 22:38:44 -05:00
  • 6abfe0c43b
    fix: Unregister a model from registry if not being served Yuan Tang 2025-02-20 13:04:15 -05:00
  • 00b2a65084 add nemo retriever text embedding models to nvidia inference provider Matthew Farrellee 2025-02-21 20:49:46 -06:00
  • 301a0689f5 fix typo: doument -> document Matthew Farrellee 2025-02-21 20:42:55 -06:00
  • 81b3e65897 chore: update download error message reidliu 2025-02-22 08:56:31 +08:00
  • 8f221e4a33 test: do not overwrite agent_config Eric Huang 2025-02-21 16:37:20 -08:00
  • 01aef1382c test: fix test_rag_agent test Eric Huang 2025-02-21 15:19:26 -08:00
  • ad5ea63140 feat: add substring search option for model list reidliu 2025-02-20 14:33:41 +08:00
  • b54d896ce7 handle error and update tests for new client Matthew Farrellee 2025-02-21 16:38:39 -06:00
  • e4037d03c2 fix input_type values Matthew Farrellee 2025-02-21 15:47:44 -06:00
  • 610dea84c2 add nvidia embedding implementation for new signature, task_type, output_dimention, text_truncation Matthew Farrellee 2025-02-21 15:41:49 -06:00
  • a76dd8c464 Updated language to say inline instead of local Francisco Javier Arceo 2025-02-21 16:30:47 -05:00
  • c7e683bd29 Update fixtures Ashwin Bharambe 2025-02-21 12:23:08 -08:00
  • 1b64573284 Support non-llama models for inference providers Ashwin Bharambe 2025-02-20 22:41:41 -08:00
  • 36a1e8bc57
    feat(2/n): agent return turn awaiting input for tool calls (#1187) Xi Yan 2025-02-21 11:47:21 -08:00
  • 4dc7f05a2d
    feat(3/n): agent resume_turn (#1194) Xi Yan 2025-02-21 11:43:48 -08:00
  • ea050f7fa8 datetime nit Xi Yan 2025-02-21 11:42:18 -08:00
  • 4683770f68 Two small but important fixes Ashwin Bharambe 2025-02-21 11:21:37 -08:00
  • fba1664cd1 make sqlite_vec deprecated properly Ashwin Bharambe 2025-02-21 11:10:02 -08:00
  • cf744a97f0 Move embedding deps to RAG tool where they are needed Ashwin Bharambe 2025-02-21 10:51:39 -08:00
  • ec8abe1c78 update the template properly Ashwin Bharambe 2025-02-21 10:31:33 -08:00
  • ae1bcb9593 Pull ollama embedding model if necessary Ashwin Bharambe 2025-02-21 10:13:45 -08:00
  • 1c3410b7fe Added a short blob for each section and removed ios Francisco Javier Arceo 2025-02-21 13:19:23 -05:00
  • b301978340 capitalize Faiss Francisco Javier Arceo 2025-02-21 12:21:52 -05:00
  • f13b1c876e updated structure for better navigation in the toctree Francisco Javier Arceo 2025-02-21 12:17:43 -05:00
  • 8d28ef92cb
    Update CONTRIBUTING.md Yuan Tang 2025-02-21 11:28:17 -05:00
  • b323924c18 Updating images so that they are able to run without root access Jamie Land 2025-02-21 10:22:07 -05:00
  • cb31807f8e remove configure reidliu 2025-02-21 22:57:28 +08:00
  • a37dca7e02 fix embedding metadata: embedding_dimensions -> embedding_dimension Matthew Farrellee 2025-02-21 08:02:23 -06:00
  • c1463eed1e Merged back the original features and added more progress output Francisco Javier Arceo 2025-02-21 09:02:45 -05:00
  • 7f625f0732 remove list of list tests, no longer relevant after #1161 Matthew Farrellee 2025-02-21 07:40:42 -06:00
  • 559d63e3e8 update URL import, URL -> ImageContentItemImageURL Matthew Farrellee 2025-02-21 07:30:43 -06:00
  • c344d14415 update wrt latest changes raspawar 2025-02-21 18:53:04 +05:30
  • 940ee698ad chore: move configure deprecated message to help reidliu 2025-02-21 21:01:25 +08:00
  • 7cbf3f8383 fix after merge Vladislav 2025-02-21 13:56:16 +01:00
  • 19f3b23d47 move model_aliases, some fixes for template Vladislav 2025-02-20 15:06:55 +01:00
  • 7bb7597c00 add sentence-transformers provider Vladislav 2025-02-20 14:45:50 +01:00
  • ecdba4b030 add groq distribution template Vladislav 2025-02-20 12:50:36 +01:00
  • d8c65cc771 fix: convert back to model descriptor for model in list --downloaded reidliu 2025-02-21 16:51:41 +08:00
  • 71c737d0bb use resolve_model reidliu 2025-02-21 16:45:24 +08:00
  • 30f97a0de0 remove unecessary code reidliu 2025-02-20 17:43:34 +08:00
  • 8a0917a01b update remove cmd in doc reidliu 2025-02-19 13:18:07 +08:00
  • b4e7ac8f65 feat: model remove cmd reidliu 2025-02-16 22:12:12 +08:00
  • 0f1a9d06db removing unused code Chantal D Gama Rose 2025-02-21 07:34:52 +00:00
  • 830ecb4b28 fixed import Chantal D Gama Rose 2025-02-21 07:27:29 +00:00
  • 23a6255795 fixed more pre-checks Chantal D Gama Rose 2025-02-21 07:26:18 +00:00
  • 66726241aa fixed breaking tests and run pre-commit Chantal D Gama Rose 2025-02-21 07:19:40 +00:00
  • 4f2427c6c8 add doc Xi Yan 2025-02-20 22:55:22 -08:00
  • db764e7ed6 add doc Xi Yan 2025-02-20 22:55:05 -08:00
  • b1b45ed320 add comment Xi Yan 2025-02-20 22:46:17 -08:00
  • fa4a56cf6c refactor Xi Yan 2025-02-20 22:41:23 -08:00
  • 2c06704d63 refactor Xi Yan 2025-02-20 22:40:51 -08:00
  • e9fd8371a8 Update the router Ashwin Bharambe 2025-02-20 22:24:59 -08:00
  • 2c1e8b5956 Update embeddings signatures for all providers Ashwin Bharambe 2025-02-20 22:21:20 -08:00
  • 99bc54b033 fix duplicate tool msg Xi Yan 2025-02-20 22:16:37 -08:00
  • d02f470983
    docs: Add missing uv command and clarify website rebuild Yuan Tang 2025-02-21 01:10:29 -05:00
  • e011491c6b rfc: Add options for supporting various embedding models Ashwin Bharambe 2025-02-20 16:32:03 -08:00
  • 0de38a2b48 Merge branch 'agents-unify-tools-2' into agents-unify-tools-3 Xi Yan 2025-02-20 21:49:56 -08:00
  • e2bfd165d2 add flag allow_turn_resume Xi Yan 2025-02-20 21:49:06 -08:00
  • 9c40529e93 fix tool execution step from tool response Xi Yan 2025-02-20 21:36:50 -08:00
  • 568da8bdb8 fix: pass tool_prompt_format to chat_formatter Eric Huang 2025-02-20 21:23:38 -08:00
  • 378e1603b0 Update OpenAPI Ashwin Bharambe 2025-02-20 21:31:53 -08:00
  • 25613953d5 Update embeddings signature so inputs and outputs list align Ashwin Bharambe 2025-02-19 16:14:22 -08:00
  • 0daadf5f15
    docs: Add missing uv command for docs generation in contributing guide Yuan Tang 2025-02-20 23:09:59 -05:00
  • 40fa74bd11
    docs: Simplify installation guide with uv Yuan Tang 2025-02-20 23:04:06 -05:00
  • 97f9580b1a rename Xi Yan 2025-02-20 19:49:50 -08:00
  • 9a07e709ee rename Xi Yan 2025-02-20 19:48:54 -08:00
  • 6d08a935ba merge Xi Yan 2025-02-20 19:48:01 -08:00
  • 702e74da8e Merge branch 'agents-unify-tools' into agents-unify-tools-2 Xi Yan 2025-02-20 19:46:59 -08:00
  • fa0dfdeac2 resume request Xi Yan 2025-02-20 19:46:43 -08:00
  • 9f2f6c9b30 Merge branch 'agents-unify-tools-2' into agents-unify-tools-3 Xi Yan 2025-02-20 19:45:01 -08:00
  • b14854943f Merge branch 'agents-unify-tools' into agents-unify-tools-2 Xi Yan 2025-02-20 19:43:52 -08:00
  • 122b20c142 continue to resume Xi Yan 2025-02-20 19:42:56 -08:00
  • 5e00e9f260 persist pending tool execution Xi Yan 2025-02-20 19:33:21 -08:00
  • 5f83124113 updated index page, removed subpage, and updated copyright year Francisco Javier Arceo 2025-02-20 21:39:48 -05:00
  • 1e2d5d0731 updated docs Francisco Javier Arceo 2025-02-20 21:20:13 -05:00
  • 4923270122 continue turn Xi Yan 2025-02-20 18:00:57 -08:00
  • 22355e3b1f add back 2/n Xi Yan 2025-02-20 17:53:29 -08:00
  • 157cf320d9 add back 2/n Xi Yan 2025-02-20 17:52:01 -08:00
  • ee3c174bb3 add back 2/n Xi Yan 2025-02-20 17:40:39 -08:00
  • cd36a77e20 3/n Xi Yan 2025-02-20 17:38:21 -08:00
  • 01f90dfe0c Merge branch 'agents-unify-tools-2' into agents-unify-tools-3 Xi Yan 2025-02-20 17:27:27 -08:00
  • 7677f01beb Merge branch 'agents-unify-tools' into agents-unify-tools-2 Xi Yan 2025-02-20 17:27:11 -08:00
  • c7e84253e7 Merge branch 'agents-unify-tools' into agents-unify-tools-3 Xi Yan 2025-02-20 17:26:58 -08:00
  • 8fe38d128d streaming flag Xi Yan 2025-02-20 16:58:45 -08:00
  • 6e03c3fb69 Run distro codegen for real Ashwin Bharambe 2025-02-20 16:58:06 -08:00
  • 587c429e08 update tests given new type signature Ashwin Bharambe 2025-02-20 16:57:17 -08:00
  • 32aa380e69 Run distro codegen Ashwin Bharambe 2025-02-20 16:53:15 -08:00
  • 8706b311ba add NVIDIA Inference embedding provider and tests Matthew Farrellee 2025-02-03 10:52:41 -05:00
  • 5fbb159cf6 fix test Xi Yan 2025-02-20 16:48:17 -08:00