Commit graph

  • 9aec73ffe3 added dependency Chantal D Gama Rose 2025-02-24 19:15:26 +00:00
  • f47ebb54f8 adding back requests as dep Chantal D Gama Rose 2025-02-24 19:08:14 +00:00
  • d8e864da34 fixed pre-commit checks Chantal D Gama Rose 2025-02-24 19:05:48 +00:00
  • ca6a12e362 Merge remote-tracking branch 'upstream/main' into add_nvidia_safety_provider Chantal D Gama Rose 2025-02-24 18:55:08 +00:00
  • b9564fb435 Fixes for safety provider added to nvidia distro Chantal D Gama Rose 2025-02-24 18:52:56 +00:00
  • 3aa3937560 fix: build_venv expects an extra argument Charlie Doern 2025-02-24 11:06:03 -05:00
  • c4987bc349
    fix: avoid failure when no special pip deps and better exit (#1228) Sébastien Han 2025-02-24 19:18:52 +01:00
  • 9a80776c0e fix: set default tool_prompt_format in inference api Eric Huang 2025-02-24 10:09:57 -08:00
  • d6356f822a fix: remove UV_SYSTEM_PYTHON from getting started notebook since llama stack build detects notebook environment Ashwin Bharambe 2025-02-24 10:05:02 -08:00
  • e8e8fe7c93 fix: add LLAMA_STACK_CLIENT_DIR mount when installing in docker from source Ashwin Bharambe 2025-02-24 10:00:57 -08:00
  • fea46fb334 Release candidate 0.1.4.dev3 v0.1.4.dev3 github-actions[bot] 2025-02-24 17:36:59 +00:00
  • 641549c631 Add llama stack client overrides also; necessary for correct docker building Ashwin Bharambe 2025-02-24 07:51:02 -08:00
  • 238d69b3cf Update llama_stack/distribution/start_stack.sh Ashwin Bharambe 2025-02-21 08:58:03 -08:00
  • a6b1294d39 refactor: combine start scripts for each env Charlie Doern 2025-02-18 10:09:04 -05:00
  • c8e9b19aca
    docs: remove redundant installation instructions Sébastien Han 2025-02-18 16:56:00 +01:00
  • 0c4591a73d fix: image_name defaults to none Charlie Doern 2025-02-23 19:51:30 -05:00
  • 8dc9be9aa3 Ensure build_container does not fail on missing special_pip_deps Luis Tomas Bolivar 2025-02-24 14:57:39 +01:00
  • 1842eeb96f
    docs: small fixes (#1224) Reid 2025-02-24 20:59:58 +08:00
  • bc89d8b5c9
    fix: avoid failure when no special pip deps and better exit Sébastien Han 2025-02-24 12:33:36 +01:00
  • 36794a1c02
    fix: resolve type hint issues and import dependencies Sébastien Han 2025-02-20 17:18:32 +01:00
  • 0973d386e6 fix: update build_container.sh to ensure llama-models is installed first Ashwin Bharambe 2025-02-23 21:47:18 -08:00
  • 17162b9978
    docs: Add vLLM to the list of inference providers in concepts and providers pages (#1227) Yuan Tang 2025-02-23 23:16:30 -05:00
  • d18bd7187a
    Update index.md Yuan Tang 2025-02-23 22:18:22 -05:00
  • 8eb56f466d
    docs: Add vLLM to the list of inference providers in concepts page Yuan Tang 2025-02-23 22:09:45 -05:00
  • 34e3faa4e8
    feat: add --run to llama stack build (#1156) v0.1.4rc2 Charlie Doern 2025-02-23 22:06:09 -05:00
  • a3fa992af4 feat: add --run to llama stack build Charlie Doern 2025-02-19 14:37:55 -05:00
  • 6227e1e3b9
    fix: update virtualenv building so llamastack- prefix is not added, make notebook experience easier (#1225) Ashwin Bharambe 2025-02-23 16:57:11 -08:00
  • 0fb4dafa24 update ruff pre-commit Ashwin Bharambe 2025-02-23 16:49:22 -08:00
  • 0b37299af6 fix: update virtualenv building so llamastack- prefix is not added, make notebook experience easier Ashwin Bharambe 2025-02-23 16:45:54 -08:00
  • 4d8a96a961 fix the 404 reidliu 2025-02-24 07:26:02 +08:00
  • 8b3110295a docs: update the hyperlink name reidliu 2025-02-24 07:18:44 +08:00
  • 6608c7fed9 refactor: support downloading any model from HF Charlie Doern 2025-02-06 19:48:50 -05:00
  • 5a9cb0fd4f fix: fix the describe table display issue reidliu 2025-02-23 14:20:17 +08:00
  • 9867a4dfef chore: update the zero_to_hero_guide doc link reidliu 2025-02-23 07:58:00 +08:00
  • 19ae4b35d9
    docs: Adding Provider sections to docs (#1195) Francisco Arceo 2025-02-22 12:59:34 -07:00
  • 4f8f0beae2
    Unregister for ollama remote provider Yuan Tang 2025-02-21 22:38:44 -05:00
  • 6abfe0c43b
    fix: Unregister a model from registry if not being served Yuan Tang 2025-02-20 13:04:15 -05:00
  • 00b2a65084 add nemo retriever text embedding models to nvidia inference provider Matthew Farrellee 2025-02-21 20:49:46 -06:00
  • 301a0689f5 fix typo: doument -> document Matthew Farrellee 2025-02-21 20:42:55 -06:00
  • 81b3e65897 chore: update download error message reidliu 2025-02-22 08:56:31 +08:00
  • b890d7a611 Test be not having prints yo Ashwin Bharambe 2025-02-21 16:43:00 -08:00
  • c9e08cc0a8
    test: do not overwrite agent_config (#1216) v0.1.4rc1 ehhuang 2025-02-21 16:38:56 -08:00
  • 187524d4ae
    feat: add substring search for model list (#1099) Reid 2025-02-22 08:38:10 +08:00
  • 8f221e4a33 test: do not overwrite agent_config Eric Huang 2025-02-21 16:37:20 -08:00
  • 5be628f637 Add test jsons to MANIFEST for now Ashwin Bharambe 2025-02-21 16:25:51 -08:00
  • 45ffe87d7c Kill noise from test output Ashwin Bharambe 2025-02-21 15:37:23 -08:00
  • bf38d0aba0
    test: fix test_rag_agent test (#1215) ehhuang 2025-02-21 15:24:28 -08:00
  • 01aef1382c test: fix test_rag_agent test Eric Huang 2025-02-21 15:19:26 -08:00
  • ad5ea63140 feat: add substring search option for model list reidliu 2025-02-20 14:33:41 +08:00
  • e7d261ef4a Fix test infra, sentence embeddings mixin Ashwin Bharambe 2025-02-21 15:10:10 -08:00
  • b54d896ce7 handle error and update tests for new client Matthew Farrellee 2025-02-21 16:38:39 -06:00
  • 182608d4bf better test naming Ashwin Bharambe 2025-02-21 14:24:09 -08:00
  • e4037d03c2 fix input_type values Matthew Farrellee 2025-02-21 15:47:44 -06:00
  • 610dea84c2 add nvidia embedding implementation for new signature, task_type, output_dimention, text_truncation Matthew Farrellee 2025-02-21 15:41:49 -06:00
  • a76dd8c464 Updated language to say inline instead of local Francisco Javier Arceo 2025-02-21 16:30:47 -05:00
  • ab54b8cd58
    feat(providers): support non-llama models for inference providers (#1200) Ashwin Bharambe 2025-02-21 13:21:28 -08:00
  • 9bbe34694d
    ci: add mypy for static type checking (#1101) Sébastien Han 2025-02-21 22:15:40 +01:00
  • 25fddccfd8
    feat: tool outputs metadata (#1155) ehhuang 2025-02-21 13:15:31 -08:00
  • 36162c8c82 fix(ollama): register model with the helper first so it gets normalized Ashwin Bharambe 2025-02-21 12:51:38 -08:00
  • c7e683bd29 Update fixtures Ashwin Bharambe 2025-02-21 12:23:08 -08:00
  • 1b64573284 Support non-llama models for inference providers Ashwin Bharambe 2025-02-20 22:41:41 -08:00
  • 0fe071764f
    feat(1/n): api: unify agents for handling server & client tools (#1178) Xi Yan 2025-02-21 11:48:27 -08:00
  • 36a1e8bc57
    feat(2/n): agent return turn awaiting input for tool calls (#1187) Xi Yan 2025-02-21 11:47:21 -08:00
  • 4dc7f05a2d
    feat(3/n): agent resume_turn (#1194) Xi Yan 2025-02-21 11:43:48 -08:00
  • ea050f7fa8 datetime nit Xi Yan 2025-02-21 11:42:18 -08:00
  • 992f865b2e
    chore: move embedding deps to RAG tool where they are needed (#1210) Ashwin Bharambe 2025-02-21 11:33:41 -08:00
  • 4683770f68 Two small but important fixes Ashwin Bharambe 2025-02-21 11:21:37 -08:00
  • fba1664cd1 make sqlite_vec deprecated properly Ashwin Bharambe 2025-02-21 11:10:02 -08:00
  • cf744a97f0 Move embedding deps to RAG tool where they are needed Ashwin Bharambe 2025-02-21 10:51:39 -08:00
  • 11697f85c5
    fix: pull ollama embedding model if necessary (#1209) Ashwin Bharambe 2025-02-21 10:35:56 -08:00
  • ec8abe1c78 update the template properly Ashwin Bharambe 2025-02-21 10:31:33 -08:00
  • ae1bcb9593 Pull ollama embedding model if necessary Ashwin Bharambe 2025-02-21 10:13:45 -08:00
  • 1c3410b7fe Added a short blob for each section and removed ios Francisco Javier Arceo 2025-02-21 13:19:23 -05:00
  • b301978340 capitalize Faiss Francisco Javier Arceo 2025-02-21 12:21:52 -05:00
  • f13b1c876e updated structure for better navigation in the toctree Francisco Javier Arceo 2025-02-21 12:17:43 -05:00
  • 840fae2259
    fix: Updating images so that they are able to run without root access (#1208) Jamie Land 2025-02-21 11:32:56 -05:00
  • 6634864b19
    docs: Add missing uv command and clarify website rebuild (#1199) Yuan Tang 2025-02-21 11:29:32 -05:00
  • 8d28ef92cb
    Update CONTRIBUTING.md Yuan Tang 2025-02-21 11:28:17 -05:00
  • 9898589f12
    fix: convert back to model descriptor for model in list --downloaded (#1201) Reid 2025-02-22 00:10:34 +08:00
  • da9f0b7869
    test(client-sdk): Update embedding test types to use latest imports (#1203) Rashmi Pawar 2025-02-21 21:39:17 +05:30
  • 46da187c07
    fix: remove list of list tests, no longer relevant after #1161 (#1205) Matthew Farrellee 2025-02-21 10:07:35 -06:00
  • d2701b0d6a
    chore: remove configure subcommand (#1202) Reid 2025-02-22 00:06:25 +08:00
  • c9c4a3c921
    feat: model remove cmd (#1128) Reid 2025-02-22 00:05:12 +08:00
  • 3099c5243f
    fix: update URL import, URL -> ImageContentItemImageURL (#1204) Matthew Farrellee 2025-02-21 10:02:21 -06:00
  • b323924c18 Updating images so that they are able to run without root access Jamie Land 2025-02-21 10:22:07 -05:00
  • cb31807f8e remove configure reidliu 2025-02-21 22:57:28 +08:00
  • a37dca7e02 fix embedding metadata: embedding_dimensions -> embedding_dimension Matthew Farrellee 2025-02-21 08:02:23 -06:00
  • c1463eed1e Merged back the original features and added more progress output Francisco Javier Arceo 2025-02-21 09:02:45 -05:00
  • 7f625f0732 remove list of list tests, no longer relevant after #1161 Matthew Farrellee 2025-02-21 07:40:42 -06:00
  • 559d63e3e8 update URL import, URL -> ImageContentItemImageURL Matthew Farrellee 2025-02-21 07:30:43 -06:00
  • c344d14415 update wrt latest changes raspawar 2025-02-21 18:53:04 +05:30
  • 940ee698ad chore: move configure deprecated message to help reidliu 2025-02-21 21:01:25 +08:00
  • 7cbf3f8383 fix after merge Vladislav 2025-02-21 13:56:16 +01:00
  • 19f3b23d47 move model_aliases, some fixes for template Vladislav 2025-02-20 15:06:55 +01:00
  • 7bb7597c00 add sentence-transformers provider Vladislav 2025-02-20 14:45:50 +01:00
  • ecdba4b030 add groq distribution template Vladislav 2025-02-20 12:50:36 +01:00
  • d8c65cc771 fix: convert back to model descriptor for model in list --downloaded reidliu 2025-02-21 16:51:41 +08:00
  • 71c737d0bb use resolve_model reidliu 2025-02-21 16:45:24 +08:00
  • 30f97a0de0 remove unecessary code reidliu 2025-02-20 17:43:34 +08:00
  • 8a0917a01b update remove cmd in doc reidliu 2025-02-19 13:18:07 +08:00