Commit graph

  • 8db49de961
    docker: Install in editable mode for dev purposes (#160) Russell Bryant 2024-09-30 14:56:31 -04:00
  • fd04ad9e1e default entrypoint Xi Yan 2024-09-30 11:53:13 -07:00
  • da33fa0f80 docker: Install in editable mode for dev purposes Russell Bryant 2024-09-30 16:30:04 +00:00
  • 340e134629 clean up debug Xi Yan 2024-09-30 11:02:55 -07:00
  • 827525c77c unique deps Xi Yan 2024-09-30 10:50:42 -07:00
  • 8731cc3304 delete generated Dockerfile Xi Yan 2024-09-30 10:47:44 -07:00
  • 3482adb257 add docker template examples Xi Yan 2024-09-30 09:07:04 -07:00
  • 6cd3e4183f bake run.yaml inside docker, simplify run Xi Yan 2024-09-30 08:49:25 -07:00
  • cb36be320f
    Fix podman+selinux compatibility (#132) Russell Bryant 2024-09-29 23:19:44 -04:00
  • 2bd785354d
    fix broken bedrock inference provider (#151) moritalous 2024-09-30 12:17:58 +09:00
  • 2f096ca509
    accepts not model itself. (#153) Byung Chun Kim 2024-09-30 12:16:50 +09:00
  • 5bf679cab6
    Pull (extract) provider data from the provider instead of pushing from the top (#148) Ashwin Bharambe 2024-09-29 20:00:51 -07:00
  • 78b07ddc92 accepts not model itself. Byung Chun Kim 2024-09-30 02:29:19 +00:00
  • ed84641953 remove prints Zain Hasan 2024-09-29 14:59:10 -04:00
  • c13b2f06af
    Merge branch 'meta-llama:main' into main Zain Hasan 2024-09-29 11:56:29 -07:00
  • 54d7b2f2d7 fix broken bedrock inference provider moritalous 2024-09-29 14:59:30 +00:00
  • cd64371b2e
    Merge branch 'meta-llama:main' into main Pixee OSS Assistant 2024-09-29 07:57:31 -04:00
  • fdadfb6afb Add timeout to requests calls (#1) pixeeai 2024-09-29 07:24:20 -04:00
  • 7274802de9 Pull (extract) provider data from the provider instead of pushing from the top Ashwin Bharambe 2024-09-28 22:04:16 -07:00
  • f6a6598d1a
    [bugfix] fix #146 (#147) Xi Yan 2024-09-28 17:47:00 -07:00
  • 3428c4868d lint Xi Yan 2024-09-28 17:45:38 -07:00
  • 12c0860cae more robust image type Xi Yan 2024-09-28 17:44:45 -07:00
  • b646167d94
    Update README.md Xi Yan 2024-09-28 16:55:22 -07:00
  • 5ce759adc4
    Update README.md Xi Yan 2024-09-28 16:55:08 -07:00
  • 6a8c2ae1df
    [CLI] remove dependency on CONDA_PREFIX in CLI (#144) Xi Yan 2024-09-28 16:46:47 -07:00
  • 390a8fdbe7 more robust Xi Yan 2024-09-28 16:44:28 -07:00
  • 31e235490c Fix TypeError when CONDA_PREFIX is not set Russell Bryant 2024-09-28 23:39:01 +00:00
  • 47b0be1497 typo Xi Yan 2024-09-28 16:37:08 -07:00
  • 0477b29ba9 lint Xi Yan 2024-09-28 16:32:29 -07:00
  • 3ffe51052a remove dependency on CONDA_PREFIX in CLI Xi Yan 2024-09-28 16:21:50 -07:00
  • fe460ba103 Avoid importing a lot of stuff Ashwin Bharambe 2024-09-28 16:05:49 -07:00
  • 4ae8c63a2b pre-commit lint Xi Yan 2024-09-28 16:04:41 -07:00
  • ced5fb6388 Small cleanup for together safety implementation Ashwin Bharambe 2024-09-28 15:47:35 -07:00
  • 940968ee3f
    fixing safety inference and safety adapter for new API spec. Pinned t… (#105) Yogish Baliga 2024-09-28 15:45:38 -07:00
  • 0a3999a9a4
    Use inference APIs for executing Llama Guard (#121) Ashwin Bharambe 2024-09-28 15:40:06 -07:00
  • e61c4954d5 minor Ashwin Bharambe 2024-09-28 15:32:03 -07:00
  • 23028e26ff bugfixes Ashwin Bharambe 2024-09-28 15:21:32 -07:00
  • 37ca22cda6 Use inference APIs for executing Llama Guard Ashwin Bharambe 2024-09-25 19:40:49 -07:00
  • c39ba23508 Fix podman+selinux compatibility Russell Bryant 2024-09-27 14:05:18 +00:00
  • 6236634d84
    [bugfix] fix duplicate api endpoints (#139) Xi Yan 2024-09-27 15:32:50 -07:00
  • d09218a275 remove print Xi Yan 2024-09-27 15:31:20 -07:00
  • 21452a162b fix server api to serve Xi Yan 2024-09-27 15:28:21 -07:00
  • 572c01f454 for agents API, provider data from the header is not parsed as for agents there is no provider_data_validator meta-reference implementation. Added Together data validator as the provider_data_validator for now. Did some code changes accordingly. Yogish Baliga 2024-09-27 14:59:24 -07:00
  • 208b861289
    add env for LLAMA_STACK_CONFIG_DIR (#137) Xi Yan 2024-09-27 14:16:46 -07:00
  • a3fea59eb5 add env for LLAMA_STACK_CONFIG_DIR Xi Yan 2024-09-27 14:11:41 -07:00
  • 43744455d7
    docs: Note how to use podman (#130) Russell Bryant 2024-09-27 17:00:40 -04:00
  • f70c88ab7a
    configure: Fix a error msg typo (#131) Russell Bryant 2024-09-27 17:00:25 -04:00
  • ebb57a0c67 wip Xi Yan 2024-09-27 13:52:23 -07:00
  • f2125667e7 remove library Edward Ma 2024-09-27 13:50:45 -07:00
  • 67ac0e0895 Add SambaNova Adapter Edward Ma 2024-09-27 13:48:24 -07:00
  • 27b63f4de5 adding vision guard to Together safety Yogish Baliga 2024-09-27 13:33:33 -07:00
  • 5828ffd53b
    inference: Fix download command in error msg (#133) Russell Bryant 2024-09-27 16:31:11 -04:00
  • fb9e6371ec
    Validate name in llama stack build (#128) Russell Bryant 2024-09-27 16:30:55 -04:00
  • d7c55f0ad0 fixing model names Yogish Baliga 2024-09-25 21:24:36 -07:00
  • 2b568a462a support Llama 3.2 models in Together inference adapter and cleanup Together safety adapter Yogish Baliga 2024-09-25 17:51:42 -07:00
  • 9bb0c8f4fc fixing safety inference and safety adapter for new API spec. Pinned the llama_models version to 0.0.24 as the latest version 0.0.35 has the model descriptor name changed. I was getting the missing package error during runtime as well, hence added the dependency to requirements.txt Yogish Baliga 2024-09-25 14:14:15 -07:00
  • 53070e34a3
    Update RFC-0001-llama-stack.md (#134) Bhimraj Yadav 2024-09-27 21:59:36 +05:45
  • 787ef09ea3
    Update RFC-0001-llama-stack.md Bhimraj Yadav 2024-09-27 20:58:18 +05:45
  • bec6ab78c2 inference: Fix download command in error msg Russell Bryant 2024-09-27 14:14:07 +00:00
  • 273990659f configure: Fix a error msg typo Russell Bryant 2024-09-27 13:55:55 +00:00
  • e3f98ce97e docs: Note how to use podman Russell Bryant 2024-09-27 13:42:46 +00:00
  • e2a1387b97 Validate name in llama stack build Russell Bryant 2024-09-27 13:21:50 +00:00
  • ecd17ce9e9 reorder output msg Xi Yan 2024-09-26 23:28:42 -07:00
  • 3b807912d2 remove configure outside for docker Xi Yan 2024-09-26 23:11:15 -07:00
  • 0ad0a15810 configure outside container Xi Yan 2024-09-26 21:36:17 -07:00
  • eb526b4d9b
    Update RFC-0001-llama-stack.md Xi Yan 2024-09-26 17:17:08 -07:00
  • 6b0805ebb4
    fix: 404 link to agentic system repository (#118) Moritz Althaus 2024-09-26 23:43:41 +02:00
  • 557ae38289
    Update getting_started.ipynb (#117) Deep Doshi 2024-09-26 14:43:04 -07:00
  • e9562240d2 bake run file into container Xi Yan 2024-09-26 14:38:06 -07:00
  • 35a0a0b577 fix: 404 link to agentic system repository Moritz Althaus 2024-09-26 20:37:55 +02:00
  • 2802ac8e9d
    add llama-stack.png Xi Yan 2024-09-26 11:17:46 -07:00
  • 590ce59043
    Update getting_started.ipynb Deep Doshi 2024-09-26 11:17:28 -07:00
  • 995a1a1d00
    Reordered pip install and llama model download (#112) Karthi Keyan 2024-09-26 23:07:15 +05:30
  • 3c99f08267
    minor typo and HuggingFace -> Hugging Face (#113) Mark Sze 2024-09-27 02:48:23 +10:00
  • 28ba8c1009 minor typo and HuggingFace -> Hugging Face Mark Sze 2024-09-26 19:43:35 +10:00
  • e92115b474
    Reordered pip install and llama model download Karthi Keyan 2024-09-26 13:26:52 +05:30
  • 3ae1597b9b
    load models using hf model id (#108) Kate Plawiak 2024-09-25 18:40:09 -07:00
  • e73e9110b7
    docs: fix typo (#107) JC (Jonathan Chen) 2024-09-25 21:36:31 -04:00
  • 95e4cc52f2
    docs: fix typo JC (Jonathan Chen) 2024-09-25 21:35:30 -04:00
  • 0cfbe319af Merge branch 'main' into lg_safety_fix Kate Plawiak 2024-09-25 18:32:38 -07:00
  • 2f98089f88 load models using hf model id Kate Plawiak 2024-09-25 18:32:15 -07:00
  • d0280138ef
    Update README.md Xi Yan 2024-09-25 17:29:17 -07:00
  • ca7602a642 fix #100 Xi Yan 2024-09-25 15:11:51 -07:00
  • 37be3fb184
    Fix links & format (#104) machina-source 2024-09-25 16:18:46 -05:00
  • 615ed4bfbc
    Make TGI adapter compatible with HF Inference API (#97) Lucain 2024-09-25 23:08:31 +02:00
  • 0f5fe44ec4 Fix links & format machina-source 2024-09-25 15:29:30 -05:00
  • 851c30597a
    chore (doc): fix typo for setup instructionllama-stack to llama-stack-apps (#103) Abhishek 2024-09-26 01:57:55 +05:30
  • fcb3438031
    lint Wauplin 2024-09-25 22:27:06 +02:00
  • bf6c4861c3
    chore (doc): fix typo for setup instructionllama-stack to llama-stack-apps Abhishek 2024-09-26 01:42:18 +05:30
  • c8fa26482d Bump version to 0.0.36 Ashwin Bharambe 2024-09-25 11:58:15 -07:00
  • baf7bb47b9
    Update README.md raghotham 2024-09-25 11:45:47 -07:00
  • 82f420c4f0
    fix safety using inference (#99) Xi Yan 2024-09-25 11:30:27 -07:00
  • 0d19a026a8 fix safety using inference Xi Yan 2024-09-25 11:27:57 -07:00
  • 5c4f73d52f
    Drop header from LocalInference.h Dalton Flanagan 2024-09-25 11:27:37 -07:00
  • d442af0818 Add safety impl for llama guard vision Ashwin Bharambe 2024-09-25 11:06:59 -07:00
  • b3b0349931 Update LocalInference to use public repos Dalton Flanagan 2024-09-25 11:05:03 -07:00
  • 4fcda00872 Re-apply revert Ashwin Bharambe 2024-09-25 11:00:43 -07:00
  • d82a9d94e3 Small fix to the prompt-format error message Ashwin Bharambe 2024-09-25 10:56:13 -07:00
  • a227edb480 Bump version to 0.0.35 Ashwin Bharambe 2024-09-25 10:34:59 -07:00
  • 56aed59eb4
    Support for Llama3.2 models and Swift SDK (#98) Ashwin Bharambe 2024-09-25 10:29:58 -07:00