Commit graph

  • 0f66ae0f61
    Add function for stopping inference (#224) kebbbnnn 2024-10-09 22:50:19 +08:00
  • 6b094b72d3
    Update cli_reference.md Xi Yan 2024-10-08 15:32:06 -07:00
  • ce70d21f65
    Add files via upload Xi Yan 2024-10-08 15:29:19 -07:00
  • 2d4f7d8acf
    Create SECURITY.md Dalton Flanagan 2024-10-08 13:30:40 -04:00
  • 48d0d2001e
    Add classifiers in setup.py (#217) Yuan Tang 2024-10-08 09:55:16 -04:00
  • 4d5f7459aa
    [bugfix] Fix logprobs on meta-reference impl (#213) Xi Yan 2024-10-07 19:42:39 -07:00
  • e4ae09d090
    Add .idea to .gitignore (#216) Yuan Tang 2024-10-07 22:38:43 -04:00
  • 16ba0fa06f
    Update README.md Xi Yan 2024-10-07 11:24:27 -07:00
  • 996efa9b42
    README.md: Add vLLM to providers table (#207) Russell Bryant 2024-10-07 13:26:52 -04:00
  • 2366e18873
    refactor docs (#209) Xi Yan 2024-10-07 10:21:26 -07:00
  • 53d440e952
    Fix ValueError in case chunks are empty (#206) Mindaugas 2024-10-07 18:55:06 +03:00
  • a4e775c465
    download: improve help text (#204) Russell Bryant 2024-10-07 11:40:04 -04:00
  • 4263764493 Fix adapter_id -> adapter_type for Weaviate Ashwin Bharambe 2024-10-07 06:46:32 -07:00
  • f4f7618120
    add Weaviate memory adapter (#95) Zain Hasan 2024-10-07 01:21:50 -04:00
  • 27587f32bc fix db path Xi Yan 2024-10-06 11:46:08 -07:00
  • cfe3ad33b3 fix db path Xi Yan 2024-10-06 11:45:35 -07:00
  • 7abab7604b
    add databricks provider (#83) Prithu Dasgupta 2024-10-05 23:35:54 -07:00
  • f73e247ba1
    Inline vLLM inference provider (#181) Russell Bryant 2024-10-06 02:34:16 -04:00
  • 29138a5167
    Update getting_started.md Xi Yan 2024-10-05 12:28:02 -07:00
  • 6d4013ac99
    Update getting_started.md Xi Yan 2024-10-05 12:14:59 -07:00
  • 9d16129603
    Add 'url' property to Redis KV config (#192) Mindaugas 2024-10-05 21:26:26 +03:00
  • bfb0e92034 Bump version to 0.0.40 Ashwin Bharambe 2024-10-04 09:33:43 -07:00
  • dc75aab547 Add setuptools dependency Ashwin Bharambe 2024-10-04 09:30:54 -07:00
  • 441052b0fd avoid jq since non-standard on macOS Dalton Flanagan 2024-10-04 10:11:43 -04:00
  • 9bf2e354ae CLI now requires jq Dalton Flanagan 2024-10-04 10:05:59 -04:00
  • 00ed9a410b
    Update getting_started.md raghotham 2024-10-03 23:28:43 -07:00
  • 734f59d3b8
    Check that the model is found before use. (#182) AshleyT3 2024-10-03 23:24:47 -07:00
  • f913b57397 fix fp8 imports Ashwin Bharambe 2024-10-03 14:40:21 -07:00
  • 8d41e6caa9 Bump version to 0.0.39 Ashwin Bharambe 2024-10-03 11:31:03 -07:00
  • 7f49315822 Kill a derpy import Ashwin Bharambe 2024-10-03 11:25:58 -07:00
  • 62d266f018
    [CLI] avoid configure twice (#171) Xi Yan 2024-10-03 11:20:54 -07:00
  • 06db9213b1
    inference: Add model option to client (#170) Russell Bryant 2024-10-03 14:18:57 -04:00
  • 210b71b0ba
    fix prompt guard (#177) Ashwin Bharambe 2024-10-03 11:07:53 -07:00
  • b9b1e8b08b
    [bugfix] conda path lookup (#179) Xi Yan 2024-10-03 10:45:16 -07:00
  • d74501f75c
    Update README.md raghotham 2024-10-03 10:21:16 -07:00
  • c02a90e4c8 Bump version to 0.0.38 Ashwin Bharambe 2024-10-03 05:42:47 -07:00
  • e9f6150588 A bit cleanup to avoid breakages Ashwin Bharambe 2024-10-02 21:31:09 -07:00
  • 988a9cada3 Don't ask for Api.inspect in stack build Ashwin Bharambe 2024-10-02 21:10:56 -07:00
  • 19ce6bf009 Don't validate prompt-guard anymore Ashwin Bharambe 2024-10-02 20:43:57 -07:00
  • 703ab9385f fix routing table key list Xi Yan 2024-10-02 18:23:02 -07:00
  • 8d049000e3 Add an introspection "Api.inspect" API Ashwin Bharambe 2024-10-02 15:13:24 -07:00
  • 01d93be948
    Adds markdown-link-check and fixes a broken link (#165) Adrian Cole 2024-10-03 05:26:20 +08:00
  • fe4aabd690 provider_id => provider_type, adapter_id => adapter_type Ashwin Bharambe 2024-10-02 14:05:59 -07:00
  • df68db644b Refactoring distribution/distribution.py Ashwin Bharambe 2024-10-02 13:20:17 -07:00
  • 546f05bd3f No automatic pager Ashwin Bharambe 2024-10-02 12:25:54 -07:00
  • 204eb6d810
    docker: Check for selinux before using --security-opt (#167) Russell Bryant 2024-10-02 13:37:41 -04:00
  • 9b93ee2c2b Bump version to 0.0.37 Ashwin Bharambe 2024-10-02 10:15:08 -07:00
  • 227b69e6e6 Fix sample memory impl Ashwin Bharambe 2024-10-02 10:13:09 -07:00
  • 335dea849a fix sample impls Ashwin Bharambe 2024-10-02 10:09:36 -07:00
  • bf0d111c53 Fix build script Ashwin Bharambe 2024-10-02 10:04:23 -07:00
  • 4a75d922a9 Make Llama Guard 1B the default Ashwin Bharambe 2024-10-02 09:48:26 -07:00
  • cc5029a716 Add special case for prompt guard Ashwin Bharambe 2024-10-02 08:38:23 -07:00
  • a80b707ff8 Ensure we always ask for pydantic>=2 Ashwin Bharambe 2024-10-02 06:29:06 -07:00
  • eb2d8a31a5
    Add a RoutableProvider protocol, support for multiple routing keys (#163) Ashwin Bharambe 2024-09-30 17:30:21 -07:00
  • 73decb3781 re-build from name Xi Yan 2024-09-30 16:22:52 -07:00
  • 4897bf2f85 allow --name to re-build from config Xi Yan 2024-09-30 16:18:12 -07:00
  • d28c3dfe0f
    [CLI] simplify docker run (#159) Xi Yan 2024-09-30 15:04:04 -07:00
  • 8db49de961
    docker: Install in editable mode for dev purposes (#160) Russell Bryant 2024-09-30 14:56:31 -04:00
  • cb36be320f
    Fix podman+selinux compatibility (#132) Russell Bryant 2024-09-29 23:19:44 -04:00
  • 2bd785354d
    fix broken bedrock inference provider (#151) moritalous 2024-09-30 12:17:58 +09:00
  • 2f096ca509
    accepts not model itself. (#153) Byung Chun Kim 2024-09-30 12:16:50 +09:00
  • 5bf679cab6
    Pull (extract) provider data from the provider instead of pushing from the top (#148) Ashwin Bharambe 2024-09-29 20:00:51 -07:00
  • f6a6598d1a
    [bugfix] fix #146 (#147) Xi Yan 2024-09-28 17:47:00 -07:00
  • b646167d94
    Update README.md Xi Yan 2024-09-28 16:55:22 -07:00
  • 5ce759adc4
    Update README.md Xi Yan 2024-09-28 16:55:08 -07:00
  • 6a8c2ae1df
    [CLI] remove dependency on CONDA_PREFIX in CLI (#144) Xi Yan 2024-09-28 16:46:47 -07:00
  • fe460ba103 Avoid importing a lot of stuff Ashwin Bharambe 2024-09-28 16:05:49 -07:00
  • 4ae8c63a2b pre-commit lint Xi Yan 2024-09-28 16:04:41 -07:00
  • ced5fb6388 Small cleanup for together safety implementation Ashwin Bharambe 2024-09-28 15:47:35 -07:00
  • 940968ee3f
    fixing safety inference and safety adapter for new API spec. Pinned t… (#105) Yogish Baliga 2024-09-28 15:45:38 -07:00
  • 0a3999a9a4
    Use inference APIs for executing Llama Guard (#121) Ashwin Bharambe 2024-09-28 15:40:06 -07:00
  • 6236634d84
    [bugfix] fix duplicate api endpoints (#139) Xi Yan 2024-09-27 15:32:50 -07:00
  • 208b861289
    add env for LLAMA_STACK_CONFIG_DIR (#137) Xi Yan 2024-09-27 14:16:46 -07:00
  • 43744455d7
    docs: Note how to use podman (#130) Russell Bryant 2024-09-27 17:00:40 -04:00
  • f70c88ab7a
    configure: Fix a error msg typo (#131) Russell Bryant 2024-09-27 17:00:25 -04:00
  • 5828ffd53b
    inference: Fix download command in error msg (#133) Russell Bryant 2024-09-27 16:31:11 -04:00
  • fb9e6371ec
    Validate name in llama stack build (#128) Russell Bryant 2024-09-27 16:30:55 -04:00
  • 53070e34a3
    Update RFC-0001-llama-stack.md (#134) Bhimraj Yadav 2024-09-27 21:59:36 +05:45
  • eb526b4d9b
    Update RFC-0001-llama-stack.md Xi Yan 2024-09-26 17:17:08 -07:00
  • 6b0805ebb4
    fix: 404 link to agentic system repository (#118) Moritz Althaus 2024-09-26 23:43:41 +02:00
  • 557ae38289
    Update getting_started.ipynb (#117) Deep Doshi 2024-09-26 14:43:04 -07:00
  • 2802ac8e9d
    add llama-stack.png Xi Yan 2024-09-26 11:17:46 -07:00
  • 995a1a1d00
    Reordered pip install and llama model download (#112) Karthi Keyan 2024-09-26 23:07:15 +05:30
  • 3c99f08267
    minor typo and HuggingFace -> Hugging Face (#113) Mark Sze 2024-09-27 02:48:23 +10:00
  • 3ae1597b9b
    load models using hf model id (#108) Kate Plawiak 2024-09-25 18:40:09 -07:00
  • e73e9110b7
    docs: fix typo (#107) JC (Jonathan Chen) 2024-09-25 21:36:31 -04:00
  • d0280138ef
    Update README.md Xi Yan 2024-09-25 17:29:17 -07:00
  • ca7602a642 fix #100 Xi Yan 2024-09-25 15:11:51 -07:00
  • 37be3fb184
    Fix links & format (#104) machina-source 2024-09-25 16:18:46 -05:00
  • 615ed4bfbc
    Make TGI adapter compatible with HF Inference API (#97) Lucain 2024-09-25 23:08:31 +02:00
  • 851c30597a
    chore (doc): fix typo for setup instructionllama-stack to llama-stack-apps (#103) Abhishek 2024-09-26 01:57:55 +05:30
  • c8fa26482d Bump version to 0.0.36 Ashwin Bharambe 2024-09-25 11:58:15 -07:00
  • baf7bb47b9
    Update README.md raghotham 2024-09-25 11:45:47 -07:00
  • 82f420c4f0
    fix safety using inference (#99) Xi Yan 2024-09-25 11:30:27 -07:00
  • 5c4f73d52f
    Drop header from LocalInference.h Dalton Flanagan 2024-09-25 11:27:37 -07:00
  • d442af0818 Add safety impl for llama guard vision Ashwin Bharambe 2024-09-25 11:06:59 -07:00
  • b3b0349931 Update LocalInference to use public repos Dalton Flanagan 2024-09-25 11:05:03 -07:00
  • 4fcda00872 Re-apply revert Ashwin Bharambe 2024-09-25 11:00:43 -07:00
  • d82a9d94e3 Small fix to the prompt-format error message Ashwin Bharambe 2024-09-25 10:56:13 -07:00
  • a227edb480 Bump version to 0.0.35 Ashwin Bharambe 2024-09-25 10:34:59 -07:00