Commit graph

  • d95bef7f2e Merge branch 'main' into evals_6 Xi Yan 2024-10-25 12:55:28 -07:00
  • d0b6894a41 Use enum.value to check against str Ashwin Bharambe 2024-10-25 12:53:32 -07:00
  • 07f9bf723f fix broken --list-templates with adding build.yaml files for packaging (#327) Xi Yan 2024-10-25 12:51:22 -07:00
  • 6100b02ff5 fix readmes Xi Yan 2024-10-25 12:49:33 -07:00
  • 474101a9f7 fix build from templates CLI Xi Yan 2024-10-25 12:44:04 -07:00
  • 7e60a9ca6d remove image_type in templates Xi Yan 2024-10-25 12:37:15 -07:00
  • 8e0a4e2885 Added an assertion for model args Sachin Mehta 2024-10-25 12:33:27 -07:00
  • db4f18099f Changed from config to model_args Sachin Mehta 2024-10-25 12:24:50 -07:00
  • 65d914d7b9 change everything to docker build.yaml Xi Yan 2024-10-25 12:19:17 -07:00
  • 98ba12eb15 precommit Xi Yan 2024-10-25 12:15:21 -07:00
  • 37010c4b58 symlink Xi Yan 2024-10-25 12:12:48 -07:00
  • d17c2e5675 symlink Xi Yan 2024-10-25 12:12:13 -07:00
  • 02b2e1c4a4 manifest Xi Yan 2024-10-25 12:10:15 -07:00
  • 32df363b48 fix templates Xi Yan 2024-10-25 12:09:45 -07:00
  • 19adb4070a add build files to templates Xi Yan 2024-10-25 12:08:32 -07:00
  • 94f4f0331d move build.yaml to templates, symlink in distributions Xi Yan 2024-10-25 11:54:09 -07:00
  • 753e8790a3 Update docker build flow a little Ashwin Bharambe 2024-10-25 10:06:21 -07:00
  • 3c3c6e11ad Move function around Ashwin Bharambe 2024-10-25 09:17:33 -07:00
  • 392c23e557 Update docker_base for meta-reference-gpu Ashwin Bharambe 2024-10-25 09:13:33 -07:00
  • be3adb0964 Make vllm inference better Ashwin Bharambe 2024-10-24 22:30:49 -07:00
  • b4a5176a54 update MANIFEST Xi Yan 2024-10-25 11:54:51 -07:00
  • 56f9b7d5d6 move build.yaml to templates, symlink in distributions Xi Yan 2024-10-25 11:54:09 -07:00
  • 93472042f8 Added hadamard transform for spinquant Sachin Mehta 2024-10-25 11:48:24 -07:00
  • 52fe165db8 address comments Xi Yan 2024-10-25 11:08:40 -07:00
  • e83ce36780 fix precommit hook ci failure Dinesh Yeduguru 2024-10-25 11:03:05 -07:00
  • 81ed0327f3 tmp wip Xi Yan 2024-10-25 10:43:22 -07:00
  • b543785c92 Merge branch 'meta-llama:main' into merge_conflict karthikgutha 2024-10-25 10:23:10 -07:00
  • afae4e3d8e Update docker build flow a little Ashwin Bharambe 2024-10-25 10:06:21 -07:00
  • 5bed6c276c Move function around Ashwin Bharambe 2024-10-25 09:17:33 -07:00
  • a387ca22e2 Update docker_base for meta-reference-gpu Ashwin Bharambe 2024-10-25 09:13:33 -07:00
  • 14cd065b6c Merge branch 'main' into merge_conflict karthikgutha 2024-10-25 07:45:36 -07:00
  • 58c3c45f19 workaround list templates command Xi Yan 2024-10-24 23:05:26 -07:00
  • 70d59b0f5d Make vllm inference better Ashwin Bharambe 2024-10-24 22:30:49 -07:00
  • cb43caa2c3 start_container.sh prefix llamastack->distribution name Xi Yan 2024-10-24 21:29:07 -07:00
  • 6b0baa6d53 scoring_function -> scoring_fn rename, scorer -> scoring_fn rename Xi Yan 2024-10-24 21:06:38 -07:00
  • df141b6ef3 Fix for get_agents_session (#300) Sarthak Deshpande 2024-10-25 07:06:27 +05:30
  • ec7c8f95de generate openapi Xi Yan 2024-10-24 17:41:15 -07:00
  • cdfd584a8f Merge branch 'main' into evals_6 Xi Yan 2024-10-24 17:29:22 -07:00
  • b6d8246b82 added templates and enhanced readme (#307) Justin Lee 2024-10-24 17:07:06 -07:00
  • 2372de70f8 error checking Xi Yan 2024-10-24 16:16:34 -07:00
  • f6340a47d1 inclusion->subsetof Xi Yan 2024-10-24 16:13:49 -07:00
  • 29e48cc5c1 chatcompletion & completion input type validation Xi Yan 2024-10-24 16:11:25 -07:00
  • 6c49846ecc added templates and enhanced readme Justin Lee 2024-10-24 16:05:33 -07:00
  • 3e1c3fdb3f completion() for tgi (#295) Dinesh Yeduguru 2024-10-24 16:02:41 -07:00
  • e468e23249 Merge branch 'main' into evals_6 Xi Yan 2024-10-24 14:59:41 -07:00
  • 997e3003b9 Update getting_started.md Xi Yan 2024-10-24 14:19:35 -07:00
  • dc6393c271 update manifest for build templates Xi Yan 2024-10-24 14:04:13 -07:00
  • 2bea4bf801 Handle both ipv6 and ipv4 interfaces together Ashwin Bharambe 2024-10-24 13:36:41 -07:00
  • 72755da634 Bump version to 0.0.45 Ashwin Bharambe 2024-10-24 12:14:18 -07:00
  • 7b1a45ee0f Fix score threshold in faiss Ashwin Bharambe 2024-10-24 12:11:58 -07:00
  • 86f1efa680 Small updates to quantization config Ashwin Bharambe 2024-10-24 12:08:43 -07:00
  • 1721d91c95 Update iOS inference instructions for new quantization Dalton Flanagan 2024-10-24 14:47:27 -04:00
  • cb84034567 [Evals API][3/n] scoring_functions / scoring meta-reference implementations (#296) Xi Yan 2024-10-24 14:52:30 -07:00
  • d4887fc746 address comments Xi Yan 2024-10-24 14:49:02 -07:00
  • f7658f9c3a fix assert Dinesh Yeduguru 2024-10-24 14:46:21 -07:00
  • 9bf1388429 actually test strutured output in completion Dinesh Yeduguru 2024-10-24 14:44:31 -07:00
  • e70420a06e Update getting_started.md Xi Yan 2024-10-24 14:19:35 -07:00
  • 8615bc9e08 update manifest for build templates Xi Yan 2024-10-24 14:04:13 -07:00
  • ba0186f2c8 refactor Xi Yan 2024-10-24 14:00:41 -07:00
  • 94728d6983 Handle both ipv6 and ipv4 interfaces together Ashwin Bharambe 2024-10-24 13:36:41 -07:00
  • 3db1b3fbcd rebase name Xi Yan 2024-10-24 13:53:41 -07:00
  • 97ca72288c Merge branch 'evals_5' into evals_6 Xi Yan 2024-10-24 13:53:00 -07:00
  • 6053b8dd34 scoring function def rename Xi Yan 2024-10-24 13:51:11 -07:00
  • 689990b48b Merge branch 'evals_5' into evals_6 Xi Yan 2024-10-24 13:06:11 -07:00
  • 42bac85e1f bugfix Xi Yan 2024-10-24 12:16:28 -07:00
  • 0538cc297e Bump version to 0.0.45 Ashwin Bharambe 2024-10-24 12:14:18 -07:00
  • 205bcfdd4e Fix score threshold in faiss Ashwin Bharambe 2024-10-24 12:11:58 -07:00
  • 24dce9cb7a minor typing Xi Yan 2024-10-24 12:08:57 -07:00
  • 161aef0aae Small updates to quantization config Ashwin Bharambe 2024-10-24 12:08:43 -07:00
  • 32a496ab0f Merge branch 'evals_5' into evals_6 Xi Yan 2024-10-24 12:01:41 -07:00
  • a3a8f32541 add all rows scores to ScoringResult Xi Yan 2024-10-24 11:53:15 -07:00
  • 8eceebec98 Update iOS inference instructions for new quantization Dalton Flanagan 2024-10-24 14:47:27 -04:00
  • 737fcb795f evals with generation Xi Yan 2024-10-24 11:30:13 -07:00
  • 071dba8871 Merge branch 'main' into evals_5 Xi Yan 2024-10-24 09:18:15 -07:00
  • 8aa8847b4a Bump version to 0.0.44 Ashwin Bharambe 2024-10-24 08:41:39 -07:00
  • 7afe51c84d New quantized models (#301) Ashwin Bharambe 2024-10-24 08:38:56 -07:00
  • 335c2561fa New quantized models Ashwin Bharambe 2024-10-24 08:37:26 -07:00
  • 9d630601b9 print statements removed Sarthak Deshpande 2024-10-24 14:02:48 +05:30
  • 5b3c637117 Fix for get_agents_session Sarthak Deshpande 2024-10-24 11:35:57 +05:30
  • afa0c2b146 address comments Xi Yan 2024-10-23 22:17:38 -07:00
  • 05a8d47b98 Add a meta-reference-quantized-gpu distribution Ashwin Bharambe 2024-10-23 19:33:14 -07:00
  • 3796dbd4a5 add test for structured output Dinesh Yeduguru 2024-10-23 20:44:49 -07:00
  • f5dcc03742 use pytorch/pytorch as base Xi Yan 2024-10-23 20:22:00 -07:00
  • 4a073fcee5 refactor get_max_tokens and build_options Dinesh Yeduguru 2024-10-23 19:11:04 -07:00
  • 59c93548bc validate scorer input Xi Yan 2024-10-23 17:43:41 -07:00
  • 0ee82571a8 refactor Xi Yan 2024-10-23 17:30:10 -07:00
  • 7c803cef86 update scoring test Xi Yan 2024-10-23 17:22:48 -07:00
  • 302555b11a Implement embeddings for ollama krgutha 2024-10-23 17:18:08 -07:00
  • 3c6555c408 score batch Xi Yan 2024-10-23 16:38:00 -07:00
  • eb572faf6f score batch impl Xi Yan 2024-10-23 16:19:25 -07:00
  • 4b1d7da030 equality scorer Xi Yan 2024-10-23 16:07:17 -07:00
  • cad8c8710b Merge branch 'main' into evals_5 Xi Yan 2024-10-23 15:33:36 -07:00
  • caf253e08f Merge branch 'main' into evals_5 Xi Yan 2024-10-23 15:33:00 -07:00
  • 0cec86453b Fix issue w/ routing_table api getting added when router api is not specified (#298) Xi Yan 2024-10-23 15:27:22 -07:00
  • cc508a6fcc inference only yaml Xi Yan 2024-10-23 15:22:03 -07:00
  • 47d9030542 cleanup Xi Yan 2024-10-23 15:19:41 -07:00
  • 07d45f2af3 fix issue w/ enforcing api Xi Yan 2024-10-23 15:15:52 -07:00
  • 35981a1a3b scorer wip Xi Yan 2024-10-23 15:02:54 -07:00
  • 70c08e694d basic scoring function works Xi Yan 2024-10-23 14:42:28 -07:00
  • 38e31ab525 clean up Xi Yan 2024-10-23 14:08:21 -07:00