Commit graph

386 commits

Author SHA1 Message Date
Xi Yan
dc6393c271 update manifest for build templates 2024-10-24 14:57:10 -07:00
Ashwin Bharambe
2bea4bf801 Handle both ipv6 and ipv4 interfaces together 2024-10-24 14:57:10 -07:00
Ashwin Bharambe
72755da634 Bump version to 0.0.45 2024-10-24 14:57:10 -07:00
Ashwin Bharambe
7b1a45ee0f Fix score threshold in faiss 2024-10-24 14:57:10 -07:00
Ashwin Bharambe
86f1efa680 Small updates to quantization config 2024-10-24 14:57:10 -07:00
Dalton Flanagan
1721d91c95 Update iOS inference instructions for new quantization 2024-10-24 14:57:10 -07:00
Xi Yan
d4887fc746 address comments 2024-10-24 14:49:02 -07:00
Xi Yan
ba0186f2c8 refactor 2024-10-24 14:00:41 -07:00
Xi Yan
3db1b3fbcd rebase name 2024-10-24 13:53:41 -07:00
Xi Yan
97ca72288c Merge branch 'evals_5' into evals_6 2024-10-24 13:53:00 -07:00
Xi Yan
6053b8dd34 scoring function def rename 2024-10-24 13:51:11 -07:00
Xi Yan
689990b48b Merge branch 'evals_5' into evals_6 2024-10-24 13:06:11 -07:00
Xi Yan
42bac85e1f bugfix 2024-10-24 12:16:28 -07:00
Xi Yan
24dce9cb7a minor typing 2024-10-24 12:08:57 -07:00
Xi Yan
32a496ab0f Merge branch 'evals_5' into evals_6 2024-10-24 12:01:41 -07:00
Xi Yan
a3a8f32541 add all rows scores to ScoringResult 2024-10-24 11:53:15 -07:00
Xi Yan
737fcb795f evals with generation 2024-10-24 11:30:13 -07:00
Xi Yan
071dba8871 Merge branch 'main' into evals_5 2024-10-24 09:18:15 -07:00
Ashwin Bharambe
8aa8847b4a Bump version to 0.0.44 2024-10-24 08:41:39 -07:00
Ashwin Bharambe
7afe51c84d
New quantized models (#301) 2024-10-24 08:38:56 -07:00
Xi Yan
afa0c2b146 address comments 2024-10-23 22:17:38 -07:00
Ashwin Bharambe
05a8d47b98 Add a meta-reference-quantized-gpu distribution 2024-10-23 21:45:50 -07:00
Xi Yan
f5dcc03742 use pytorch/pytorch as base 2024-10-23 20:22:00 -07:00
Xi Yan
59c93548bc validate scorer input 2024-10-23 17:43:41 -07:00
Xi Yan
0ee82571a8 refactor 2024-10-23 17:30:10 -07:00
Xi Yan
7c803cef86 update scoring test 2024-10-23 17:22:48 -07:00
Xi Yan
3c6555c408 score batch 2024-10-23 16:38:00 -07:00
Xi Yan
eb572faf6f score batch impl 2024-10-23 16:19:25 -07:00
Xi Yan
4b1d7da030 equality scorer 2024-10-23 16:07:17 -07:00
Xi Yan
cad8c8710b Merge branch 'main' into evals_5 2024-10-23 15:33:36 -07:00
Xi Yan
caf253e08f Merge branch 'main' into evals_5 2024-10-23 15:33:00 -07:00
Xi Yan
0cec86453b
Fix issue w/ routing_table api getting added when router api is not specified (#298)
* fix issue w/ enforcing api

* cleanup

* inference only yaml
2024-10-23 15:27:22 -07:00
Xi Yan
35981a1a3b scorer wip 2024-10-23 15:02:54 -07:00
Xi Yan
70c08e694d basic scoring function works 2024-10-23 14:42:28 -07:00
Xi Yan
38e31ab525 clean up 2024-10-23 14:08:21 -07:00
Xi Yan
5930a92dc7 datasetio client 2024-10-23 14:04:51 -07:00
Xi Yan
51d5ad67c4 test client: 2024-10-23 13:55:55 -07:00
Xi Yan
bb43369521 dataset client 2024-10-23 13:53:58 -07:00
Xi Yan
c5db025320 error checking 2024-10-23 13:17:47 -07:00
Xi Yan
d8bbce6f7c comments 2024-10-23 13:16:08 -07:00
Xi Yan
5e1323b5bf clean up test 2024-10-23 13:08:42 -07:00
Xi Yan
555f6e1531 cleanup 2024-10-23 13:07:15 -07:00
Xi Yan
92e32f80ad test_scoring 2024-10-23 13:01:49 -07:00
Xi Yan
7c280e18fb dataset validation 2024-10-23 12:08:39 -07:00
Dinesh Yeduguru
21f2e9adf5
dont set num_predict for all providers (#294) 2024-10-23 11:44:04 -07:00
Ashwin Bharambe
ffb561070d
Support structured output for Together (#289) 2024-10-22 22:36:38 -07:00
Xi Yan
aefa84e70a wip 2024-10-22 20:00:43 -07:00
Sarthak Deshpande
2e5e46d896
Added tests for persistence (#274) 2024-10-22 19:41:46 -07:00
Xi Yan
821810657f
[Evals API][2/n] datasets / datasetio meta-reference implementation (#288)
* skeleton dataset / datasetio

* dataset datasetio

* config

* address comments

* delete dataset_utils

* address comments

* naming fix
2024-10-22 16:12:16 -07:00
Sarthak Deshpande
8a01b9e40c
Added implementations for get_agents_session, delete_agents_session and delete_agents (#267) 2024-10-22 13:50:43 -07:00