Xi Yan
|
d95bef7f2e
|
Merge branch 'main' into evals_6
|
2024-10-25 12:55:28 -07:00 |
|
Xi Yan
|
07f9bf723f
|
fix broken --list-templates with adding build.yaml files for packaging (#327)
* add build files to templates
* fix templates
* manifest
* symlink
* symlink
* precommit
* change everything to docker build.yaml
* remove image_type in templates
* fix build from templates CLI
* fix readmes
|
2024-10-25 12:51:22 -07:00 |
|
Xi Yan
|
52fe165db8
|
address comments
|
2024-10-25 11:08:40 -07:00 |
|
Ashwin Bharambe
|
70d59b0f5d
|
Make vllm inference better
Tests still don't pass completely (some hang) so I think there are some
potential threading issues maybe
|
2024-10-24 22:52:47 -07:00 |
|
Xi Yan
|
6b0baa6d53
|
scoring_function -> scoring_fn rename, scorer -> scoring_fn rename
|
2024-10-24 21:06:38 -07:00 |
|
Sarthak Deshpande
|
df141b6ef3
|
Fix for get_agents_session (#300)
|
2024-10-24 18:36:27 -07:00 |
|
Xi Yan
|
cdfd584a8f
|
Merge branch 'main' into evals_6
|
2024-10-24 17:29:22 -07:00 |
|
Xi Yan
|
2372de70f8
|
error checking
|
2024-10-24 16:16:34 -07:00 |
|
Xi Yan
|
f6340a47d1
|
inclusion->subsetof
|
2024-10-24 16:13:49 -07:00 |
|
Xi Yan
|
29e48cc5c1
|
chatcompletion & completion input type validation
|
2024-10-24 16:11:25 -07:00 |
|
Dinesh Yeduguru
|
3e1c3fdb3f
|
completion() for tgi (#295)
|
2024-10-24 16:02:41 -07:00 |
|
Xi Yan
|
e468e23249
|
Merge branch 'main' into evals_6
|
2024-10-24 14:59:41 -07:00 |
|
Ashwin Bharambe
|
7b1a45ee0f
|
Fix score threshold in faiss
|
2024-10-24 14:57:10 -07:00 |
|
Dalton Flanagan
|
1721d91c95
|
Update iOS inference instructions for new quantization
|
2024-10-24 14:57:10 -07:00 |
|
Xi Yan
|
cb84034567
|
[Evals API][3/n] scoring_functions / scoring meta-reference implementations (#296)
* wip
* dataset validation
* test_scoring
* cleanup
* clean up test
* comments
* error checking
* dataset client
* test client:
* datasetio client
* clean up
* basic scoring function works
* scorer wip
* equality scorer
* score batch impl
* score batch
* update scoring test
* refactor
* validate scorer input
* address comments
* add all rows scores to ScoringResult
* bugfix
* scoring function def rename
|
2024-10-24 14:52:30 -07:00 |
|
Xi Yan
|
d4887fc746
|
address comments
|
2024-10-24 14:49:02 -07:00 |
|
Xi Yan
|
ba0186f2c8
|
refactor
|
2024-10-24 14:00:41 -07:00 |
|
Xi Yan
|
3db1b3fbcd
|
rebase name
|
2024-10-24 13:53:41 -07:00 |
|
Xi Yan
|
97ca72288c
|
Merge branch 'evals_5' into evals_6
|
2024-10-24 13:53:00 -07:00 |
|
Xi Yan
|
6053b8dd34
|
scoring function def rename
|
2024-10-24 13:51:11 -07:00 |
|
Xi Yan
|
689990b48b
|
Merge branch 'evals_5' into evals_6
|
2024-10-24 13:06:11 -07:00 |
|
Xi Yan
|
42bac85e1f
|
bugfix
|
2024-10-24 12:16:28 -07:00 |
|
Ashwin Bharambe
|
205bcfdd4e
|
Fix score threshold in faiss
|
2024-10-24 12:11:58 -07:00 |
|
Xi Yan
|
24dce9cb7a
|
minor typing
|
2024-10-24 12:08:57 -07:00 |
|
Xi Yan
|
32a496ab0f
|
Merge branch 'evals_5' into evals_6
|
2024-10-24 12:01:41 -07:00 |
|
Xi Yan
|
a3a8f32541
|
add all rows scores to ScoringResult
|
2024-10-24 11:53:15 -07:00 |
|
Dalton Flanagan
|
8eceebec98
|
Update iOS inference instructions for new quantization
|
2024-10-24 14:47:27 -04:00 |
|
Xi Yan
|
737fcb795f
|
evals with generation
|
2024-10-24 11:30:13 -07:00 |
|
Xi Yan
|
071dba8871
|
Merge branch 'main' into evals_5
|
2024-10-24 09:18:15 -07:00 |
|
Ashwin Bharambe
|
7afe51c84d
|
New quantized models (#301)
|
2024-10-24 08:38:56 -07:00 |
|
Xi Yan
|
afa0c2b146
|
address comments
|
2024-10-23 22:17:38 -07:00 |
|
Ashwin Bharambe
|
05a8d47b98
|
Add a meta-reference-quantized-gpu distribution
|
2024-10-23 21:45:50 -07:00 |
|
Xi Yan
|
59c93548bc
|
validate scorer input
|
2024-10-23 17:43:41 -07:00 |
|
Xi Yan
|
0ee82571a8
|
refactor
|
2024-10-23 17:30:10 -07:00 |
|
Xi Yan
|
7c803cef86
|
update scoring test
|
2024-10-23 17:22:48 -07:00 |
|
Xi Yan
|
3c6555c408
|
score batch
|
2024-10-23 16:38:00 -07:00 |
|
Xi Yan
|
eb572faf6f
|
score batch impl
|
2024-10-23 16:19:25 -07:00 |
|
Xi Yan
|
4b1d7da030
|
equality scorer
|
2024-10-23 16:07:17 -07:00 |
|
Xi Yan
|
caf253e08f
|
Merge branch 'main' into evals_5
|
2024-10-23 15:33:00 -07:00 |
|
Xi Yan
|
35981a1a3b
|
scorer wip
|
2024-10-23 15:02:54 -07:00 |
|
Xi Yan
|
70c08e694d
|
basic scoring function works
|
2024-10-23 14:42:28 -07:00 |
|
Xi Yan
|
5930a92dc7
|
datasetio client
|
2024-10-23 14:04:51 -07:00 |
|
Xi Yan
|
bb43369521
|
dataset client
|
2024-10-23 13:53:58 -07:00 |
|
Xi Yan
|
c5db025320
|
error checking
|
2024-10-23 13:17:47 -07:00 |
|
Xi Yan
|
d8bbce6f7c
|
comments
|
2024-10-23 13:16:08 -07:00 |
|
Xi Yan
|
5e1323b5bf
|
clean up test
|
2024-10-23 13:08:42 -07:00 |
|
Xi Yan
|
555f6e1531
|
cleanup
|
2024-10-23 13:07:15 -07:00 |
|
Xi Yan
|
92e32f80ad
|
test_scoring
|
2024-10-23 13:01:49 -07:00 |
|
Xi Yan
|
7c280e18fb
|
dataset validation
|
2024-10-23 12:08:39 -07:00 |
|
Dinesh Yeduguru
|
21f2e9adf5
|
dont set num_predict for all providers (#294)
|
2024-10-23 11:44:04 -07:00 |
|