Commit graph

2 commits

Author SHA1 Message Date
Eric Huang
f27f617629 test(verification): more tests, multiturn
# What does this PR do?


## Test Plan
# What does this PR do?


## Test Plan
2025-04-14 18:20:09 -07:00
ehhuang
14146e4b3f
feat(verification): various improvements (#1921)
# What does this PR do?
- provider and their models now live in config.yaml
- better distinguish different cases within a test
- add model key to surface provider's model_id
- include example command to rerun single test case

## Test Plan
<img width="1173" alt="image"
src="https://github.com/user-attachments/assets/b414baf0-c768-451f-8c3b-c2905cf36fac"
/>
2025-04-10 10:26:19 -07:00