mirror of
https://github.com/meta-llama/llama-stack.git
synced 2025-12-14 13:02:36 +00:00
* wip * dataset validation * test_scoring * cleanup * clean up test * comments * error checking * dataset client * test client: * datasetio client * clean up * basic scoring function works * scorer wip * equality scorer * score batch impl * score batch * update scoring test * refactor * validate scorer input * address comments * add all rows scores to ScoringResult * bugfix * scoring function def rename
310 B
310 B
| 1 | input_query | generated_answer | expected_answer |
|---|---|---|---|
| 2 | What is the capital of France? | London | Paris |
| 3 | Who is the CEO of Meta? | Mark Zuckerberg | Mark Zuckerberg |
| 4 | What is the largest planet in our solar system? | Jupiter | Jupiter |
| 5 | What is the smallest country in the world? | China | Vatican City |
| 6 | What is the currency of Japan? | Yen | Yen |