forked from phoenix-oss/llama-stack-mirror
* wip * dataset validation * test_scoring * cleanup * clean up test * comments * error checking * dataset client * test client: * datasetio client * clean up * basic scoring function works * scorer wip * equality scorer * score batch impl * score batch * update scoring test * refactor * validate scorer input * address comments * add all rows scores to ScoringResult * bugfix * scoring function def rename
6 lines
310 B
CSV
input_query,generated_answer,expected_answer
What is the capital of France?,London,Paris
Who is the CEO of Meta?,Mark Zuckerberg,Mark Zuckerberg
What is the largest planet in our solar system?,Jupiter,Jupiter
What is the smallest country in the world?,China,Vatican City
What is the currency of Japan?,Yen,Yen