llama-stack

Xi Yan 7b8748c53e [Evals API][6/n] meta-reference llm as judge, registration for ScoringFnDefs (#330) * wip scoring refactor * llm as judge, move folders * test full generation + eval * extract score regex to llm context * remove prints, cleanup braintrust in this branch * change json -> class * remove initialize * address nits * check identifier prefix * update MANIFEST		2024-10-28 14:08:42 -07:00
..
__init__.py	[Evals API][4/n] evals with generation meta-reference impl (#303)	2024-10-25 13:12:39 -07:00
config.py	[Evals API][4/n] evals with generation meta-reference impl (#303)	2024-10-25 13:12:39 -07:00
eval.py	[Evals API][6/n] meta-reference llm as judge, registration for ScoringFnDefs (#330)	2024-10-28 14:08:42 -07:00