[Evals API][6/n] meta-reference llm as judge, registration for ScoringFnDefs (#330)

* wip scoring refactor * llm as judge, move folders * test full generation + eval * extract score regex to llm context * remove prints, cleanup braintrust in this branch * change json -> class * remove initialize * address nits * check identifier prefix * udpate MANIFEST
2025-06-27 18:50:41 +00:00 · 2024-10-28 14:08:42 -07:00 · 2024-10-28 14:08:42 -07:00 · 7b8748c53e
commit 7b8748c53e
parent 04a4784287
20 changed files with 360 additions and 50 deletions
--- a/llama_stack/providers/registry/scoring.py
+++ b/llama_stack/providers/registry/scoring.py
@ -20,6 +20,7 @@ def available_providers() -> List[ProviderSpec]:
            api_dependencies=[
                Api.datasetio,
                Api.datasets,
+                Api.inference,
            ],
        ),
    ]