llama-stack-mirror/llama_stack/providers/tests
Xi Yan ed833bb758
[Evals API][7/n] braintrust scoring provider (#333)
* wip scoring refactor

* llm as judge, move folders

* test full generation + eval

* extract score regex to llm context

* remove prints, cleanup braintrust in this branch

* braintrust skeleton

* datasetio test fix

* braintrust provider

* remove prints

* dependencies

* change json -> class

* json -> class

* remove initialize

* address nits

* check identifier prefix

* braintrust scoring identifier check, rebase

* update MANIFEST

* manifest

* remove braintrust scoring_fn

* remove comments

* tests

* imports fix
2024-10-28 18:59:35 -07:00
agents Added tests for persistence (#274) 2024-10-22 19:41:46 -07:00
datasetio [Evals API][7/n] braintrust scoring provider (#333) 2024-10-28 18:59:35 -07:00
eval [Evals API][6/n] meta-reference llm as judge, registration for ScoringFnDefs (#330) 2024-10-28 14:08:42 -07:00
inference completion() for fireworks (#329) 2024-10-25 16:12:10 -07:00
memory feat: Qdrant Vector index support (#221) 2024-10-22 12:50:19 -07:00
safety Remove "routing_table" and "routing_key" concepts for the user (#201) 2024-10-10 10:24:13 -07:00
scoring [Evals API][7/n] braintrust scoring provider (#333) 2024-10-28 18:59:35 -07:00
__init__.py Remove "routing_table" and "routing_key" concepts for the user (#201) 2024-10-10 10:24:13 -07:00
resolver.py Allow passing provider_registry to resolve_impls() 2024-10-28 11:58:16 -07:00