Krrish Dholakia
|
b187deb787
|
docs(deploy.md): cleanup docker quick start docs
|
2024-05-01 10:00:49 -07:00 |
|
Krrish Dholakia
|
1ad67a0d75
|
fix(router.py): fix update routing strategy
|
2024-05-01 09:51:11 -07:00 |
|
Ishaan Jaff
|
b3a788142b
|
Merge pull request #3380 from BerriAI/ui_polish_viewing_model_latencies
[UI] Polish viewing Model Latencies
|
2024-05-01 09:44:53 -07:00 |
|
Ishaan Jaff
|
0c2e9936ae
|
ui - polish Avg Latency per Token
|
2024-05-01 09:42:36 -07:00 |
|
Krrish Dholakia
|
642790efba
|
build(config.yml): fix circle ci resource
|
2024-05-01 09:03:43 -07:00 |
|
Krrish Dholakia
|
cc54d3ef12
|
build(config.yml): bump circle-ci resource
|
2024-05-01 09:02:19 -07:00 |
|
Ishaan Jaff
|
e3ce97d33e
|
ui - sort the dropdown
|
2024-05-01 08:59:05 -07:00 |
|
Ishaan Jaff
|
e3229d0468
|
fix tool tip
|
2024-05-01 08:56:21 -07:00 |
|
Krrish Dholakia
|
0ab6b4bb22
|
fix(langfuse.py): fix trace param overwriting when existing trace id is given
|
2024-05-01 08:44:46 -07:00 |
|
Ishaan Jaff
|
94b98f5c4e
|
clean up model latency metrics
|
2024-05-01 08:27:01 -07:00 |
|
Ishaan Jaff
|
38cf04cc38
|
fix tool tip
|
2024-05-01 08:26:44 -07:00 |
|
Krrish Dholakia
|
d0f9f8c0ed
|
fix(proxy/utils.py): emit number of spend transactions for keys being written to db in a batch
|
2024-05-01 08:25:04 -07:00 |
|
Krrish Dholakia
|
b9f5b3c1a0
|
build(test_keys.py): improve error message for test
|
2024-05-01 08:22:28 -07:00 |
|
Krrish Dholakia
|
abdae87ba2
|
fix(langfuse.py): don't overwrite trace details if existing trace id passed in
|
2024-05-01 08:15:17 -07:00 |
|
Marc Klingen
|
adf5e61f2e
|
Merge branch 'main' into patch-1
|
2024-05-01 15:19:25 +02:00 |
|
alisalim17
|
81ad331d92
|
set default tool calls and function call
|
2024-05-01 17:01:45 +04:00 |
|
alisalim17
|
20a796bacb
|
add tool_calls attribute to Message and Delta classes in order to improve type-safety
|
2024-05-01 13:47:01 +04:00 |
|
Ishaan Jaff
|
fc5a845838
|
fix - prisma schema
|
2024-04-30 23:09:53 -07:00 |
|
Ishaan Jaff
|
eb54b57a44
|
bump: version 1.35.33 → 1.35.34
|
2024-04-30 22:55:03 -07:00 |
|
Ishaan Jaff
|
1e94d53a9b
|
(ui - new build)
|
2024-04-30 22:54:51 -07:00 |
|
Ishaan Jaff
|
b8ca83114a
|
Merge pull request #3373 from BerriAI/litellm_ui_show_model_exceptions
[UI] show exceptions by model deployments + model latencies - v0
|
2024-04-30 22:53:47 -07:00 |
|
Ishaan Jaff
|
9bf99df7e2
|
ui - model analytics
|
2024-04-30 22:51:25 -07:00 |
|
Ishaan Jaff
|
b9238a00af
|
ui - show tokens / sec
|
2024-04-30 22:44:28 -07:00 |
|
Krrish Dholakia
|
6a2ddc2791
|
docs(routing.md): add docs on lowest latency routing buffer
|
2024-04-30 22:41:50 -07:00 |
|
Krrish Dholakia
|
cfc1eeb3c3
|
test(test_router_fallbacks.py): rename test to run earlier
|
2024-04-30 22:04:20 -07:00 |
|
Krrish Dholakia
|
e506e71cb9
|
fix(test_router_fallbacks.py): reduce test rpm
|
2024-04-30 22:00:48 -07:00 |
|
Krish Dholakia
|
9f55a99e98
|
Merge pull request #3376 from BerriAI/litellm_routing_logic
fix(router.py): unify retry timeout logic across sync + async function_with_retries
|
2024-04-30 19:58:45 -07:00 |
|
Krrish Dholakia
|
e2eddac406
|
test(test_ratelimit.py): fix test to send below rpm
|
2024-04-30 19:35:21 -07:00 |
|
Krrish Dholakia
|
4761345311
|
fix(main.py): fix mock completion response
|
2024-04-30 19:30:18 -07:00 |
|
Krrish Dholakia
|
bc5c9d7da9
|
fix(test_router_fallbacks.py): fix tests
|
2024-04-30 18:48:39 -07:00 |
|
Ishaan Jaff
|
5da931f297
|
ui - clean up table
|
2024-04-30 18:48:32 -07:00 |
|
Ishaan Jaff
|
0c464f7f61
|
fix - viewing model metrics
|
2024-04-30 18:26:14 -07:00 |
|
Ishaan Jaff
|
ace3b02d97
|
ui - model analytics show failed requests %
|
2024-04-30 18:23:31 -07:00 |
|
Krish Dholakia
|
47017f5bc4
|
Merge pull request #3377 from BerriAI/revert-3374-abramowi/disambiguate-invalid-model-name-errors
Revert "Disambiguate invalid model name errors"
|
2024-04-30 17:58:00 -07:00 |
|
Krish Dholakia
|
82095731c1
|
Revert "Disambiguate invalid model name errors"
|
2024-04-30 17:57:52 -07:00 |
|
Krish Dholakia
|
fe3496961a
|
Merge pull request #3374 from msabramo/abramowi/disambiguate-invalid-model-name-errors
Disambiguate invalid model name errors
|
2024-04-30 17:57:29 -07:00 |
|
Krrish Dholakia
|
1baad80c7d
|
fix(router.py): cooldown deployments, for 401 errors
|
2024-04-30 17:54:00 -07:00 |
|
Ishaan Jaff
|
f2849d0641
|
fix - track litellm_model_name in LiteLLM_ErrorLogs
|
2024-04-30 17:31:40 -07:00 |
|
Ishaan Jaff
|
8a1a043801
|
backend - show model latency per token
|
2024-04-30 17:23:36 -07:00 |
|
Ishaan Jaff
|
8177ef5ec0
|
ui - show model latency / token
|
2024-04-30 17:23:27 -07:00 |
|
Ishaan Jaff
|
ce1817380e
|
feat ui - modelExceptionsCall
|
2024-04-30 16:56:45 -07:00 |
|
Ishaan Jaff
|
a2a8fef8f4
|
fix passing starttime and endtime to model/exceptions
|
2024-04-30 16:53:53 -07:00 |
|
Krrish Dholakia
|
8ee51a96f4
|
test(test_router_debug_logs.py): fix retry logic
|
2024-04-30 16:42:10 -07:00 |
|
Krish Dholakia
|
ce9ede6110
|
Merge pull request #3370 from BerriAI/litellm_latency_buffer
fix(lowest_latency.py): allow setting a buffer for getting values within a certain latency threshold
|
2024-04-30 16:01:47 -07:00 |
|
Ishaan Jaff
|
26a5d85869
|
fix - backend return exceptions
|
2024-04-30 15:41:16 -07:00 |
|
Krrish Dholakia
|
0267069c6a
|
fix(router.py): return routing args as dict
|
2024-04-30 15:39:14 -07:00 |
|
Krrish Dholakia
|
668a5353ee
|
fix(router.py): fix linting issue
|
2024-04-30 15:35:16 -07:00 |
|
Krrish Dholakia
|
6a2b4bcab8
|
fix(router.py): only check /v1 for azure ai studio models
Fixes https://github.com/BerriAI/litellm/issues/3346
|
2024-04-30 15:29:50 -07:00 |
|
Krrish Dholakia
|
87ff26ff27
|
fix(router.py): unify retry timeout logic across sync + async function_with_retries
|
2024-04-30 15:23:19 -07:00 |
|
Ishaan Jaff
|
49f83ce204
|
ui - show models analytics
|
2024-04-30 15:16:25 -07:00 |
|