Commit graph

11139 commits

Author SHA1 Message Date
Krrish Dholakia
b187deb787 docs(deploy.md): cleanup docker quick start docs 2024-05-01 10:00:49 -07:00
Krrish Dholakia
1ad67a0d75 fix(router.py): fix update routing strategy 2024-05-01 09:51:11 -07:00
Ishaan Jaff
b3a788142b
Merge pull request #3380 from BerriAI/ui_polish_viewing_model_latencies
[UI] Polish viewing Model Latencies
2024-05-01 09:44:53 -07:00
Ishaan Jaff
0c2e9936ae ui - polish Avg Latency per Toke 2024-05-01 09:42:36 -07:00
Krrish Dholakia
642790efba build(config.yml): fix circle ci resource 2024-05-01 09:03:43 -07:00
Krrish Dholakia
cc54d3ef12 build(config.yml): bump circle-ci resource 2024-05-01 09:02:19 -07:00
Ishaan Jaff
e3ce97d33e ui - sort the dropdown 2024-05-01 08:59:05 -07:00
Ishaan Jaff
e3229d0468 fix tool tip 2024-05-01 08:56:21 -07:00
Krrish Dholakia
0ab6b4bb22 fix(langfuse.py): fix trace param overwriting when existing trace id is given 2024-05-01 08:44:46 -07:00
Ishaan Jaff
94b98f5c4e clean up model latency metrics 2024-05-01 08:27:01 -07:00
Ishaan Jaff
38cf04cc38 fix tool tip 2024-05-01 08:26:44 -07:00
Krrish Dholakia
d0f9f8c0ed fix(proxy/utils.py): emit number of spend transactions for keys being written to db in a batch 2024-05-01 08:25:04 -07:00
Krrish Dholakia
b9f5b3c1a0 build(test_keys.py): improve error message for test 2024-05-01 08:22:28 -07:00
Krrish Dholakia
abdae87ba2 fix(langfuse.py): don't overwrite trace details if existing trace id passed in 2024-05-01 08:15:17 -07:00
Marc Klingen
adf5e61f2e
Merge branch 'main' into patch-1 2024-05-01 15:19:25 +02:00
alisalim17
81ad331d92 set default tool calls and function call 2024-05-01 17:01:45 +04:00
alisalim17
20a796bacb add tool_calls attribute to Message and Delta classes in order to improve type-safety 2024-05-01 13:47:01 +04:00
Ishaan Jaff
fc5a845838 fix - prisma schema 2024-04-30 23:09:53 -07:00
Ishaan Jaff
eb54b57a44 bump: version 1.35.33 → 1.35.34 2024-04-30 22:55:03 -07:00
Ishaan Jaff
1e94d53a9b (ui - new build) 2024-04-30 22:54:51 -07:00
Ishaan Jaff
b8ca83114a
Merge pull request #3373 from BerriAI/litellm_ui_show_model_exceptions
[UI] show exceptions by model deployments + model latencies - v0
2024-04-30 22:53:47 -07:00
Ishaan Jaff
9bf99df7e2 ui - model analytics 2024-04-30 22:51:25 -07:00
Ishaan Jaff
b9238a00af ui - show tokens / sec 2024-04-30 22:44:28 -07:00
Krrish Dholakia
6a2ddc2791 docs(routing.md): add docs on lowest latency routing buffer 2024-04-30 22:41:50 -07:00
Krrish Dholakia
cfc1eeb3c3 test(test_router_fallbacks.py): rename test to run earlier 2024-04-30 22:04:20 -07:00
Krrish Dholakia
e506e71cb9 fix(test_router_fallbacks.py): reduce test rpm 2024-04-30 22:00:48 -07:00
Krish Dholakia
9f55a99e98
Merge pull request #3376 from BerriAI/litellm_routing_logic
fix(router.py): unify retry timeout logic across sync + async function_with_retries
2024-04-30 19:58:45 -07:00
Krrish Dholakia
e2eddac406 test(test_ratelimit.py): fix test to send below rpm 2024-04-30 19:35:21 -07:00
Krrish Dholakia
4761345311 fix(main.py): fix mock completion response 2024-04-30 19:30:18 -07:00
Krrish Dholakia
bc5c9d7da9 fix(test_router_fallbacks.py): fix tests 2024-04-30 18:48:39 -07:00
Ishaan Jaff
5da931f297 ui - clean up table 2024-04-30 18:48:32 -07:00
Ishaan Jaff
0c464f7f61 fix - viewing model metrics 2024-04-30 18:26:14 -07:00
Ishaan Jaff
ace3b02d97 ui - model analytics show failed requests % 2024-04-30 18:23:31 -07:00
Krish Dholakia
47017f5bc4
Merge pull request #3377 from BerriAI/revert-3374-abramowi/disambiguate-invalid-model-name-errors
Revert "Disambiguate invalid model name errors"
2024-04-30 17:58:00 -07:00
Krish Dholakia
82095731c1
Revert "Disambiguate invalid model name errors" 2024-04-30 17:57:52 -07:00
Krish Dholakia
fe3496961a
Merge pull request #3374 from msabramo/abramowi/disambiguate-invalid-model-name-errors
Disambiguate invalid model name errors
2024-04-30 17:57:29 -07:00
Krrish Dholakia
1baad80c7d fix(router.py): cooldown deployments, for 401 errors 2024-04-30 17:54:00 -07:00
Ishaan Jaff
f2849d0641 fix - track litellm_model_name in LiteLLM_ErrorLogs 2024-04-30 17:31:40 -07:00
Ishaan Jaff
8a1a043801 backend - show model latency per token 2024-04-30 17:23:36 -07:00
Ishaan Jaff
8177ef5ec0 ui - show model latency / token 2024-04-30 17:23:27 -07:00
Ishaan Jaff
ce1817380e feat ui - modelExceptionsCall 2024-04-30 16:56:45 -07:00
Ishaan Jaff
a2a8fef8f4 fix passing starttime and endtime to model/exceptions 2024-04-30 16:53:53 -07:00
Krrish Dholakia
8ee51a96f4 test(test_router_debug_logs.py): fix retry logic 2024-04-30 16:42:10 -07:00
Krish Dholakia
ce9ede6110
Merge pull request #3370 from BerriAI/litellm_latency_buffer
fix(lowest_latency.py): allow setting a buffer for getting values within a certain latency threshold
2024-04-30 16:01:47 -07:00
Ishaan Jaff
26a5d85869 fix - backend return exceptions 2024-04-30 15:41:16 -07:00
Krrish Dholakia
0267069c6a fix(router.py): return routing args as dict 2024-04-30 15:39:14 -07:00
Krrish Dholakia
668a5353ee fix(router.py): fix linting issue 2024-04-30 15:35:16 -07:00
Krrish Dholakia
6a2b4bcab8 fix(router.py): only check /v1 for azure ai studio models
Fixes https://github.com/BerriAI/litellm/issues/3346
2024-04-30 15:29:50 -07:00
Krrish Dholakia
87ff26ff27 fix(router.py): unify retry timeout logic across sync + async function_with_retries 2024-04-30 15:23:19 -07:00
Ishaan Jaff
49f83ce204 ui - show models analytics 2024-04-30 15:16:25 -07:00