1482 commits

Author SHA1 Message Date
Ishaan Jaff
051d4daa08 proxy - return num callbacks on /health/readiness 2024-05-03 09:14:32 -07:00
Marc Abramowitz
24ddba804f Disambiguate invalid model name errors
because that error can be thrown in several different places, so
knowing the function it's being thrown from can be very useful for debugging.
2024-05-02 15:02:54 -07:00
Lunik
3ca174bc57 feat: Add Azure Content-Safety Proxy hooks
Signed-off-by: Lunik <lunik@tiwabbit.fr>
2024-05-02 23:21:08 +02:00
Krish Dholakia
1b5efc47e1 Merge pull request #3375 from msabramo/GH-3372
Fix route `/openai/deployments/{model}/chat/completions` not working properly
2024-05-02 13:00:25 -07:00
Krish Dholakia
dff4649509 Merge branch 'main' into litellm_openmeter_integration 2024-05-01 21:19:29 -07:00
Krrish Dholakia
0b1382aef4 build(ui): enable adding openmeter via proxy ui 2024-05-01 21:16:23 -07:00
Ishaan Jaff
5ce76a5b0d feat - show slow count and total count 2024-05-01 17:18:14 -07:00
Ishaan Jaff
1721ab64aa feat - return slow responses on admin UI 2024-05-01 17:16:33 -07:00
Ishaan Jaff
cb5073da62 fix - dont show cache hits on model latency tracker 2024-05-01 16:51:15 -07:00
Ishaan Jaff
5b70130049 fix latency tracking tool tip 2024-05-01 16:47:30 -07:00
Ishaan Jaff
cf2dd97f0b clean up model latency metrics 2024-05-01 08:27:01 -07:00
Ishaan Jaff
1a2045d63a ui - show tokens / sec 2024-04-30 22:44:28 -07:00
Ishaan Jaff
ee5ebd8f4f fix - viewing model metrics 2024-04-30 18:26:14 -07:00
Ishaan Jaff
69d8f7b89b fix - track litellm_model_name in LiteLLM_ErrorLogs 2024-04-30 17:31:40 -07:00
Ishaan Jaff
5756adc178 backend - show model latency per token 2024-04-30 17:23:36 -07:00
Ishaan Jaff
1c9aa7f0f1 fix passing starttime and endtime to model/exceptions 2024-04-30 16:53:53 -07:00
Ishaan Jaff
672d1c67fc fix - backend return exceptions 2024-04-30 15:41:16 -07:00
Marc Abramowitz
eac4593ab9 Move chat_completions before completions
so that the `chat_completions` route is defined before the `completions` route.
This is necessary because the `chat_completions` route is more
specific than the `completions` route, and the order of route definitions
matters in FastAPI.

Without this, a request to
`/openai/deployments/{model_in_url}/chat/completions` might trigger
`completions` (with `model` set to `{model_in_url}/chat`) instead of
`chat_completions`, which is the correct function.

Fixes: GH-3372
2024-04-30 15:07:10 -07:00
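
The ordering problem described in the commit above can be illustrated with a small first-match router. This is a minimal stdlib sketch, not FastAPI itself and not the proxy's actual code: `compile_route`, `resolve`, and the route tables are hypothetical names, and it assumes the `model` parameter is a path-style parameter that may contain slashes (which is what lets `model` become `{model_in_url}/chat`).

```python
import re

def compile_route(template):
    # Assumption: `{name:path}` may span slashes, plain `{name}` may not.
    pattern = re.sub(r"\{(\w+):path\}", r"(?P<\1>.+)", template)
    pattern = re.sub(r"\{(\w+)\}", r"(?P<\1>[^/]+)", pattern)
    return re.compile("^" + pattern + "$")

def resolve(routes, path):
    # First-match semantics: routes are tried in definition order.
    for template, handler_name in routes:
        match = compile_route(template).match(path)
        if match:
            return handler_name, match.groupdict()
    return None, {}

COMPLETIONS = "/openai/deployments/{model:path}/completions"
CHAT = "/openai/deployments/{model:path}/chat/completions"

# General route first: it swallows the chat request.
wrong_order = [(COMPLETIONS, "completions"), (CHAT, "chat_completions")]
# Specific route first: each request reaches the intended handler.
right_order = [(CHAT, "chat_completions"), (COMPLETIONS, "completions")]

path = "/openai/deployments/gpt-4/chat/completions"
print(resolve(wrong_order, path))
print(resolve(right_order, path))
```

With the general route listed first, the chat request resolves to `completions` with `model` equal to `gpt-4/chat`; with the specific route first, it resolves to `chat_completions` with `model` equal to `gpt-4`.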
Ishaan Jaff
1776f5aea4 stash /model/metrics/exceptions endpoints 2024-04-30 14:19:23 -07:00
Ishaan Jaff
ebe842d404 log startTime and EndTime for exceptions 2024-04-30 13:34:14 -07:00
Ishaan Jaff
ab2b78cd88 feat log request kwargs in error logs 2024-04-30 13:28:26 -07:00
Ishaan Jaff
918fff424f fix log model_group 2024-04-30 13:11:09 -07:00
Ishaan Jaff
7f9b9d0173 fix - log api_base in errors 2024-04-30 13:02:42 -07:00
Ishaan Jaff
96cbdcb349 fix - working exception writing 2024-04-30 12:48:17 -07:00
Krrish Dholakia
a3fea74826 fix(proxy_server.py): return more detailed auth error message. 2024-04-29 07:24:19 -07:00
Krrish Dholakia
ae04330348 fix(proxy_server.py): fix setting offset-aware datetime 2024-04-25 21:18:32 -07:00
Krrish Dholakia
462724feef fix(proxy/utils.py): log rejected proxy requests to langfuse 2024-04-25 19:26:27 -07:00
Ishaan Jaff
9f0284affa fixes for testing alerting 2024-04-25 16:33:55 -07:00
Ishaan Jaff
a6132ab5d4 backend - update slack alert_to_webhook_url_map 2024-04-25 13:47:52 -07:00
Ishaan Jaff
2190b039fd pass alert type on alerting handle 2024-04-25 13:05:34 -07:00
Krrish Dholakia
f260050e4d fix(proxy_server.py): fix update router 2024-04-24 23:01:21 -07:00
Krrish Dholakia
9577548060 fix(proxy_server.py): fix updating non-router settings for proxy config 2024-04-24 22:50:04 -07:00
Krrish Dholakia
1988ce3247 feat(router.py): support mock testing fallbacks flag
allow user to test if fallbacks work as expected with a `mock_testing_fallbacks = True` flag set during a call
2024-04-24 20:13:10 -07:00
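
The idea behind a mock-fallback flag can be sketched as follows. This is an illustrative stand-in and not the router's implementation: `call_with_fallbacks`, `fake_call`, and the model names are hypothetical, and only the flag name `mock_testing_fallbacks` comes from the commit message. The sketch assumes the flag forces the primary deployment to fail so the fallback path is exercised without a real outage.

```python
def call_with_fallbacks(model, fallbacks, call, mock_testing_fallbacks=False):
    # Try the primary model first, then each fallback in order.
    candidates = [model] + list(fallbacks)
    last_error = None
    for index, name in enumerate(candidates):
        try:
            if mock_testing_fallbacks and index == 0:
                # Simulated failure of the primary model only.
                raise RuntimeError(f"mock failure for {name}")
            return name, call(name)
        except Exception as exc:
            last_error = exc
    # Every candidate failed: surface the last error.
    raise last_error

def fake_call(name):
    # Hypothetical stand-in for a real completion call.
    return f"ok from {name}"

print(call_with_fallbacks("primary", ["backup"], fake_call))
print(call_with_fallbacks("primary", ["backup"], fake_call,
                          mock_testing_fallbacks=True))
```

Without the flag the primary model answers; with the flag set, the simulated failure pushes the call to `backup`, which is the behavior a user would want to confirm before relying on fallbacks.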
Krrish Dholakia
f4bd85a489 fix(proxy_server.py): fix /config/update/
allows updating router config via UI and having the change be propagated across all proxy instances by persisting config changes to the db
2024-04-24 16:42:42 -07:00
Ishaan Jaff
f7053ac85f Merge pull request #3277 from BerriAI/litellm_update_deployments
[UI] V0 - Edit Model tpm, rpm, api_base
2024-04-24 14:03:00 -07:00
Ishaan Jaff
c19f28b7d2 /model/update endpoint 2024-04-24 10:39:20 -07:00
Krrish Dholakia
840bb5a18d fix(proxy_server.py): add new flag for disable sharing master key on ui 2024-04-24 10:06:01 -07:00
Ishaan Jaff
acf08e6d00 backend allow filtering by model_group 2024-04-23 22:03:20 -07:00
Ishaan Jaff
bd18a8909e ui - filter by time and deployments 2024-04-23 20:42:15 -07:00
Ishaan Jaff
1117a20f52 fix using slack alerting through admin ui 2024-04-23 19:05:50 -07:00
Krrish Dholakia
f1f08af785 fix(proxy_server.py): handle router being initialized without a model list 2024-04-23 10:52:28 -07:00
Ishaan Jaff
4e7b9356ce Merge pull request #3228 from BerriAI/litellm_ui_polish
[Fix] Non-Admin SSO Login
2024-04-22 18:15:10 -07:00
Ishaan Jaff
41d8bda94d fix - sso login for non admins 2024-04-22 17:57:47 -07:00
Ishaan Jaff
cd58805534 fix - track litellm_status=fail 2024-04-22 16:11:04 -07:00
Ishaan Jaff
b24be22ad0 ui - show all teams on ui 2024-04-22 14:15:50 -07:00
Ishaan Jaff
5e18a8bfbc Merge pull request #3212 from BerriAI/ui_increase_default_session_time
UI - increase default session time to 2 hours
2024-04-22 13:46:18 -07:00
Ishaan Jaff
a2e758f530 increase ui default session time to 2 hours 2024-04-22 10:00:53 -07:00
Ishaan Jaff
454925e461 fix - round spend to 2 decimals 2024-04-22 09:17:40 -07:00
Ishaan Jaff
8100ec4060 Merge pull request #3184 from BerriAI/litellm_ui_non_admins_flow
[UI] - non admin flow - only Create + Test Key available
2024-04-20 12:40:43 -07:00
Ishaan Jaff
20fe57107c fix - security fix 2024-04-20 12:10:08 -07:00