Ishaan Jaff
5756adc178
backend - show model latency per token
2024-04-30 17:23:36 -07:00
Ishaan Jaff
1c9aa7f0f1
fix passing starttime and endtime to model/exceptions
2024-04-30 16:53:53 -07:00
Ishaan Jaff
672d1c67fc
fix - backend return exceptions
2024-04-30 15:41:16 -07:00
Marc Abramowitz
eac4593ab9
Move chat_completions before completions
...
so that the `chat_completions` route is defined before the `completions` route.
This is necessary because the `chat_completions` route is more
specific than the `completions` route, and the order of route definitions
matters in FastAPI.
Without this, doing a request to
`/openai/deployments/{model_in_url}/chat/completions` might trigger
`completions` being called (with `model` set to `{model_in_url}/chat`) instead of
`chat_completions` getting called, which is the correct function.
Fixes: GH-3372
2024-04-30 15:07:10 -07:00
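The route-ordering problem described in eac4593ab9 can be illustrated with a small, framework-free sketch. It does not use FastAPI; it only models "first matching route wins" with regular expressions. The `dispatch` helper, the route tuples, and the greedy `model` group (standing in for a path parameter that may contain slashes) are assumptions made for illustration, not code from the repository.

```python
import re

# Hypothetical model of first-match routing: each route is a
# (compiled pattern, handler name) pair, and the first match wins.
# The greedy `(?P<model>.+)` group stands in for a path parameter that
# is allowed to contain slashes (an assumption about the real routes).
COMPLETIONS = (
    re.compile(r"^/openai/deployments/(?P<model>.+)/completions$"),
    "completions",
)
CHAT_COMPLETIONS = (
    re.compile(r"^/openai/deployments/(?P<model>.+)/chat/completions$"),
    "chat_completions",
)


def dispatch(routes, path):
    """Return (handler_name, model) for the first route that matches `path`."""
    for pattern, handler in routes:
        match = pattern.match(path)
        if match:
            return handler, match.group("model")
    raise LookupError(f"no route matches {path}")


path = "/openai/deployments/my-model/chat/completions"

# General route registered first: it swallows the chat path and the
# captured model name picks up the trailing "/chat".
wrong_order = dispatch([COMPLETIONS, CHAT_COMPLETIONS], path)

# Specific route registered first: the chat handler is reached and the
# model name is captured cleanly.
right_order = dispatch([CHAT_COMPLETIONS, COMPLETIONS], path)

# A plain completions request still reaches `completions` in the fixed order.
plain = dispatch(
    [CHAT_COMPLETIONS, COMPLETIONS],
    "/openai/deployments/my-model/completions",
)
```

In this sketch, registering the more specific pattern first is enough to route the chat path correctly while leaving plain completions requests unaffected, which mirrors the reordering the commit describes.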
Ishaan Jaff
1776f5aea4
stash /model/metrics/exceptions endpoints
2024-04-30 14:19:23 -07:00
Ishaan Jaff
ebe842d404
log startTime and EndTime for exceptions
2024-04-30 13:34:14 -07:00
Ishaan Jaff
ab2b78cd88
feat log request kwargs in error logs
2024-04-30 13:28:26 -07:00
Ishaan Jaff
918fff424f
fix log model_group
2024-04-30 13:11:09 -07:00
Ishaan Jaff
7f9b9d0173
fix - log api_base in errors
2024-04-30 13:02:42 -07:00
Ishaan Jaff
96cbdcb349
fix - working exception writing
2024-04-30 12:48:17 -07:00
Krrish Dholakia
a3fea74826
fix(proxy_server.py): return more detailed auth error message.
2024-04-29 07:24:19 -07:00
Krrish Dholakia
ae04330348
fix(proxy_server.py): fix setting offset-aware datetime
2024-04-25 21:18:32 -07:00
Krrish Dholakia
462724feef
fix(proxy/utils.py): log rejected proxy requests to langfuse
2024-04-25 19:26:27 -07:00
Ishaan Jaff
9f0284affa
fixes for testing alerting
2024-04-25 16:33:55 -07:00
Ishaan Jaff
a6132ab5d4
backend - update slack alert_to_webhook_url_map
2024-04-25 13:47:52 -07:00
Ishaan Jaff
2190b039fd
pass alert type on alerting handle
2024-04-25 13:05:34 -07:00
Krrish Dholakia
f260050e4d
fix(proxy_server.py): fix update router
2024-04-24 23:01:21 -07:00
Krrish Dholakia
9577548060
fix(proxy_server.py): fix updating non-router settings for proxy config
2024-04-24 22:50:04 -07:00
Krrish Dholakia
1988ce3247
feat(router.py): support mock testing fallbacks flag
...
allow user to test if fallbacks work as expected with a `mock_testing_fallbacks = True` flag set during a call
2024-04-24 20:13:10 -07:00
Krrish Dholakia
f4bd85a489
fix(proxy_server.py): fix /config/update
...
allows updating router config via UI and having the change be propagated across all proxy instances by persisting config changes to the db
2024-04-24 16:42:42 -07:00
Ishaan Jaff
f7053ac85f
Merge pull request #3277 from BerriAI/litellm_update_deployments
...
[UI] V0 - Edit Model tpm, rpm, api_base
2024-04-24 14:03:00 -07:00
Ishaan Jaff
c19f28b7d2
/model/update endpoint
2024-04-24 10:39:20 -07:00
Krrish Dholakia
840bb5a18d
fix(proxy_server.py): add new flag for disable sharing master key on ui
2024-04-24 10:06:01 -07:00
Ishaan Jaff
acf08e6d00
backend allow filtering by model_group
2024-04-23 22:03:20 -07:00
Ishaan Jaff
bd18a8909e
ui - filter by time and deployments
2024-04-23 20:42:15 -07:00
Ishaan Jaff
1117a20f52
fix using slack alerting through admin ui
2024-04-23 19:05:50 -07:00
Krrish Dholakia
f1f08af785
fix(proxy_server.py): handle router being initialized without a model list
2024-04-23 10:52:28 -07:00
Ishaan Jaff
4e7b9356ce
Merge pull request #3228 from BerriAI/litellm_ui_polish
...
[Fix] Non-Admin SSO Login
2024-04-22 18:15:10 -07:00
Ishaan Jaff
41d8bda94d
fix - sso login for non admins
2024-04-22 17:57:47 -07:00
Ishaan Jaff
cd58805534
fix - track litellm_status=fail
2024-04-22 16:11:04 -07:00
Ishaan Jaff
b24be22ad0
ui - show all teams on ui
2024-04-22 14:15:50 -07:00
Ishaan Jaff
5e18a8bfbc
Merge pull request #3212 from BerriAI/ui_increase_default_session_time
...
UI - increase default session time to 2 hours
2024-04-22 13:46:18 -07:00
Ishaan Jaff
a2e758f530
increase ui default session time to 2 hours
2024-04-22 10:00:53 -07:00
Ishaan Jaff
454925e461
fix - round spend to 2 decimals
2024-04-22 09:17:40 -07:00
Ishaan Jaff
8100ec4060
Merge pull request #3184 from BerriAI/litellm_ui_non_admins_flow
...
[UI] - non admin flow - only Create + Test Key available
2024-04-20 12:40:43 -07:00
Ishaan Jaff
20fe57107c
fix - security fix
2024-04-20 12:10:08 -07:00
Ishaan Jaff
c50cf8bbd2
fix - audio_transcriptions security fix
2024-04-20 11:58:15 -07:00
Ishaan Jaff
941b6cecf5
fix testing fixes
2024-04-20 11:48:41 -07:00
Ishaan Jaff
d55237442b
(ci/cd) testing with team_id and /user/new
2024-04-20 11:09:34 -07:00
Ishaan Jaff
ec540d686f
fix - backend logic for non admin flow
2024-04-19 17:36:29 -07:00
Ishaan Jaff
69ff5805fc
feat - track team_alias in metadata for /chat, /embeddings
2024-04-19 10:52:54 -07:00
Ishaan Jaff
977b030dd9
ui - show all alert types when getting all callbacks
2024-04-18 20:08:13 -07:00
Ishaan Jaff
bb07c5fdc5
Merge pull request #3112 from BerriAI/litellm_add_alert_types
...
[Feat] Allow user to select slack alert types to Opt In to
2024-04-18 16:21:33 -07:00
Ishaan Jaff
3cbbbd5861
Merge pull request #3142 from BerriAI/litellm_slack_alerting_show_model_passed
...
[Fix] Show `model` passed on `"400: {'error': 'Invalid model name passed in mode` errors 👻
2024-04-18 16:18:17 -07:00
Ishaan Jaff
3125b9fbba
fix - show model passed in on Invalid model name passed in error
2024-04-18 15:43:30 -07:00
Krrish Dholakia
deccde6be1
fix(utils.py): support prometheus failed call metrics
2024-04-18 12:29:15 -07:00
Ishaan Jaff
670476353f
ui - set alert_types
2024-04-18 11:13:05 -07:00
Ishaan Jaff
58eea0f330
feat return alert types on /config/get/callback
2024-04-17 21:02:10 -07:00
Ishaan Jaff
36b2d51e48
ui - don't include model/metrics in schema
2024-04-17 15:30:06 -07:00
Krrish Dholakia
afdaa349fa
test(test_models.py): ensure only admin can call /health
2024-04-16 18:13:40 -07:00