Commit graph

1964 commits

Author SHA1 Message Date
Ishaan Jaff
8874eaa0b3 fix - track litellm_status=fail 2024-04-22 16:11:04 -07:00
Ishaan Jaff
517f577292 fix - dont send alert on fail request 2024-04-22 16:07:58 -07:00
Ishaan Jaff
50bbd188fb ui - show all teams on ui 2024-04-22 14:15:50 -07:00
Ishaan Jaff
cd3b2a21c1 ui - find all teams 2024-04-22 14:15:09 -07:00
Ishaan Jaff
877c4e27f4
Merge pull request #3212 from BerriAI/ui_increase_default_session_time
UI - increase default session time to 2 hours
2024-04-22 13:46:18 -07:00
Ishaan Jaff
1a945092de
Merge pull request #3210 from BerriAI/litellm_ui_round_up_team_spend_2_decimals
[UI] round up team spend to 2 decimals + diversify legend for team spend
2024-04-22 13:45:52 -07:00
Krrish Dholakia
a520e1bd6f fix(router.py): add random shuffle and tpm-based shuffle for async shuffle logic 2024-04-22 12:58:59 -07:00
Ishaan Jaff
bb065f64c6 increase ui default session time to 2 hours 2024-04-22 10:00:53 -07:00
Ishaan Jaff
f54982a560 fix - round spend to 2 decimals 2024-04-22 09:17:40 -07:00
Ishaan Jaff
094583f18e feat - show langfuse trace in alerts 2024-04-22 08:51:46 -07:00
Ishaan Jaff
1dbd4f6d7b ui - new build 2024-04-22 07:23:22 -07:00
Ishaan Jaff
ced29f6b76 ui - new build 2024-04-20 19:32:40 -07:00
Krrish Dholakia
9379e3d047 fix(lowest_tpm_rpm_v2.py): use a combined tpm+rpm query in async get cache, to reduce redis client calls in high traffic 2024-04-20 16:13:11 -07:00
Krrish Dholakia
b96741e4f4 fix(router.py): async simple-shuffle support 2024-04-20 15:01:12 -07:00
Ishaan Jaff
ddc71d766a fix - slack alerting show input in the api_base 2024-04-20 13:16:47 -07:00
Ishaan Jaff
f89f8a4157
Merge pull request #3184 from BerriAI/litellm_ui_non_admins_flow
[UI] - non admin flow - only Create + Test Key available
2024-04-20 12:40:43 -07:00
Ishaan Jaff
07a10247db fix - security fix 2024-04-20 12:10:08 -07:00
Ishaan Jaff
5d39865362 fix - audio_transcriptions security fix 2024-04-20 11:58:15 -07:00
Ishaan Jaff
fd282ea932 fix testing fixes 2024-04-20 11:48:41 -07:00
Ishaan Jaff
7ebf2ca4d9 (ci/cd) testing with team_id and /user/new 2024-04-20 11:09:34 -07:00
Krrish Dholakia
e5b98814ad test(test_proxy_server.py): fix test to use valid redis host 2024-04-19 19:22:24 -07:00
Ishaan Jaff
00a07a99cd fix - backend logic for non admin flow 2024-04-19 17:36:29 -07:00
Krish Dholakia
cc7f27252b
Merge pull request #3179 from BerriAI/litellm_dual_caching_fix
fix(caching.py): dual cache async_batch_get_cache fix + testing
2024-04-19 17:02:47 -07:00
Ishaan Jaff
def5ce4888 fix user needs to opt in to langfuse tags 2024-04-19 16:13:14 -07:00
Ishaan Jaff
48e5de4f0e
Merge pull request #3175 from BerriAI/litellm_log_team_alias_langfuse
[FEAT] Log team alias to langfuse
2024-04-19 15:30:59 -07:00
Krrish Dholakia
01a1a8f731 fix(caching.py): dual cache async_batch_get_cache fix + testing
this fixes a bug in usage-based-routing-v2 which was caused b/c of how the result was being returned from dual cache async_batch_get_cache. it also adds unit testing for that function (and its sync equivalent)
2024-04-19 15:03:25 -07:00
Krrish Dholakia
3c6b6355c7 fix(ollama_chat.py): accept api key as a param for ollama calls
allows user to call hosted ollama endpoint using bearer token for auth
2024-04-19 13:02:13 -07:00
Ishaan Jaff
423121ff7d feat - track team_alias is metadata for /chat, /embeddings 2024-04-19 10:52:54 -07:00
Ishaan Jaff
6d92b13c22 feat - log team_alias to langfuse 2024-04-19 10:29:42 -07:00
Ishaan Jaff
6f948cd559 fix - show api_base in hanging requests 2024-04-18 21:01:26 -07:00
Ishaan Jaff
f04604910b fix - show api base on hanging requests 2024-04-18 20:57:22 -07:00
Ishaan Jaff
e95e7fde65 ui - new build 2024-04-18 20:18:50 -07:00
Ishaan Jaff
554c83fdaf ui - show all alert types when getting all callbacks 2024-04-18 20:08:13 -07:00
Krish Dholakia
77a353d484
Merge pull request #3144 from BerriAI/litellm_prometheus_latency_tracking
feat(prometheus_services.py): emit proxy latency for successful llm api requests
2024-04-18 19:10:58 -07:00
Ishaan Jaff
d9091dcf97 fix order by spend 2024-04-18 17:33:38 -07:00
Ishaan Jaff
b669e2987b fix return key aliases on /user/info 2024-04-18 17:16:52 -07:00
Krrish Dholakia
919a2876f1 fix(proxy/utils.py): add prometheus failed db request tracking 2024-04-18 16:30:29 -07:00
Krrish Dholakia
d61250109e fix(proxy/utils.py): add call type and duration to proxy_logging failure calls
this is for tracking failed db requests on prometheus
2024-04-18 16:24:36 -07:00
Ishaan Jaff
eb04a929e6
Merge pull request #3112 from BerriAI/litellm_add_alert_types
[Feat] Allow user to select slack alert types to Opt In to
2024-04-18 16:21:33 -07:00
Ishaan Jaff
8958bbeac9
Merge pull request #3142 from BerriAI/litellm_slack_alerting_show_model_passed
[Fix] Show `model` passed on `"400: {'error': 'Invalid model name passed in mode` errors 👻
2024-04-18 16:18:17 -07:00
Krrish Dholakia
0f95a824c4 feat(prometheus_services.py): emit proxy latency for successful llm api requests
uses prometheus histogram for this
2024-04-18 16:04:35 -07:00
Ishaan Jaff
6cfa80c42a
Merge pull request #3140 from BerriAI/litellm_trim_messages
[Fix] Slack Alerting - trim messages to first 100 chars
2024-04-18 16:01:39 -07:00
Ishaan Jaff
b308f8c079 fix - show model passed in on Invalid model name passed in error 2024-04-18 15:43:30 -07:00
Ishaan Jaff
e20b05d6dd fix trim messages to first 100 chars 2024-04-18 15:21:31 -07:00
Krrish Dholakia
28edb77350 fix(utils.py): support prometheus failed call metrics 2024-04-18 12:29:15 -07:00
Ishaan Jaff
1cda0db2ca fix - test alerting 2024-04-18 11:40:40 -07:00
Ishaan Jaff
5e0dc57329 ui - set alert_types 2024-04-18 11:13:05 -07:00
Ishaan Jaff
a8e2ef79ef add alert_types to config.yaml 2024-04-18 08:51:07 -07:00
Krish Dholakia
91fe668411
Merge pull request #3105 from BerriAI/litellm_fix_hashing
fix(_types.py): hash api key in UserAPIKeyAuth
2024-04-18 08:16:24 -07:00
Ishaan Jaff
beeee01199 feat return alert types on /config/get/callback 2024-04-17 21:02:10 -07:00