Commit graph

104 commits

Author SHA1 Message Date
Krish Dholakia
f3a47e4b1a Merge branch 'main' into litellm_webhook_support 2024-05-20 18:41:58 -07:00
Krrish Dholakia
fd13a830ab fix(slack_alerting.py): cleanup webhook event 2024-05-20 16:55:01 -07:00
Krrish Dholakia
c0d5e83570 feat(slack_alerting.py): support webhook for budget alerts
Closes https://github.com/BerriAI/litellm/issues/3743
2024-05-20 15:30:56 -07:00
Ishaan Jaff
540c476024 try/except deployment metrics error 2024-05-20 15:03:04 -07:00
Ishaan Jaff
e9ea7438a9 fix divide by 0 bug 2024-05-20 14:26:30 -07:00
Krrish Dholakia
41556b3d02 fix(slack_alerting.py): prevent alert spam within 24 hr period for budget alerts
Closes https://github.com/BerriAI/litellm/issues/3738
2024-05-20 13:35:19 -07:00
Ishaan Jaff
455bae52ae Merge pull request #3713 from BerriAI/litellm_ui_infer_azure_prefix
[Feat] Admin UI - use `base_model` for Slack Alerts
2024-05-17 21:55:23 -07:00
Krrish Dholakia
1ccc604953 fix(slack_alerting.py): show langfuse traces on error messages 2024-05-17 18:42:30 -07:00
Ishaan Jaff
f92dd0ec9b fix - show correct base_model in slack alerts 2024-05-17 16:07:02 -07:00
Ishaan Jaff
70194f7ba2 fix - allow users to opt into specific alert types 2024-05-16 16:52:44 -07:00
Krrish Dholakia
b37449f99c fix(slack_alerting.py): fix timezone utc issue 2024-05-14 22:54:33 -07:00
Krrish Dholakia
c098ad0a60 fix(alerting.py): fix datetime comparison logic 2024-05-14 22:10:09 -07:00
Krish Dholakia
01a9c28974 Merge pull request #3599 from taralika/patch-1
Ignore 0 failures and 0s latency in daily slack reports
2024-05-14 11:47:46 -07:00
Anand Taralika
e1efb1a64b Merge branch 'BerriAI:main' into patch-1 2024-05-13 21:31:00 -07:00
Marc Abramowitz
e01177fd55 Cast SLACK_DAILY_REPORT_FREQUENCY to int 2024-05-13 17:08:53 -07:00
Marc Abramowitz
e8f92f89d8 Add ability to customize slack report frequency
via `SLACK_DAILY_REPORT_FREQUENCY` env var

Addresses one thing that @taralika mentioned in
https://github.com/BerriAI/litellm/issues/3597
2024-05-13 16:39:44 -07:00
Krrish Dholakia
68ed9ed943 fix(slack_alerting.py): if 'turn_off_message_logging' enabled, do not log the message to logging integration 2024-05-13 14:02:43 -07:00
Ishaan Jaff
471cb148cb Merge pull request #3609 from BerriAI/litellm_send_daily_spend_report
[Feat] send weekly spend reports by Team/Tag
2024-05-13 12:45:37 -07:00
Ishaan Jaff
4d72d4676d fix - spend reports on alerts 2024-05-13 10:51:59 -07:00
Ishaan Jaff
2694721e81 fix - show monthly spend in slack reports 2024-05-13 10:17:09 -07:00
Krrish Dholakia
56b6efae50 fix(slack_alerting.py): don't fire spam alerts when backend api call fails 2024-05-13 10:04:43 -07:00
Ishaan Jaff
d384a0b39f feat - send daily spend reports 2024-05-13 09:25:31 -07:00
Anand Taralika
7862767428 Removed "5" from the string
since it's not guaranteed to always be 5, it will be at most 5, but could be less than 5 (if some values are 0s now that we ignore 0s)
2024-05-12 09:29:23 -07:00
Anand Taralika
f8957a7be8 Update a comment about ignoring 0 values in addition to None 2024-05-12 08:53:04 -07:00
Anand Taralika
aa6d51edc9 Ignore 0 failures and 0s latency in daily slack reports
Should fix #3598
2024-05-12 08:47:29 -07:00
Krrish Dholakia
926b86af87 feat(bedrock_httpx.py): moves to using httpx client for bedrock cohere calls 2024-05-11 13:43:08 -07:00
Ishaan Jaff
b21a9e35f7 fix add cooldown_deployment alert_type 2024-05-08 14:13:51 -07:00
Ishaan Jaff
470f8f6de9 Merge pull request #3511 from BerriAI/litellm_router_send_exceptions_slack
[Feat] litellm.Router / litellm.completion -  send llm exceptions to slack
2024-05-07 19:09:44 -07:00
Ishaan Jaff
d1d037aaaa fix slack alerting 2024-05-07 18:17:12 -07:00
Ishaan Jaff
54bdbe09d2 feat - send slack alerts litellm.router 2024-05-07 15:10:47 -07:00
Krrish Dholakia
e5c57cdddb feat(slack_alerting.py): reintegrate langfuse trace url for slack alerts
this ensures langfuse trace url returned in llm api exception err
2024-05-07 12:58:49 -07:00
Krrish Dholakia
7d27e4a85d fix(slack_alerting.py): fix storing + reading datetime object from cache
this converts the dt object to isoformat before storing, and loads it back to dt when comparing
2024-05-07 11:44:55 -07:00
Krish Dholakia
185581d5a9 Merge branch 'main' into litellm_slack_daily_reports 2024-05-06 19:31:20 -07:00
Krrish Dholakia
13e8f63388 fix(slack_alerting.py): allow internal cache to be an optional param 2024-05-06 18:34:09 -07:00
Krrish Dholakia
4e1fe377bf refactor(slack_alerting.py): trigger new build 2024-05-06 18:29:55 -07:00
Krrish Dholakia
2170e1723d feat(proxy_server.py): schedule slack daily report if enabled
if user enabled daily_reports, send them a slack report every 12 hours
2024-05-06 18:25:48 -07:00
Krrish Dholakia
0c1fc34e9f feat(slack_alerting.py): support sending daily reports on deployments
allow admin to easily know slow + failing deployments

Closes  https://github.com/BerriAI/litellm/issues/3483
2024-05-06 17:18:42 -07:00
Ishaan Jaff
e272d4d5c7 feat - format model price in slack alerts for new model 2024-05-06 15:57:22 -07:00
Ishaan Jaff
eaaa0f6cbe feat - send slack alerts on adding new model 2024-05-06 15:43:02 -07:00
Ishaan Jaff
edfed32d1c feat - add key name and team alias to slack alerts 2024-05-06 14:30:10 -07:00
Ishaan Jaff
5974e6792f fix - show api key name and alias in alerts 2024-05-06 14:15:34 -07:00
Ishaan Jaff
724b42f478 fix - revert init langfuse client on slack alerts 2024-05-02 16:02:52 -07:00
Ishaan Jaff
fb40f8001b Merge pull request #3338 from BerriAI/litellm_alerting_sort_latencies
Fix - slack alerting show deployment latencies in sorted order
2024-04-27 18:40:07 -07:00
Ishaan Jaff
b87badce7b fix slack alerting show deployment latencies 2024-04-27 15:44:30 -07:00
Ishaan Jaff
3e30f566ff fix - link to langfuse traces on slack alerts 2024-04-27 15:23:18 -07:00
Ishaan Jaff
1dbf666f4b Merge pull request #3301 from BerriAI/debug_lowest_latency
[Fix] sending deployment latencies to slack alerting - lowest_latency
2024-04-25 19:43:25 -07:00
Ishaan Jaff
6c8998d6ca fix clean up prints from slack alerting 2024-04-25 19:36:42 -07:00
Ishaan Jaff
2190b039fd pass alert type on alerting handle 2024-04-25 13:05:34 -07:00
Ishaan Jaff
3b9d6dfc47 temp - show better debug logs for lowest latency 2024-04-25 11:22:52 -07:00
Ishaan Jaff
ad637c15ce Merge pull request #3283 from BerriAI/litellm_debug_lowest_latency
[Fix] Add better observability for debugging lowest latency routing
2024-04-24 20:42:52 -07:00