Commit graph

517 commits

Author SHA1 Message Date
Ishaan Jaff
7a8d78e6c9 feat - check team member spend, budget 2024-05-22 18:36:39 -07:00
Ishaan Jaff
c2d25b9a14 feat - update team+user_id spend 2024-05-22 17:49:54 -07:00
Krrish Dholakia
3c0e9cb48e feat(schema.prisma): store model id + model group as part of spend logs
allows precise model metrics
2024-05-22 17:29:44 -07:00
Krrish Dholakia
9f518030b9 build(schema.prisma): add completion_start_time to spend logs
enables time to first token tracking for models
2024-05-22 16:43:08 -07:00
Krish Dholakia
707cf24472 Merge branch 'main' into litellm_webhook_support 2024-05-20 18:41:58 -07:00
Krish Dholakia
c6bb6e325b Merge pull request #3740 from BerriAI/litellm_return_rejected_response
feat(proxy_server.py): allow admin to return rejected response as string to user
2024-05-20 17:48:21 -07:00
Krrish Dholakia
867f9300e3 fix(slack_alerting.py): cleanup webhook event 2024-05-20 16:55:01 -07:00
Ishaan Jaff
8413fdf4c7 Merge branch 'main' into litellm_standardize_slack_exception_msg_format 2024-05-20 16:39:41 -07:00
Ishaan Jaff
5d24b47d2b fix standardize llm exception alert to msg: value 2024-05-20 16:23:48 -07:00
Krrish Dholakia
da0e5d1b8d feat(slack_alerting.py): support webhook for budget alerts
Closes https://github.com/BerriAI/litellm/issues/3743
2024-05-20 15:30:56 -07:00
Ishaan Jaff
f417495b7b fix - only adding alerting callbacks when alerting is on 2024-05-20 14:59:06 -07:00
Krrish Dholakia
f11f207ae6 feat(proxy_server.py): refactor returning rejected message, to work with error logging
log the rejected request as a failed call to langfuse/slack alerting
2024-05-20 11:14:36 -07:00
Krrish Dholakia
372323c38a feat(proxy_server.py): allow admin to return rejected response as string to user
Closes https://github.com/BerriAI/litellm/issues/3671
2024-05-20 10:30:23 -07:00
Ishaan Jaff
a292583ff1 fix - allow users to opt into specific alert types 2024-05-16 16:52:44 -07:00
Krish Dholakia
d43f75150a Merge pull request #3685 from BerriAI/litellm_lago_integration
feat(lago.py): Enable Usage-based billing with lago
2024-05-16 13:09:48 -07:00
Krrish Dholakia
e273e66618 feat(lago.py): adding support for usage-based billing with lago
Closes https://github.com/BerriAI/litellm/issues/3639
2024-05-16 10:54:18 -07:00
Krish Dholakia
152946e910 Merge branch 'main' into litellm_end_user_cost_tracking 2024-05-15 22:27:04 -07:00
Ishaan Jaff
89dd1ffee8 fix deep copy logic in during_call_hook 2024-05-15 17:34:54 -07:00
Ishaan Jaff
f6e342303e show debug info on slack alerts 2024-05-15 13:23:44 -07:00
Krrish Dholakia
5dcf3d672c feat(proxy_server.py): new /end_user/info endpoint
get spend for a specific end-user
2024-05-15 09:50:52 -07:00
Krish Dholakia
8ad979cdfe Merge branch 'main' into litellm_region_based_routing 2024-05-08 22:19:51 -07:00
Krrish Dholakia
db666b01e5 feat(proxy_server.py): add CRUD endpoints for 'end_user' management
allow admin to specify region + default models for end users
2024-05-08 18:50:36 -07:00
Krrish Dholakia
6575143460 feat(proxy_server.py): return litellm version in response headers 2024-05-08 16:00:08 -07:00
Krrish Dholakia
872470ff1f feat(slack_alerting.py): reintegrate langfuse trace url for slack alerts
this ensures langfuse trace url returned in llm api exception err
2024-05-07 12:58:49 -07:00
Krrish Dholakia
6b9b4f05ba feat(proxy_server.py): schedule slack daily report if enabled
if user enabled daily_reports, send them a slack report every 12 hours
2024-05-06 18:25:48 -07:00
Krrish Dholakia
718f423d7d feat(slack_alerting.py): support sending daily reports on deployments
allow admin to easily know slow + failing deployments
Closes https://github.com/BerriAI/litellm/issues/3483
2024-05-06 17:18:42 -07:00
Krrish Dholakia
fdc4fdb91a fix(proxy/utils.py): fix slack alerting to only raise alerts for llm api exceptions
don't spam for bad user requests. Closes https://github.com/BerriAI/litellm/issues/3395
2024-05-02 17:18:21 -07:00
Krrish Dholakia
acda064be6 fix(proxy/utils.py): fix retry logic for generic data request 2024-05-02 14:50:50 -07:00
Krrish Dholakia
2a9651b3ca feat(openmeter.py): add support for user billing
open-meter supports user based billing. Closes https://github.com/BerriAI/litellm/issues/1268
2024-05-01 17:23:48 -07:00
Krrish Dholakia
d0f9f8c0ed fix(proxy/utils.py): emit number of spend transactions for keys being written to db in a batch 2024-05-01 08:25:04 -07:00
Krrish Dholakia
885de2e3c6 fix(proxy/utils.py): log rejected proxy requests to langfuse 2024-04-25 19:26:27 -07:00
Ishaan Jaff
1d5e70f7a0 pass alert type on alerting handle 2024-04-25 13:05:34 -07:00
Ishaan Jaff
31979f83d2 fix - refactor slack alerting 2024-04-23 18:34:11 -07:00
Ishaan Jaff
5118c96b4d fix langfuse project id 2024-04-23 11:37:17 -07:00
Ishaan Jaff
517f577292 fix - dont send alert on fail request 2024-04-22 16:07:58 -07:00
Ishaan Jaff
cd3b2a21c1 ui - find all teams 2024-04-22 14:15:09 -07:00
Ishaan Jaff
094583f18e feat - show langfuse trace in alerts 2024-04-22 08:51:46 -07:00
Ishaan Jaff
ddc71d766a fix - slack alerting show input in the api_base 2024-04-20 13:16:47 -07:00
Ishaan Jaff
6d92b13c22 feat - log team_alias to langfuse 2024-04-19 10:29:42 -07:00
Ishaan Jaff
6f948cd559 fix - show api_base in hanging requests 2024-04-18 21:01:26 -07:00
Ishaan Jaff
f04604910b fix - show api base on hanging requests 2024-04-18 20:57:22 -07:00
Ishaan Jaff
554c83fdaf ui - show all alert types when getting all callbacks 2024-04-18 20:08:13 -07:00
Krish Dholakia
77a353d484 Merge pull request #3144 from BerriAI/litellm_prometheus_latency_tracking
feat(prometheus_services.py): emit proxy latency for successful llm api requests
2024-04-18 19:10:58 -07:00
Ishaan Jaff
d9091dcf97 fix order by spend 2024-04-18 17:33:38 -07:00
Ishaan Jaff
b669e2987b fix return key aliases on /user/info 2024-04-18 17:16:52 -07:00
Krrish Dholakia
919a2876f1 fix(proxy/utils.py): add prometheus failed db request tracking 2024-04-18 16:30:29 -07:00
Krrish Dholakia
d61250109e fix(proxy/utils.py): add call type and duration to proxy_logging failure calls
this is for tracking failed db requests on prometheus
2024-04-18 16:24:36 -07:00
Ishaan Jaff
eb04a929e6 Merge pull request #3112 from BerriAI/litellm_add_alert_types
[Feat] Allow user to select slack alert types to Opt In to
2024-04-18 16:21:33 -07:00
Krrish Dholakia
0f95a824c4 feat(prometheus_services.py): emit proxy latency for successful llm api requests
uses prometheus histogram for this
2024-04-18 16:04:35 -07:00
Ishaan Jaff
e20b05d6dd fix trim messages to first 100 chars 2024-04-18 15:21:31 -07:00