Commit graph

503 commits

Author SHA1 Message Date
Krish Dholakia
4c8b620352 Merge pull request #3685 from BerriAI/litellm_lago_integration
feat(lago.py): Enable Usage-based billing with lago
2024-05-16 13:09:48 -07:00
Krrish Dholakia
d167a9ea99 feat(lago.py): adding support for usage-based billing with lago
Closes https://github.com/BerriAI/litellm/issues/3639
2024-05-16 10:54:18 -07:00
Krish Dholakia
07a1cf39e8 Merge branch 'main' into litellm_end_user_cost_tracking 2024-05-15 22:27:04 -07:00
Ishaan Jaff
d257518e47 fix deep copy logic in during_call_hook 2024-05-15 17:34:54 -07:00
Ishaan Jaff
38a539baf4 show debug info on slack alerts 2024-05-15 13:23:44 -07:00
Krrish Dholakia
9945387d7a feat(proxy_server.py): new /end_user/info endpoint
get spend for a specific end-user
2024-05-15 09:50:52 -07:00
Krish Dholakia
64ca2fde53 Merge branch 'main' into litellm_region_based_routing 2024-05-08 22:19:51 -07:00
Krrish Dholakia
7c99808529 feat(proxy_server.py): add CRUD endpoints for 'end_user' management
allow admin to specify region + default models for end users
2024-05-08 18:50:36 -07:00
Krrish Dholakia
5f93cae3ff feat(proxy_server.py): return litellm version in response headers 2024-05-08 16:00:08 -07:00
Krrish Dholakia
e5c57cdddb feat(slack_alerting.py): reintegrate langfuse trace url for slack alerts
this ensures langfuse trace url returned in llm api exception err
2024-05-07 12:58:49 -07:00
Krrish Dholakia
2170e1723d feat(proxy_server.py): schedule slack daily report if enabled
if user enabled daily_reports, send them a slack report every 12 hours
2024-05-06 18:25:48 -07:00
Krrish Dholakia
0c1fc34e9f feat(slack_alerting.py): support sending daily reports on deployments
allow admin to easily know slow + failing deployments

Closes https://github.com/BerriAI/litellm/issues/3483
2024-05-06 17:18:42 -07:00
Krrish Dholakia
4a22948ce4 fix(proxy/utils.py): fix slack alerting to only raise alerts for llm api exceptions
don't spam for bad user requests. Closes https://github.com/BerriAI/litellm/issues/3395
2024-05-02 17:18:21 -07:00
Krrish Dholakia
c1ffa7d409 fix(proxy/utils.py): fix retry logic for generic data request 2024-05-02 14:50:50 -07:00
Krrish Dholakia
e7b3ac8e06 feat(openmeter.py): add support for user billing
open-meter supports user based billing. Closes https://github.com/BerriAI/litellm/issues/1268
2024-05-01 17:23:48 -07:00
Krrish Dholakia
0b2c3f78d4 fix(proxy/utils.py): emit number of spend transactions for keys being written to db in a batch 2024-05-01 08:25:04 -07:00
Krrish Dholakia
462724feef fix(proxy/utils.py): log rejected proxy requests to langfuse 2024-04-25 19:26:27 -07:00
Ishaan Jaff
2190b039fd pass alert type on alerting handle 2024-04-25 13:05:34 -07:00
Ishaan Jaff
c5a0b3a8d4 fix - refactor slack alerting 2024-04-23 18:34:11 -07:00
Ishaan Jaff
a3109db4e1 fix langfuse project id 2024-04-23 11:37:17 -07:00
Ishaan Jaff
d076aed9d0 fix - dont send alert on fail request 2024-04-22 16:07:58 -07:00
Ishaan Jaff
8fb9c8d833 ui - find all teams 2024-04-22 14:15:09 -07:00
Ishaan Jaff
aa365c5c4a feat - show langfuse trace in alerts 2024-04-22 08:51:46 -07:00
Ishaan Jaff
d4c9439cc0 fix - slack alerting show input in the api_base 2024-04-20 13:16:47 -07:00
Ishaan Jaff
828645137c feat - log team_alias to langfuse 2024-04-19 10:29:42 -07:00
Ishaan Jaff
532e252559 fix - show api_base in hanging requests 2024-04-18 21:01:26 -07:00
Ishaan Jaff
ecc770be00 fix - show api base on hanging requests 2024-04-18 20:57:22 -07:00
Ishaan Jaff
977b030dd9 ui - show all alert types when getting all callbacks 2024-04-18 20:08:13 -07:00
Krish Dholakia
741a18a040 Merge pull request #3144 from BerriAI/litellm_prometheus_latency_tracking
feat(prometheus_services.py): emit proxy latency for successful llm api requests
2024-04-18 19:10:58 -07:00
Ishaan Jaff
27333d17e2 fix order by spend 2024-04-18 17:33:38 -07:00
Ishaan Jaff
03b4652af1 fix return key aliases on /user/info 2024-04-18 17:16:52 -07:00
Krrish Dholakia
51cc8dd95b fix(proxy/utils.py): add prometheus failed db request tracking 2024-04-18 16:30:29 -07:00
Krrish Dholakia
cdfd873713 fix(proxy/utils.py): add call type and duration to proxy_logging failure calls
this is for tracking failed db requests on prometheus
2024-04-18 16:24:36 -07:00
Ishaan Jaff
bb07c5fdc5 Merge pull request #3112 from BerriAI/litellm_add_alert_types
[Feat] Allow user to select slack alert types to Opt In to
2024-04-18 16:21:33 -07:00
Krrish Dholakia
7f5bcf38b7 feat(prometheus_services.py): emit proxy latency for successful llm api requests
uses prometheus histogram for this
2024-04-18 16:04:35 -07:00
Ishaan Jaff
d6e3f587fe fix trim messages to first 100 chars 2024-04-18 15:21:31 -07:00
Ishaan Jaff
d178916048 fix - test alerting 2024-04-18 11:40:40 -07:00
Ishaan Jaff
58eea0f330 feat return alert types on /config/get/callback 2024-04-17 21:02:10 -07:00
Ishaan Jaff
a97f8a40c1 fix - user based alerting 2024-04-17 20:35:29 -07:00
Ishaan Jaff
2e62b0059c v0 add types of alerts to slack alerting 2024-04-17 18:16:19 -07:00
Ishaan Jaff
39488780e0 litellm_add_proxy_base_url in slack alerts 2024-04-17 17:42:28 -07:00
Krrish Dholakia
d75cfc5e32 fix(utils.py): return vertex api base for request hanging alerts 2024-04-16 17:53:28 -07:00
Krrish Dholakia
aa5da4346a fix(proxy_server.py): support tracking org spend
currently works when org set for jwt auth
2024-04-11 23:01:21 -07:00
Krrish Dholakia
07798af50d fix(proxy/utils.py): fix error message 2024-04-08 20:47:13 -07:00
Krrish Dholakia
da216c6915 fix(proxy_server.py): allow mapping a user to an org 2024-04-08 20:45:11 -07:00
Krrish Dholakia
0dad78b53c feat(proxy/utils.py): return api base for request hanging alerts 2024-04-06 15:58:53 -07:00
Krrish Dholakia
ece37a4b7f feat(ui): add models via ui
adds ability to add models via ui to the proxy. also fixes additional bugs around new /model/new endpoint
2024-04-04 18:56:20 -07:00
Krrish Dholakia
129bb52e9d fix(proxy_server.py): persist models added via /model/new to db
allows models to be used across instances

https://github.com/BerriAI/litellm/issues/2319, https://github.com/BerriAI/litellm/issues/2329
2024-04-03 20:16:41 -07:00
Krrish Dholakia
029ee15951 perf(proxy_server.py): batch write spend logs
reduces prisma client errors, by batch writing spend logs - max 1k logs at a time
2024-04-02 18:46:55 -07:00
Krrish Dholakia
e06d43dc90 fix(tpm_rpm_limiter.py): fix cache init logic 2024-04-01 18:01:38 -07:00