Krish Dholakia
|
4c8b620352
|
Merge pull request #3685 from BerriAI/litellm_lago_integration
feat(lago.py): Enable Usage-based billing with lago
|
2024-05-16 13:09:48 -07:00 |
|
Krrish Dholakia
|
d167a9ea99
|
feat(lago.py): adding support for usage-based billing with lago
Closes https://github.com/BerriAI/litellm/issues/3639
|
2024-05-16 10:54:18 -07:00 |
|
Krish Dholakia
|
07a1cf39e8
|
Merge branch 'main' into litellm_end_user_cost_tracking
|
2024-05-15 22:27:04 -07:00 |
|
Ishaan Jaff
|
d257518e47
|
fix deep copy logic in during_call_hook
|
2024-05-15 17:34:54 -07:00 |
|
Ishaan Jaff
|
38a539baf4
|
show debug info on slack alerts
|
2024-05-15 13:23:44 -07:00 |
|
Krrish Dholakia
|
9945387d7a
|
feat(proxy_server.py): new /end_user/info endpoint
get spend for a specific end-user
|
2024-05-15 09:50:52 -07:00 |
|
Krish Dholakia
|
64ca2fde53
|
Merge branch 'main' into litellm_region_based_routing
|
2024-05-08 22:19:51 -07:00 |
|
Krrish Dholakia
|
7c99808529
|
feat(proxy_server.py): add CRUD endpoints for 'end_user' management
allow admin to specify region + default models for end users
|
2024-05-08 18:50:36 -07:00 |
|
Krrish Dholakia
|
5f93cae3ff
|
feat(proxy_server.py): return litellm version in response headers
|
2024-05-08 16:00:08 -07:00 |
|
Krrish Dholakia
|
e5c57cdddb
|
feat(slack_alerting.py): reintegrate langfuse trace url for slack alerts
this ensures langfuse trace url returned in llm api exception err
|
2024-05-07 12:58:49 -07:00 |
|
Krrish Dholakia
|
2170e1723d
|
feat(proxy_server.py): schedule slack daily report if enabled
if user enabled daily_reports, send them a slack report every 12 hours
|
2024-05-06 18:25:48 -07:00 |
|
Krrish Dholakia
|
0c1fc34e9f
|
feat(slack_alerting.py): support sending daily reports on deployments
allow admin to easily know slow + failing deployments
Closes https://github.com/BerriAI/litellm/issues/3483
|
2024-05-06 17:18:42 -07:00 |
|
Krrish Dholakia
|
4a22948ce4
|
fix(proxy/utils.py): fix slack alerting to only raise alerts for llm api exceptions
don't spam for bad user requests. Closes https://github.com/BerriAI/litellm/issues/3395
|
2024-05-02 17:18:21 -07:00 |
|
Krrish Dholakia
|
c1ffa7d409
|
fix(proxy/utils.py): fix retry logic for generic data request
|
2024-05-02 14:50:50 -07:00 |
|
Krrish Dholakia
|
e7b3ac8e06
|
feat(openmeter.py): add support for user billing
open-meter supports user based billing. Closes https://github.com/BerriAI/litellm/issues/1268
|
2024-05-01 17:23:48 -07:00 |
|
Krrish Dholakia
|
0b2c3f78d4
|
fix(proxy/utils.py): emit number of spend transactions for keys being written to db in a batch
|
2024-05-01 08:25:04 -07:00 |
|
Krrish Dholakia
|
462724feef
|
fix(proxy/utils.py): log rejected proxy requests to langfuse
|
2024-04-25 19:26:27 -07:00 |
|
Ishaan Jaff
|
2190b039fd
|
pass alert type on alerting handle
|
2024-04-25 13:05:34 -07:00 |
|
Ishaan Jaff
|
c5a0b3a8d4
|
fix - refactor slack alerting
|
2024-04-23 18:34:11 -07:00 |
|
Ishaan Jaff
|
a3109db4e1
|
fix langfuse project id
|
2024-04-23 11:37:17 -07:00 |
|
Ishaan Jaff
|
d076aed9d0
|
fix - don't send alert on fail request
|
2024-04-22 16:07:58 -07:00 |
|
Ishaan Jaff
|
8fb9c8d833
|
ui - find all teams
|
2024-04-22 14:15:09 -07:00 |
|
Ishaan Jaff
|
aa365c5c4a
|
feat - show langfuse trace in alerts
|
2024-04-22 08:51:46 -07:00 |
|
Ishaan Jaff
|
d4c9439cc0
|
fix - slack alerting show input in the api_base
|
2024-04-20 13:16:47 -07:00 |
|
Ishaan Jaff
|
828645137c
|
feat - log team_alias to langfuse
|
2024-04-19 10:29:42 -07:00 |
|
Ishaan Jaff
|
532e252559
|
fix - show api_base in hanging requests
|
2024-04-18 21:01:26 -07:00 |
|
Ishaan Jaff
|
ecc770be00
|
fix - show api base on hanging requests
|
2024-04-18 20:57:22 -07:00 |
|
Ishaan Jaff
|
977b030dd9
|
ui - show all alert types when getting all callbacks
|
2024-04-18 20:08:13 -07:00 |
|
Krish Dholakia
|
741a18a040
|
Merge pull request #3144 from BerriAI/litellm_prometheus_latency_tracking
feat(prometheus_services.py): emit proxy latency for successful llm api requests
|
2024-04-18 19:10:58 -07:00 |
|
Ishaan Jaff
|
27333d17e2
|
fix order by spend
|
2024-04-18 17:33:38 -07:00 |
|
Ishaan Jaff
|
03b4652af1
|
fix return key aliases on /user/info
|
2024-04-18 17:16:52 -07:00 |
|
Krrish Dholakia
|
51cc8dd95b
|
fix(proxy/utils.py): add prometheus failed db request tracking
|
2024-04-18 16:30:29 -07:00 |
|
Krrish Dholakia
|
cdfd873713
|
fix(proxy/utils.py): add call type and duration to proxy_logging failure calls
this is for tracking failed db requests on prometheus
|
2024-04-18 16:24:36 -07:00 |
|
Ishaan Jaff
|
bb07c5fdc5
|
Merge pull request #3112 from BerriAI/litellm_add_alert_types
[Feat] Allow user to select slack alert types to Opt In to
|
2024-04-18 16:21:33 -07:00 |
|
Krrish Dholakia
|
7f5bcf38b7
|
feat(prometheus_services.py): emit proxy latency for successful llm api requests
uses prometheus histogram for this
|
2024-04-18 16:04:35 -07:00 |
|
Ishaan Jaff
|
d6e3f587fe
|
fix trim messages to first 100 chars
|
2024-04-18 15:21:31 -07:00 |
|
Ishaan Jaff
|
d178916048
|
fix - test alerting
|
2024-04-18 11:40:40 -07:00 |
|
Ishaan Jaff
|
58eea0f330
|
feat return alert types on /config/get/callback
|
2024-04-17 21:02:10 -07:00 |
|
Ishaan Jaff
|
a97f8a40c1
|
fix - user based alerting
|
2024-04-17 20:35:29 -07:00 |
|
Ishaan Jaff
|
2e62b0059c
|
v0 add types of alerts to slack alerting
|
2024-04-17 18:16:19 -07:00 |
|
Ishaan Jaff
|
39488780e0
|
litellm_add_proxy_base_url in slack alerts
|
2024-04-17 17:42:28 -07:00 |
|
Krrish Dholakia
|
d75cfc5e32
|
fix(utils.py): return vertex api base for request hanging alerts
|
2024-04-16 17:53:28 -07:00 |
|
Krrish Dholakia
|
aa5da4346a
|
fix(proxy_server.py): support tracking org spend
currently works when org set for jwt auth
|
2024-04-11 23:01:21 -07:00 |
|
Krrish Dholakia
|
07798af50d
|
fix(proxy/utils.py): fix error message
|
2024-04-08 20:47:13 -07:00 |
|
Krrish Dholakia
|
da216c6915
|
fix(proxy_server.py): allow mapping a user to an org
|
2024-04-08 20:45:11 -07:00 |
|
Krrish Dholakia
|
0dad78b53c
|
feat(proxy/utils.py): return api base for request hanging alerts
|
2024-04-06 15:58:53 -07:00 |
|
Krrish Dholakia
|
ece37a4b7f
|
feat(ui): add models via ui
adds ability to add models via ui to the proxy. also fixes additional bugs around new /model/new endpoint
|
2024-04-04 18:56:20 -07:00 |
|
Krrish Dholakia
|
129bb52e9d
|
fix(proxy_server.py): persist models added via /model/new to db
allows models to be used across instances
https://github.com/BerriAI/litellm/issues/2319 , https://github.com/BerriAI/litellm/issues/2329
|
2024-04-03 20:16:41 -07:00 |
|
Krrish Dholakia
|
029ee15951
|
perf(proxy_server.py): batch write spend logs
reduces prisma client errors, by batch writing spend logs - max 1k logs at a time
|
2024-04-02 18:46:55 -07:00 |
|
Krrish Dholakia
|
e06d43dc90
|
fix(tpm_rpm_limiter.py): fix cache init logic
|
2024-04-01 18:01:38 -07:00 |
|