Krish Dholakia
|
8ad979cdfe
|
Merge branch 'main' into litellm_region_based_routing
|
2024-05-08 22:19:51 -07:00 |
|
Krrish Dholakia
|
db666b01e5
|
feat(proxy_server.py): add CRUD endpoints for 'end_user' management
allow admin to specify region + default models for end users
|
2024-05-08 18:50:36 -07:00 |
|
Krrish Dholakia
|
6575143460
|
feat(proxy_server.py): return litellm version in response headers
|
2024-05-08 16:00:08 -07:00 |
|
Krrish Dholakia
|
872470ff1f
|
feat(slack_alerting.py): reintegrate langfuse trace url for slack alerts
this ensures langfuse trace url returned in llm api exception err
|
2024-05-07 12:58:49 -07:00 |
|
Krrish Dholakia
|
6b9b4f05ba
|
feat(proxy_server.py): schedule slack daily report if enabled
if user enabled daily_reports, send them a slack report every 12 hours
|
2024-05-06 18:25:48 -07:00 |
|
Krrish Dholakia
|
718f423d7d
|
feat(slack_alerting.py): support sending daily reports on deployments
allow admin to easily know slow + failing deployments
Closes https://github.com/BerriAI/litellm/issues/3483
|
2024-05-06 17:18:42 -07:00 |
|
Krrish Dholakia
|
fdc4fdb91a
|
fix(proxy/utils.py): fix slack alerting to only raise alerts for llm api exceptions
don't spam for bad user requests. Closes https://github.com/BerriAI/litellm/issues/3395
|
2024-05-02 17:18:21 -07:00 |
|
Krrish Dholakia
|
acda064be6
|
fix(proxy/utils.py): fix retry logic for generic data request
|
2024-05-02 14:50:50 -07:00 |
|
Krrish Dholakia
|
2a9651b3ca
|
feat(openmeter.py): add support for user billing
open-meter supports user based billing. Closes https://github.com/BerriAI/litellm/issues/1268
|
2024-05-01 17:23:48 -07:00 |
|
Krrish Dholakia
|
d0f9f8c0ed
|
fix(proxy/utils.py): emit number of spend transactions for keys being written to db in a batch
|
2024-05-01 08:25:04 -07:00 |
|
Krrish Dholakia
|
885de2e3c6
|
fix(proxy/utils.py): log rejected proxy requests to langfuse
|
2024-04-25 19:26:27 -07:00 |
|
Ishaan Jaff
|
1d5e70f7a0
|
pass alert type on alerting handle
|
2024-04-25 13:05:34 -07:00 |
|
Ishaan Jaff
|
31979f83d2
|
fix - refactor slack alerting
|
2024-04-23 18:34:11 -07:00 |
|
Ishaan Jaff
|
5118c96b4d
|
fix langfuse project id
|
2024-04-23 11:37:17 -07:00 |
|
Ishaan Jaff
|
517f577292
|
fix - dont send alert on fail request
|
2024-04-22 16:07:58 -07:00 |
|
Ishaan Jaff
|
cd3b2a21c1
|
ui - find all teams
|
2024-04-22 14:15:09 -07:00 |
|
Ishaan Jaff
|
094583f18e
|
feat - show langfuse trace in alerts
|
2024-04-22 08:51:46 -07:00 |
|
Ishaan Jaff
|
ddc71d766a
|
fix - slack alerting show input in the api_base
|
2024-04-20 13:16:47 -07:00 |
|
Ishaan Jaff
|
6d92b13c22
|
feat - log team_alias to langfuse
|
2024-04-19 10:29:42 -07:00 |
|
Ishaan Jaff
|
6f948cd559
|
fix - show api_base in hanging requests
|
2024-04-18 21:01:26 -07:00 |
|
Ishaan Jaff
|
f04604910b
|
fix - show api base on hanging requests
|
2024-04-18 20:57:22 -07:00 |
|
Ishaan Jaff
|
554c83fdaf
|
ui - show all alert types when getting all callbacks
|
2024-04-18 20:08:13 -07:00 |
|
Krish Dholakia
|
77a353d484
|
Merge pull request #3144 from BerriAI/litellm_prometheus_latency_tracking
feat(prometheus_services.py): emit proxy latency for successful llm api requests
|
2024-04-18 19:10:58 -07:00 |
|
Ishaan Jaff
|
d9091dcf97
|
fix order by spend
|
2024-04-18 17:33:38 -07:00 |
|
Ishaan Jaff
|
b669e2987b
|
fix return key aliases on /user/info
|
2024-04-18 17:16:52 -07:00 |
|
Krrish Dholakia
|
919a2876f1
|
fix(proxy/utils.py): add prometheus failed db request tracking
|
2024-04-18 16:30:29 -07:00 |
|
Krrish Dholakia
|
d61250109e
|
fix(proxy/utils.py): add call type and duration to proxy_logging failure calls
this is for tracking failed db requests on prometheus
|
2024-04-18 16:24:36 -07:00 |
|
Ishaan Jaff
|
eb04a929e6
|
Merge pull request #3112 from BerriAI/litellm_add_alert_types
[Feat] Allow user to select slack alert types to Opt In to
|
2024-04-18 16:21:33 -07:00 |
|
Krrish Dholakia
|
0f95a824c4
|
feat(prometheus_services.py): emit proxy latency for successful llm api requests
uses prometheus histogram for this
|
2024-04-18 16:04:35 -07:00 |
|
Ishaan Jaff
|
e20b05d6dd
|
fix trim messages to first 100 chars
|
2024-04-18 15:21:31 -07:00 |
|
Ishaan Jaff
|
1cda0db2ca
|
fix - test alerting
|
2024-04-18 11:40:40 -07:00 |
|
Ishaan Jaff
|
beeee01199
|
feat return alert types on /config/get/callback
|
2024-04-17 21:02:10 -07:00 |
|
Ishaan Jaff
|
9a5fd07f16
|
fix - user based alerting
|
2024-04-17 20:35:29 -07:00 |
|
Ishaan Jaff
|
52d7fc22bb
|
v0 add types of alerts to slack alerting
|
2024-04-17 18:16:19 -07:00 |
|
Ishaan Jaff
|
12a01ba096
|
litellm_add_proxy_base_url in slack alerts
|
2024-04-17 17:42:28 -07:00 |
|
Krrish Dholakia
|
f4b595ce71
|
fix(utils.py): return vertex api base for request hanging alerts
|
2024-04-16 17:53:28 -07:00 |
|
Krrish Dholakia
|
f4c7f4f901
|
fix(proxy_server.py): support tracking org spend
currently works when org set for jwt auth
|
2024-04-11 23:01:21 -07:00 |
|
Krrish Dholakia
|
470b7b64c9
|
fix(proxy/utils.py): fix error message
|
2024-04-08 20:47:13 -07:00 |
|
Krrish Dholakia
|
6c1444bfaa
|
fix(proxy_server.py): allow mapping a user to an org
|
2024-04-08 20:45:11 -07:00 |
|
Krrish Dholakia
|
6110d32b1c
|
feat(proxy/utils.py): return api base for request hanging alerts
|
2024-04-06 15:58:53 -07:00 |
|
Krrish Dholakia
|
e3c2bdef4d
|
feat(ui): add models via ui
adds ability to add models via ui to the proxy. also fixes additional bugs around new /model/new endpoint
|
2024-04-04 18:56:20 -07:00 |
|
Krrish Dholakia
|
f536fb13e6
|
fix(proxy_server.py): persist models added via /model/new to db
allows models to be used across instances
https://github.com/BerriAI/litellm/issues/2319 , https://github.com/BerriAI/litellm/issues/2329
|
2024-04-03 20:16:41 -07:00 |
|
Krrish Dholakia
|
d7601a4844
|
perf(proxy_server.py): batch write spend logs
reduces prisma client errors, by batch writing spend logs - max 1k logs at a time
|
2024-04-02 18:46:55 -07:00 |
|
Krrish Dholakia
|
6467dd4e11
|
fix(tpm_rpm_limiter.py): fix cache init logic
|
2024-04-01 18:01:38 -07:00 |
|
Krrish Dholakia
|
9c0aecf9b8
|
fix(proxy/utils.py): support redis caching for alerting
|
2024-04-01 16:13:59 -07:00 |
|
Krrish Dholakia
|
3b8e7241b4
|
fix(proxy/utils.py): uncomment max parallel request limit check
|
2024-03-30 20:51:59 -07:00 |
|
Krrish Dholakia
|
d9ff13b624
|
fix(utils.py): set redis_usage_cache to none by default
|
2024-03-30 20:10:56 -07:00 |
|
Krrish Dholakia
|
f58fefd589
|
fix(tpm_rpm_limiter.py): enable redis caching for tpm/rpm checks on keys/user/teams
allows tpm/rpm checks to work across instances
https://github.com/BerriAI/litellm/issues/2730
|
2024-03-30 20:01:36 -07:00 |
|
Krrish Dholakia
|
5280fc809f
|
fix(proxy_server.py): enforce end user budgets with 'litellm.max_end_user_budget' param
|
2024-03-29 17:14:40 -07:00 |
|
Krrish Dholakia
|
c15ba368e7
|
fix(proxy_server.py): enable spend tracking for team-based jwt auth
|
2024-03-28 20:16:22 -07:00 |
|