Ishaan Jaff | c5a0b3a8d4 | fix - refactor slack alerting | 2024-04-23 18:34:11 -07:00 |
Ishaan Jaff | a3109db4e1 | fix langfuse project id | 2024-04-23 11:37:17 -07:00 |
Ishaan Jaff | d076aed9d0 | fix - dont send alert on fail request | 2024-04-22 16:07:58 -07:00 |
Ishaan Jaff | 8fb9c8d833 | ui - find all teams | 2024-04-22 14:15:09 -07:00 |
Ishaan Jaff | aa365c5c4a | feat - show langfuse trace in alerts | 2024-04-22 08:51:46 -07:00 |
Ishaan Jaff | d4c9439cc0 | fix - slack alerting show input in the api_base | 2024-04-20 13:16:47 -07:00 |
Ishaan Jaff | 828645137c | feat - log team_alias to langfuse | 2024-04-19 10:29:42 -07:00 |
Ishaan Jaff | 532e252559 | fix - show api_base in hanging requests | 2024-04-18 21:01:26 -07:00 |
Ishaan Jaff | ecc770be00 | fix - show api base on hanging requests | 2024-04-18 20:57:22 -07:00 |
Ishaan Jaff | 977b030dd9 | ui - show all alert types when getting all callbacks | 2024-04-18 20:08:13 -07:00 |
Krish Dholakia | 741a18a040 | Merge pull request #3144 from BerriAI/litellm_prometheus_latency_tracking; feat(prometheus_services.py): emit proxy latency for successful llm api requests | 2024-04-18 19:10:58 -07:00 |
Ishaan Jaff | 27333d17e2 | fix order by spend | 2024-04-18 17:33:38 -07:00 |
Ishaan Jaff | 03b4652af1 | fix return key aliases on /user/info | 2024-04-18 17:16:52 -07:00 |
Krrish Dholakia | 51cc8dd95b | fix(proxy/utils.py): add prometheus failed db request tracking | 2024-04-18 16:30:29 -07:00 |
Krrish Dholakia | cdfd873713 | fix(proxy/utils.py): add call type and duration to proxy_logging failure calls; this is for tracking failed db requests on prometheus | 2024-04-18 16:24:36 -07:00 |
Ishaan Jaff | bb07c5fdc5 | Merge pull request #3112 from BerriAI/litellm_add_alert_types; [Feat] Allow user to select slack alert types to Opt In to | 2024-04-18 16:21:33 -07:00 |
Krrish Dholakia | 7f5bcf38b7 | feat(prometheus_services.py): emit proxy latency for successful llm api requests; uses prometheus histogram for this | 2024-04-18 16:04:35 -07:00 |
Ishaan Jaff | d6e3f587fe | fix trim messages to first 100 chars | 2024-04-18 15:21:31 -07:00 |
Ishaan Jaff | d178916048 | fix - test alerting | 2024-04-18 11:40:40 -07:00 |
Ishaan Jaff | 58eea0f330 | feat return alert types on /config/get/callback | 2024-04-17 21:02:10 -07:00 |
Ishaan Jaff | a97f8a40c1 | fix - user based alerting | 2024-04-17 20:35:29 -07:00 |
Ishaan Jaff | 2e62b0059c | v0 add types of alerts to slack alerting | 2024-04-17 18:16:19 -07:00 |
Ishaan Jaff | 39488780e0 | litellm_add_proxy_base_url in slack alerts | 2024-04-17 17:42:28 -07:00 |
Krrish Dholakia | d75cfc5e32 | fix(utils.py): return vertex api base for request hanging alerts | 2024-04-16 17:53:28 -07:00 |
Krrish Dholakia | aa5da4346a | fix(proxy_server.py): support tracking org spend; currently works when org set for jwt auth | 2024-04-11 23:01:21 -07:00 |
Krrish Dholakia | 07798af50d | fix(proxy/utils.py): fix error message | 2024-04-08 20:47:13 -07:00 |
Krrish Dholakia | da216c6915 | fix(proxy_server.py): allow mapping a user to an org | 2024-04-08 20:45:11 -07:00 |
Krrish Dholakia | 0dad78b53c | feat(proxy/utils.py): return api base for request hanging alerts | 2024-04-06 15:58:53 -07:00 |
Krrish Dholakia | ece37a4b7f | feat(ui): add models via ui; adds ability to add models via ui to the proxy. also fixes additional bugs around new /model/new endpoint | 2024-04-04 18:56:20 -07:00 |
Krrish Dholakia | 129bb52e9d | fix(proxy_server.py): persist models added via /model/new to db; allows models to be used across instances; https://github.com/BerriAI/litellm/issues/2319 , https://github.com/BerriAI/litellm/issues/2329 | 2024-04-03 20:16:41 -07:00 |
Krrish Dholakia | 029ee15951 | perf(proxy_server.py): batch write spend logs; reduces prisma client errors, by batch writing spend logs - max 1k logs at a time | 2024-04-02 18:46:55 -07:00 |
Krrish Dholakia | e06d43dc90 | fix(tpm_rpm_limiter.py): fix cache init logic | 2024-04-01 18:01:38 -07:00 |
Krrish Dholakia | 8d35e659ad | fix(proxy/utils.py): support redis caching for alerting | 2024-04-01 16:13:59 -07:00 |
Krrish Dholakia | 60b9e25e3c | fix(proxy/utils.py): uncomment max parallel request limit check | 2024-03-30 20:51:59 -07:00 |
Krrish Dholakia | 7738107d49 | fix(utils.py): set redis_usage_cache to none by default | 2024-03-30 20:10:56 -07:00 |
Krrish Dholakia | 555f0af027 | fix(tpm_rpm_limiter.py): enable redis caching for tpm/rpm checks on keys/user/teams; allows tpm/rpm checks to work across instances; https://github.com/BerriAI/litellm/issues/2730 | 2024-03-30 20:01:36 -07:00 |
Krrish Dholakia | 49e2624240 | fix(proxy_server.py): enforce end user budgets with 'litellm.max_end_user_budget' param | 2024-03-29 17:14:40 -07:00 |
Krrish Dholakia | 6848e3b1d2 | fix(proxy_server.py): enable spend tracking for team-based jwt auth | 2024-03-28 20:16:22 -07:00 |
Krrish Dholakia | 473bab8a19 | refactor(proxy/utils.py): add more debug logs | 2024-03-28 18:44:35 -07:00 |
Krish Dholakia | b828290c81 | Merge pull request #2722 from BerriAI/litellm_db_perf_improvement; feat(proxy/utils.py): enable updating db in a separate server | 2024-03-28 14:56:14 -07:00 |
Krrish Dholakia | eca5d04126 | test(test_update_spend.py): allow db_client to be none | 2024-03-28 13:44:40 -07:00 |
Krrish Dholakia | e87c5f5d6f | fix(proxy_server.py): allow user to pass in spend logs collector url | 2024-03-28 09:14:30 -07:00 |
Ishaan Jaff | c96e1af901 | Merge pull request #2728 from BerriAI/litellm_reduce_deep_copies; [FEAT] Proxy - reduce deep copies | 2024-03-27 21:26:09 -07:00 |
Ishaan Jaff | 7c32955f64 | (fix) remove deep copy from all responses | 2024-03-27 20:36:53 -07:00 |
Krrish Dholakia | 7fe02405e0 | fix(proxy/utils.py): check cache before alerting user | 2024-03-27 20:09:15 -07:00 |
Krrish Dholakia | 0417ce6cbe | feat(auth_checks.py): enable admin to enforce 'user' param for all openai endpoints | 2024-03-27 17:36:27 -07:00 |
Krrish Dholakia | 46937935d1 | feat(proxy/utils.py): enable updating db in a separate server | 2024-03-27 16:02:36 -07:00 |
Krrish Dholakia | 7bc76ddbc3 | feat(llm_guard.py): enable key-specific llm guard check | 2024-03-26 17:21:51 -07:00 |
Ishaan Jaff | f0992c2dbd | (fix) stop using f strings with logger | 2024-03-25 10:47:18 -07:00 |
Ishaan Jaff | 2c01457a4b | (feat) stop eagerly evaluating fstring | 2024-03-25 09:01:42 -07:00 |
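
Note on the two oldest entries (f0992c2dbd, 2c01457a4b): the diffs are not shown here, so the following is only a generic sketch of why f-strings are usually avoided in logger calls, not the project's actual change. The `CountingRepr` class and the logger name are hypothetical and exist only for illustration. An f-string is formatted before the logging call runs, even when the record is later discarded, whereas `%s`-style arguments are formatted only if the record is emitted.

```python
import logging


class CountingRepr:
    """Hypothetical payload that counts how often it is converted to text."""

    def __init__(self):
        self.format_calls = 0

    def __str__(self):
        self.format_calls += 1
        return "large request payload"


# Hypothetical logger; level INFO means DEBUG records are discarded.
logger = logging.getLogger("lazy_logging_demo")
logger.setLevel(logging.INFO)
logger.addHandler(logging.NullHandler())

# Eager: the f-string is built before debug() is even called,
# so the payload is formatted although nothing is logged.
eager = CountingRepr()
logger.debug(f"request data: {eager}")

# Lazy: the logger receives the template and the argument separately
# and skips formatting because DEBUG is not enabled.
lazy = CountingRepr()
logger.debug("request data: %s", lazy)

print(eager.format_calls, lazy.format_calls)
```

The saving matters most when the interpolated value is expensive to render, such as a full request body on a hot path.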