Commit graph

2000 commits

Author SHA1 Message Date
Ishaan Jaff
ddc71d766a fix - slack alerting show input in the api_base 2024-04-20 13:16:47 -07:00
Ishaan Jaff
f89f8a4157
Merge pull request #3184 from BerriAI/litellm_ui_non_admins_flow
[UI] - non admin flow - only Create + Test Key available
2024-04-20 12:40:43 -07:00
Ishaan Jaff
07a10247db fix - security fix 2024-04-20 12:10:08 -07:00
Ishaan Jaff
5d39865362 fix - audio_transcriptions security fix 2024-04-20 11:58:15 -07:00
Ishaan Jaff
fd282ea932 fix testing fixes 2024-04-20 11:48:41 -07:00
Ishaan Jaff
7ebf2ca4d9 (ci/cd) testing with team_id and /user/new 2024-04-20 11:09:34 -07:00
Krrish Dholakia
e5b98814ad test(test_proxy_server.py): fix test to use valid redis host 2024-04-19 19:22:24 -07:00
Ishaan Jaff
00a07a99cd fix - backend logic for non admin flow 2024-04-19 17:36:29 -07:00
Krish Dholakia
cc7f27252b
Merge pull request #3179 from BerriAI/litellm_dual_caching_fix
fix(caching.py): dual cache async_batch_get_cache fix + testing
2024-04-19 17:02:47 -07:00
Ishaan Jaff
def5ce4888 fix user needs to opt in to langfuse tags 2024-04-19 16:13:14 -07:00
Ishaan Jaff
48e5de4f0e
Merge pull request #3175 from BerriAI/litellm_log_team_alias_langfuse
[FEAT] Log team alias to langfuse
2024-04-19 15:30:59 -07:00
Krrish Dholakia
01a1a8f731 fix(caching.py): dual cache async_batch_get_cache fix + testing
this fixes a bug in usage-based-routing-v2 that was caused by how the result was being returned from dual cache async_batch_get_cache. it also adds unit testing for that function (and its sync equivalent)
2024-04-19 15:03:25 -07:00
Krrish Dholakia
3c6b6355c7 fix(ollama_chat.py): accept api key as a param for ollama calls
allows user to call hosted ollama endpoint using bearer token for auth
2024-04-19 13:02:13 -07:00
Ishaan Jaff
423121ff7d feat - track team_alias in metadata for /chat, /embeddings 2024-04-19 10:52:54 -07:00
Ishaan Jaff
6d92b13c22 feat - log team_alias to langfuse 2024-04-19 10:29:42 -07:00
Ishaan Jaff
6f948cd559 fix - show api_base in hanging requests 2024-04-18 21:01:26 -07:00
Ishaan Jaff
f04604910b fix - show api base on hanging requests 2024-04-18 20:57:22 -07:00
Ishaan Jaff
e95e7fde65 ui - new build 2024-04-18 20:18:50 -07:00
Ishaan Jaff
554c83fdaf ui - show all alert types when getting all callbacks 2024-04-18 20:08:13 -07:00
Krish Dholakia
77a353d484
Merge pull request #3144 from BerriAI/litellm_prometheus_latency_tracking
feat(prometheus_services.py): emit proxy latency for successful llm api requests
2024-04-18 19:10:58 -07:00
Ishaan Jaff
d9091dcf97 fix order by spend 2024-04-18 17:33:38 -07:00
Ishaan Jaff
b669e2987b fix return key aliases on /user/info 2024-04-18 17:16:52 -07:00
Krrish Dholakia
919a2876f1 fix(proxy/utils.py): add prometheus failed db request tracking 2024-04-18 16:30:29 -07:00
Krrish Dholakia
d61250109e fix(proxy/utils.py): add call type and duration to proxy_logging failure calls
this is for tracking failed db requests on prometheus
2024-04-18 16:24:36 -07:00
Ishaan Jaff
eb04a929e6
Merge pull request #3112 from BerriAI/litellm_add_alert_types
[Feat] Allow user to select slack alert types to Opt In to
2024-04-18 16:21:33 -07:00
Ishaan Jaff
8958bbeac9
Merge pull request #3142 from BerriAI/litellm_slack_alerting_show_model_passed
[Fix] Show `model` passed on `"400: {'error': 'Invalid model name passed in mode` errors 👻
2024-04-18 16:18:17 -07:00
Krrish Dholakia
0f95a824c4 feat(prometheus_services.py): emit proxy latency for successful llm api requests
uses prometheus histogram for this
2024-04-18 16:04:35 -07:00
Ishaan Jaff
6cfa80c42a
Merge pull request #3140 from BerriAI/litellm_trim_messages
[Fix] Slack Alerting - trim messages to first 100 chars
2024-04-18 16:01:39 -07:00
Ishaan Jaff
b308f8c079 fix - show model passed in on Invalid model name passed in error 2024-04-18 15:43:30 -07:00
Ishaan Jaff
e20b05d6dd fix trim messages to first 100 chars 2024-04-18 15:21:31 -07:00
Krrish Dholakia
28edb77350 fix(utils.py): support prometheus failed call metrics 2024-04-18 12:29:15 -07:00
Ishaan Jaff
1cda0db2ca fix - test alerting 2024-04-18 11:40:40 -07:00
Ishaan Jaff
5e0dc57329 ui - set alert_types 2024-04-18 11:13:05 -07:00
Ishaan Jaff
a8e2ef79ef add alert_types to config.yaml 2024-04-18 08:51:07 -07:00
Krish Dholakia
91fe668411
Merge pull request #3105 from BerriAI/litellm_fix_hashing
fix(_types.py): hash api key in UserAPIKeyAuth
2024-04-18 08:16:24 -07:00
Ishaan Jaff
beeee01199 feat return alert types on /config/get/callback 2024-04-17 21:02:10 -07:00
Ishaan Jaff
3e8e0beb25
Merge pull request #3114 from BerriAI/litellm_fix_alerting_users
[FIX] Repeat Slack Alerts triggered for "User Crossed Budget"
2024-04-17 20:38:27 -07:00
Ishaan Jaff
9a5fd07f16 fix - user based alerting 2024-04-17 20:35:29 -07:00
Krrish Dholakia
18e3cf8bff fix(utils.py): support azure mistral function calling 2024-04-17 19:10:26 -07:00
Ishaan Jaff
c3dcbf2067 ui - new build 2024-04-17 18:22:23 -07:00
Ishaan Jaff
cd8d3d4788 ui - new build 2024-04-17 18:20:45 -07:00
Ishaan Jaff
7178da3144 bump: version 1.35.10 → 1.35.11 2024-04-17 18:20:31 -07:00
Ishaan Jaff
52d7fc22bb v0 add types of alerts to slack alerting 2024-04-17 18:16:19 -07:00
Ishaan Jaff
12a01ba096 litellm_add_proxy_base_url in slack alerts 2024-04-17 17:42:28 -07:00
Krrish Dholakia
bafb008b44 fix(_types.py): hash api key, if given
fixes https://github.com/BerriAI/litellm/issues/2114
2024-04-17 16:43:35 -07:00
Ishaan Jaff
336de9eeeb ui - dont include model/metrics in schema 2024-04-17 15:30:06 -07:00
Krrish Dholakia
2ffd057042 test(test_models.py): ensure only admin can call /health 2024-04-16 18:13:40 -07:00
Krrish Dholakia
dd0fddb2e1 build(ui): add ui updates 2024-04-16 17:56:35 -07:00
Krrish Dholakia
f4b595ce71 fix(utils.py): return vertex api base for request hanging alerts 2024-04-16 17:53:28 -07:00
Krish Dholakia
1cdf733341
Merge pull request #3081 from BerriAI/litellm_ui_fixes_2
build(ui): view_key_table.tsx
2024-04-16 17:19:41 -07:00