Ishaan Jaff
8874eaa0b3
fix - track litellm_status=fail
2024-04-22 16:11:04 -07:00
Ishaan Jaff
517f577292
fix - dont send alert on fail request
2024-04-22 16:07:58 -07:00
Ishaan Jaff
50bbd188fb
ui - show all teams on ui
2024-04-22 14:15:50 -07:00
Ishaan Jaff
cd3b2a21c1
ui - find all teams
2024-04-22 14:15:09 -07:00
Ishaan Jaff
877c4e27f4
Merge pull request #3212 from BerriAI/ui_increase_default_session_time
...
UI - increase default session time to 2 hours
2024-04-22 13:46:18 -07:00
Ishaan Jaff
1a945092de
Merge pull request #3210 from BerriAI/litellm_ui_round_up_team_spend_2_decimals
...
[UI] round up team spend to 2 decimals + diversify legend for team spend
2024-04-22 13:45:52 -07:00
Krrish Dholakia
a520e1bd6f
fix(router.py): add random shuffle and tpm-based shuffle for async shuffle logic
2024-04-22 12:58:59 -07:00
Ishaan Jaff
bb065f64c6
increase ui default session time to 2 hours
2024-04-22 10:00:53 -07:00
Ishaan Jaff
f54982a560
fix - round spend to 2 decimals
2024-04-22 09:17:40 -07:00
Ishaan Jaff
094583f18e
feat - show langfuse trace in alerts
2024-04-22 08:51:46 -07:00
Ishaan Jaff
1dbd4f6d7b
ui - new build
2024-04-22 07:23:22 -07:00
Ishaan Jaff
ced29f6b76
ui - new build
2024-04-20 19:32:40 -07:00
Krrish Dholakia
9379e3d047
fix(lowest_tpm_rpm_v2.py): use a combined tpm+rpm query in async get cache, to reduce redis client calls in high traffic
2024-04-20 16:13:11 -07:00
Krrish Dholakia
b96741e4f4
fix(router.py): async simple-shuffle support
2024-04-20 15:01:12 -07:00
Ishaan Jaff
ddc71d766a
fix - slack alerting show input in the api_base
2024-04-20 13:16:47 -07:00
Ishaan Jaff
f89f8a4157
Merge pull request #3184 from BerriAI/litellm_ui_non_admins_flow
...
[UI] - non admin flow - only Create + Test Key available
2024-04-20 12:40:43 -07:00
Ishaan Jaff
07a10247db
fix - security fix
2024-04-20 12:10:08 -07:00
Ishaan Jaff
5d39865362
fix - audio_transcriptions security fix
2024-04-20 11:58:15 -07:00
Ishaan Jaff
fd282ea932
fix testing fixes
2024-04-20 11:48:41 -07:00
Ishaan Jaff
7ebf2ca4d9
(ci/cd) testing with team_id and /user/new
2024-04-20 11:09:34 -07:00
Krrish Dholakia
e5b98814ad
test(test_proxy_server.py): fix test to use valid redis host
2024-04-19 19:22:24 -07:00
Ishaan Jaff
00a07a99cd
fix - backend logic for non admin flow
2024-04-19 17:36:29 -07:00
Krish Dholakia
cc7f27252b
Merge pull request #3179 from BerriAI/litellm_dual_caching_fix
...
fix(caching.py): dual cache async_batch_get_cache fix + testing
2024-04-19 17:02:47 -07:00
Ishaan Jaff
def5ce4888
fix user needs to opt in to langfuse tags
2024-04-19 16:13:14 -07:00
Ishaan Jaff
48e5de4f0e
Merge pull request #3175 from BerriAI/litellm_log_team_alias_langfuse
...
[FEAT] Log team alias to langfuse
2024-04-19 15:30:59 -07:00
Krrish Dholakia
01a1a8f731
fix(caching.py): dual cache async_batch_get_cache fix + testing
...
this fixes a bug in usage-based-routing-v2 which was caused b/c of how the result was being returned from dual cache async_batch_get_cache. it also adds unit testing for that function (and it's sync equivalent)
2024-04-19 15:03:25 -07:00
Krrish Dholakia
3c6b6355c7
fix(ollama_chat.py): accept api key as a param for ollama calls
...
allows user to call hosted ollama endpoint using bearer token for auth
2024-04-19 13:02:13 -07:00
Ishaan Jaff
423121ff7d
feat - track team_alias is metadata for /chat, /embeddings
2024-04-19 10:52:54 -07:00
Ishaan Jaff
6d92b13c22
feat - log team_alias to langfuse
2024-04-19 10:29:42 -07:00
Ishaan Jaff
6f948cd559
fix - show api_base in hanging requests
2024-04-18 21:01:26 -07:00
Ishaan Jaff
f04604910b
fix - show api base on hanging requests
2024-04-18 20:57:22 -07:00
Ishaan Jaff
e95e7fde65
ui - new build
2024-04-18 20:18:50 -07:00
Ishaan Jaff
554c83fdaf
ui - show all alert types when getting all callbacks
2024-04-18 20:08:13 -07:00
Krish Dholakia
77a353d484
Merge pull request #3144 from BerriAI/litellm_prometheus_latency_tracking
...
feat(prometheus_services.py): emit proxy latency for successful llm api requests
2024-04-18 19:10:58 -07:00
Ishaan Jaff
d9091dcf97
fix order by spend
2024-04-18 17:33:38 -07:00
Ishaan Jaff
b669e2987b
fix return key aliases on /user/info
2024-04-18 17:16:52 -07:00
Krrish Dholakia
919a2876f1
fix(proxy/utils.py): add prometheus failed db request tracking
2024-04-18 16:30:29 -07:00
Krrish Dholakia
d61250109e
fix(proxy/utils.py): add call type and duration to proxy_logging failure calls
...
this is for tracking failed db requests on prometheus
2024-04-18 16:24:36 -07:00
Ishaan Jaff
eb04a929e6
Merge pull request #3112 from BerriAI/litellm_add_alert_types
...
[Feat] Allow user to select slack alert types to Opt In to
2024-04-18 16:21:33 -07:00
Ishaan Jaff
8958bbeac9
Merge pull request #3142 from BerriAI/litellm_slack_alerting_show_model_passed
...
[Fix] Show `model` passed on `"400: {'error': 'Invalid model name passed in mode` errors 👻
2024-04-18 16:18:17 -07:00
Krrish Dholakia
0f95a824c4
feat(prometheus_services.py): emit proxy latency for successful llm api requests
...
uses prometheus histogram for this
2024-04-18 16:04:35 -07:00
Ishaan Jaff
6cfa80c42a
Merge pull request #3140 from BerriAI/litellm_trim_messages
...
[Fix] Slack Alerting - trim messages to first 100 chars
2024-04-18 16:01:39 -07:00
Ishaan Jaff
b308f8c079
fix - show model passed in on Invalid model name passed in error
2024-04-18 15:43:30 -07:00
Ishaan Jaff
e20b05d6dd
fix trim messages to first 100 chars
2024-04-18 15:21:31 -07:00
Krrish Dholakia
28edb77350
fix(utils.py): support prometheus failed call metrics
2024-04-18 12:29:15 -07:00
Ishaan Jaff
1cda0db2ca
fix - test alerting
2024-04-18 11:40:40 -07:00
Ishaan Jaff
5e0dc57329
ui - set alert_types
2024-04-18 11:13:05 -07:00
Ishaan Jaff
a8e2ef79ef
add alert_types to config.yaml
2024-04-18 08:51:07 -07:00
Krish Dholakia
91fe668411
Merge pull request #3105 from BerriAI/litellm_fix_hashing
...
fix(_types.py): hash api key in UserAPIKeyAuth
2024-04-18 08:16:24 -07:00
Ishaan Jaff
beeee01199
feat return alert types on /config/get/callback
2024-04-17 21:02:10 -07:00