Ishaan Jaff
9017d9bb81
backend allow filtering by model_group
2024-04-23 22:03:20 -07:00
Krrish Dholakia
9e3514fd24
fix(langsmith.py): fix streaming langsmith calls
2024-04-23 20:47:19 -07:00
Ishaan Jaff
ac6809e9df
ui - filter by time and deployments
2024-04-23 20:42:15 -07:00
Krish Dholakia
4acdde988f
Merge pull request #3250 from BerriAI/litellm_caching_no_cache_fix
...
fix(utils.py): fix 'no-cache': true when caching is turned on
2024-04-23 19:57:07 -07:00
Ishaan Jaff
3b860a7f20
Merge pull request #3255 from BerriAI/litellm_refactor_slack_alerting
...
[Fix] Refactor slack alerting, use one httpx client
2024-04-23 19:45:52 -07:00
Krrish Dholakia
df60e475e8
build(requirements.txt): pin all dependency versions
2024-04-23 19:18:19 -07:00
Ishaan Jaff
9d18e4770d
fix using slack alerting through admin ui
2024-04-23 19:05:50 -07:00
Ishaan Jaff
31979f83d2
fix - refactor slack alerting
2024-04-23 18:34:11 -07:00
Krrish Dholakia
161e836427
fix(utils.py): fix 'no-cache': true when caching is turned on
2024-04-23 12:58:30 -07:00
Ishaan Jaff
e15ce79833
Merge pull request #3244 from BerriAI/litellm_fix_linking_langfuse_slack_alerts
...
[Fix] Linking Langfuse Projects to Slack Alerts
2024-04-23 12:53:18 -07:00
Ishaan Jaff
5118c96b4d
fix langfuse project id
2024-04-23 11:37:17 -07:00
Krrish Dholakia
9d2726c2ac
fix(proxy_server.py): handle router being initialized without a model list
2024-04-23 10:52:28 -07:00
Ishaan Jaff
7e9587c102
ui - new build
2024-04-22 18:16:54 -07:00
Ishaan Jaff
fdf432798e
Merge pull request #3228 from BerriAI/litellm_ui_polish
...
[Fix] Non-Admin SSO Login
2024-04-22 18:15:10 -07:00
Ishaan Jaff
9250f61a4c
fix - sso login for non admins
2024-04-22 17:57:47 -07:00
Ishaan Jaff
8874eaa0b3
fix - track litellm_status=fail
2024-04-22 16:11:04 -07:00
Ishaan Jaff
517f577292
fix - dont send alert on fail request
2024-04-22 16:07:58 -07:00
Ishaan Jaff
50bbd188fb
ui - show all teams on ui
2024-04-22 14:15:50 -07:00
Ishaan Jaff
cd3b2a21c1
ui - find all teams
2024-04-22 14:15:09 -07:00
Ishaan Jaff
877c4e27f4
Merge pull request #3212 from BerriAI/ui_increase_default_session_time
...
UI - increase default session time to 2 hours
2024-04-22 13:46:18 -07:00
Ishaan Jaff
1a945092de
Merge pull request #3210 from BerriAI/litellm_ui_round_up_team_spend_2_decimals
...
[UI] round up team spend to 2 decimals + diversify legend for team spend
2024-04-22 13:45:52 -07:00
Krrish Dholakia
a520e1bd6f
fix(router.py): add random shuffle and tpm-based shuffle for async shuffle logic
2024-04-22 12:58:59 -07:00
Ishaan Jaff
bb065f64c6
increase ui default session time to 2 hours
2024-04-22 10:00:53 -07:00
Ishaan Jaff
f54982a560
fix - round spend to 2 decimals
2024-04-22 09:17:40 -07:00
Ishaan Jaff
094583f18e
feat - show langfuse trace in alerts
2024-04-22 08:51:46 -07:00
Ishaan Jaff
1dbd4f6d7b
ui - new build
2024-04-22 07:23:22 -07:00
Ishaan Jaff
ced29f6b76
ui - new build
2024-04-20 19:32:40 -07:00
Krrish Dholakia
9379e3d047
fix(lowest_tpm_rpm_v2.py): use a combined tpm+rpm query in async get cache, to reduce redis client calls in high traffic
2024-04-20 16:13:11 -07:00
Krrish Dholakia
b96741e4f4
fix(router.py): async simple-shuffle support
2024-04-20 15:01:12 -07:00
Ishaan Jaff
ddc71d766a
fix - slack alerting show input in the api_base
2024-04-20 13:16:47 -07:00
Ishaan Jaff
f89f8a4157
Merge pull request #3184 from BerriAI/litellm_ui_non_admins_flow
...
[UI] - non admin flow - only Create + Test Key available
2024-04-20 12:40:43 -07:00
Ishaan Jaff
07a10247db
fix - security fix
2024-04-20 12:10:08 -07:00
Ishaan Jaff
5d39865362
fix - audio_transcriptions security fix
2024-04-20 11:58:15 -07:00
Ishaan Jaff
fd282ea932
fix testing fixes
2024-04-20 11:48:41 -07:00
Ishaan Jaff
7ebf2ca4d9
(ci/cd) testing with team_id and /user/new
2024-04-20 11:09:34 -07:00
Krrish Dholakia
e5b98814ad
test(test_proxy_server.py): fix test to use valid redis host
2024-04-19 19:22:24 -07:00
Ishaan Jaff
00a07a99cd
fix - backend logic for non admin flow
2024-04-19 17:36:29 -07:00
Krish Dholakia
cc7f27252b
Merge pull request #3179 from BerriAI/litellm_dual_caching_fix
...
fix(caching.py): dual cache async_batch_get_cache fix + testing
2024-04-19 17:02:47 -07:00
Ishaan Jaff
def5ce4888
fix user needs to opt in to langfuse tags
2024-04-19 16:13:14 -07:00
Ishaan Jaff
48e5de4f0e
Merge pull request #3175 from BerriAI/litellm_log_team_alias_langfuse
...
[FEAT] Log team alias to langfuse
2024-04-19 15:30:59 -07:00
Krrish Dholakia
01a1a8f731
fix(caching.py): dual cache async_batch_get_cache fix + testing
...
this fixes a bug in usage-based-routing-v2 which was caused b/c of how the result was being returned from dual cache async_batch_get_cache. it also adds unit testing for that function (and it's sync equivalent)
2024-04-19 15:03:25 -07:00
Krrish Dholakia
3c6b6355c7
fix(ollama_chat.py): accept api key as a param for ollama calls
...
allows user to call hosted ollama endpoint using bearer token for auth
2024-04-19 13:02:13 -07:00
Ishaan Jaff
423121ff7d
feat - track team_alias is metadata for /chat, /embeddings
2024-04-19 10:52:54 -07:00
Ishaan Jaff
6d92b13c22
feat - log team_alias to langfuse
2024-04-19 10:29:42 -07:00
Ishaan Jaff
6f948cd559
fix - show api_base in hanging requests
2024-04-18 21:01:26 -07:00
Ishaan Jaff
f04604910b
fix - show api base on hanging requests
2024-04-18 20:57:22 -07:00
Ishaan Jaff
e95e7fde65
ui - new build
2024-04-18 20:18:50 -07:00
Ishaan Jaff
554c83fdaf
ui - show all alert types when getting all callbacks
2024-04-18 20:08:13 -07:00
Krish Dholakia
77a353d484
Merge pull request #3144 from BerriAI/litellm_prometheus_latency_tracking
...
feat(prometheus_services.py): emit proxy latency for successful llm api requests
2024-04-18 19:10:58 -07:00
Ishaan Jaff
d9091dcf97
fix order by spend
2024-04-18 17:33:38 -07:00