Commit graph

776 commits

Author SHA1 Message Date
Ishaan Jaff
8a46b50724 Merge pull request #1933 from BerriAI/litellm_team_specific_models
[FEAT] Proxy - set team specific models
2024-02-10 21:40:13 -08:00
ishaan-jaff
2b683f18e8 (feat)team-specific models 2024-02-10 21:13:39 -08:00
Krrish Dholakia
e1156c99d9 fix(proxy_server.py): support list as well as direct input for custom callbacks 2024-02-10 20:38:09 -08:00
Krrish Dholakia
75d474819e fix(proxy_server.py): fix import 2024-02-10 20:31:54 -08:00
Krrish Dholakia
c1a60176a5 feat(proxy_server.py): support for pii masking with microsoft presidio 2024-02-10 20:21:12 -08:00
ishaan-jaff
8316820112 (fix) undo today's mistake where bedrock was async 2024-02-10 18:32:29 -08:00
Krrish Dholakia
ba06132d56 fix(proxy_server.py): don't change old /key/info endpoint 2024-02-10 17:31:23 -08:00
Krrish Dholakia
8d9ed83389 feat(ui): show key alias instead of hashed token 2024-02-10 15:45:42 -08:00
Krrish Dholakia
31e8fbef54 feat(proxy_server.py): support key lists for /key/info 2024-02-10 14:24:12 -08:00
ishaan-jaff
5fcdc95279 (fix) proxy - use bedrock in sync streaming 2024-02-10 09:47:33 -08:00
Krrish Dholakia
b1ae8e40c5 fix(ui): export ui npm build 2024-02-10 08:43:41 -08:00
Krish Dholakia
8d51888d15 Merge branch 'main' into litellm_ui_improvements_time_series 2024-02-09 20:54:36 -08:00
Krrish Dholakia
8950ae2240 fix(usage.tsx): show top users and top keys driving spend 2024-02-09 19:50:07 -08:00
Krrish Dholakia
610d039b18 fix(proxy_server.py): fix /spend/logs endpoint 2024-02-09 18:11:33 -08:00
ishaan-jaff
da4441b1fc (feat) support azure deployments for embeddings 2024-02-09 16:47:01 -08:00
Krrish Dholakia
ecbcb45d7a feat(proxy_server.py): show admin global spend as time series data 2024-02-09 16:31:35 -08:00
Krrish Dholakia
11db12c023 fix: fixes 2024-02-08 21:54:48 -08:00
Krrish Dholakia
e4c8a6684e fix(proxy_server.py): enable aggregate queries via /spend/keys 2024-02-08 20:29:08 -08:00
ishaan-jaff
eb95a1c1ef (fix) ui /login without sso 2024-02-08 10:47:45 -08:00
ishaan-jaff
d55f75cc46 (feat) slack alerting when track callback fails 2024-02-07 20:09:28 -08:00
ishaan-jaff
e8532d4285 (feat) alert for failing cost tracking 2024-02-07 18:49:45 -08:00
Krrish Dholakia
5b8aa14f9b fix(proxy_server.py): check if prisma client is set before scheduling reset budget 2024-02-07 18:14:37 -08:00
Krrish Dholakia
605cfecf27 fix(proxy_server.py): fix merge errors 2024-02-07 00:04:52 -08:00
Krrish Dholakia
cf588a97fe refactor(proxy_server.py): fix merge error 2024-02-06 23:44:23 -08:00
Krish Dholakia
f5eca2f4ad Merge branch 'main' into litellm_spend_logging_high_traffic 2024-02-06 23:36:58 -08:00
Krrish Dholakia
c7b3c3d43f fix(proxy_server.py): update user cache to with new spend 2024-02-06 23:06:05 -08:00
Krrish Dholakia
1c78da4100 fix(proxy_server.py): fix endpoint 2024-02-06 22:09:30 -08:00
Krish Dholakia
8960031eae Merge pull request #1843 from BerriAI/litellm_admin_ui_view_all_keys
feat(ui): enable admin to view all valid keys created on the proxy
2024-02-06 22:06:46 -08:00
Krrish Dholakia
c43baff5c9 fix: export npm build into proxy 2024-02-06 20:12:50 -08:00
Krrish Dholakia
c429dfa746 fix(proxy_cli.py-&&-proxy_server.py): bump reset budget intervals and fix pool limits for prisma connections 2024-02-06 19:39:49 -08:00
ishaan-jaff
80f24fc29c (fix) allow litellm_settings to be None 2024-02-06 19:29:39 -08:00
Krrish Dholakia
c28d72ca8d fix(proxy_server.py): prisma client fixes for high traffic 2024-02-06 17:30:36 -08:00
Ishaan Jaff
2a89ff18e2 Merge pull request #1859 from BerriAI/litellm_allow_using_budgets_without_keys
[Feat] Budgets for 'user' param passed to /chat/completions, /embeddings etc
2024-02-06 16:32:25 -08:00
ishaan-jaff
4de77018cc (feat) support max_user_budget 2024-02-06 15:19:36 -08:00
Krish Dholakia
c68764005d Merge branch 'main' into litellm_admin_ui_view_all_keys 2024-02-06 14:34:57 -08:00
Krish Dholakia
1fc2c74877 Merge branch 'main' into litellm_vertex_ai_streaming_fix 2024-02-06 14:33:54 -08:00
ishaan-jaff
9bf57170e7 (feat) show semantic-cache on health/readiness 2024-02-06 13:58:20 -08:00
ishaan-jaff
ab4e7f2be9 (feat) show semantic-cache on health/readiness 2024-02-06 13:35:34 -08:00
ishaan-jaff
bb4889f04d (feat) redis-semantic cache on proxy 2024-02-06 13:32:03 -08:00
ishaan-jaff
f8472fe3cf (feat) redis-semantic cache on proxy 2024-02-06 13:30:59 -08:00
ishaan-jaff
48be4a2695 (feat) redis-semantic cache on proxy 2024-02-06 13:30:31 -08:00
ishaan-jaff
1b7cd40ab4 (feat) upperbound_key_generate_params 2024-02-06 13:17:57 -08:00
Krrish Dholakia
97514b6bed fix(proxy_server.py): do a health check on db before returning if proxy ready (if db connected) 2024-02-06 12:57:05 -08:00
Ishaan Jaff
e9c42eb4ae Merge branch 'main' into litellm_add_semantic_cache 2024-02-06 11:18:43 -08:00
ishaan-jaff
3b869d7cd1 (feat) redis-semantic cache on proxy 2024-02-06 10:35:21 -08:00
Krrish Dholakia
32639bf398 fix(utils.py): return finish reason for last vertex ai chunk 2024-02-06 09:21:03 -08:00
Krrish Dholakia
6939cf5ead fix(proxy_server.py): update dynamodb load config 2024-02-06 08:18:17 -08:00
ishaan-jaff
0ca4f962d9 (feat) upperbound_key_generate_params 2024-02-05 22:38:47 -08:00
Krrish Dholakia
98b3978998 feat(ui): enable admin to view all valid keys created on the proxy 2024-02-05 19:28:57 -08:00
ishaan-jaff
dadfef7eac (fix) make sure route is str 2024-02-05 16:22:36 -08:00