Commit graph

1842 commits

Author SHA1 Message Date
Krish Dholakia
cc7f27252b
Merge pull request #3179 from BerriAI/litellm_dual_caching_fix
fix(caching.py): dual cache async_batch_get_cache fix + testing
2024-04-19 17:02:47 -07:00
Ishaan Jaff
def5ce4888 fix user needs to opt in to langfuse tags 2024-04-19 16:13:14 -07:00
Ishaan Jaff
48e5de4f0e
Merge pull request #3175 from BerriAI/litellm_log_team_alias_langfuse
[FEAT] Log team alias to langfuse
2024-04-19 15:30:59 -07:00
Krrish Dholakia
01a1a8f731 fix(caching.py): dual cache async_batch_get_cache fix + testing
This fixes a bug in usage-based-routing-v2 caused by how the result was returned from the dual cache's async_batch_get_cache. It also adds unit tests for that function (and its sync equivalent).
2024-04-19 15:03:25 -07:00
Krrish Dholakia
3c6b6355c7 fix(ollama_chat.py): accept api key as a param for ollama calls
Allows users to call a hosted ollama endpoint using a bearer token for auth.
2024-04-19 13:02:13 -07:00
Ishaan Jaff
423121ff7d feat - track team_alias in metadata for /chat, /embeddings 2024-04-19 10:52:54 -07:00
Ishaan Jaff
6d92b13c22 feat - log team_alias to langfuse 2024-04-19 10:29:42 -07:00
Ishaan Jaff
6f948cd559 fix - show api_base in hanging requests 2024-04-18 21:01:26 -07:00
Ishaan Jaff
f04604910b fix - show api base on hanging requests 2024-04-18 20:57:22 -07:00
Ishaan Jaff
e95e7fde65 ui - new build 2024-04-18 20:18:50 -07:00
Ishaan Jaff
554c83fdaf ui - show all alert types when getting all callbacks 2024-04-18 20:08:13 -07:00
Krish Dholakia
77a353d484
Merge pull request #3144 from BerriAI/litellm_prometheus_latency_tracking
feat(prometheus_services.py): emit proxy latency for successful llm api requests
2024-04-18 19:10:58 -07:00
Ishaan Jaff
d9091dcf97 fix order by spend 2024-04-18 17:33:38 -07:00
Ishaan Jaff
b669e2987b fix return key aliases on /user/info 2024-04-18 17:16:52 -07:00
Krrish Dholakia
919a2876f1 fix(proxy/utils.py): add prometheus failed db request tracking 2024-04-18 16:30:29 -07:00
Krrish Dholakia
d61250109e fix(proxy/utils.py): add call type and duration to proxy_logging failure calls
this is for tracking failed db requests on prometheus
2024-04-18 16:24:36 -07:00
Ishaan Jaff
eb04a929e6
Merge pull request #3112 from BerriAI/litellm_add_alert_types
[Feat] Allow user to select slack alert types to Opt In to
2024-04-18 16:21:33 -07:00
Ishaan Jaff
8958bbeac9
Merge pull request #3142 from BerriAI/litellm_slack_alerting_show_model_passed
[Fix] Show `model` passed on `"400: {'error': 'Invalid model name passed in mode` errors 👻
2024-04-18 16:18:17 -07:00
Krrish Dholakia
0f95a824c4 feat(prometheus_services.py): emit proxy latency for successful llm api requests
uses prometheus histogram for this
2024-04-18 16:04:35 -07:00
Ishaan Jaff
6cfa80c42a
Merge pull request #3140 from BerriAI/litellm_trim_messages
[Fix] Slack Alerting - trim messages to first 100 chars
2024-04-18 16:01:39 -07:00
Ishaan Jaff
b308f8c079 fix - show model passed in on Invalid model name passed in error 2024-04-18 15:43:30 -07:00
Ishaan Jaff
e20b05d6dd fix trim messages to first 100 chars 2024-04-18 15:21:31 -07:00
Krrish Dholakia
28edb77350 fix(utils.py): support prometheus failed call metrics 2024-04-18 12:29:15 -07:00
Ishaan Jaff
1cda0db2ca fix - test alerting 2024-04-18 11:40:40 -07:00
Ishaan Jaff
5e0dc57329 ui - set alert_types 2024-04-18 11:13:05 -07:00
Ishaan Jaff
a8e2ef79ef add alert_types to config.yaml 2024-04-18 08:51:07 -07:00
Krish Dholakia
91fe668411
Merge pull request #3105 from BerriAI/litellm_fix_hashing
fix(_types.py): hash api key in UserAPIKeyAuth
2024-04-18 08:16:24 -07:00
Ishaan Jaff
beeee01199 feat return alert types on /config/get/callback 2024-04-17 21:02:10 -07:00
Ishaan Jaff
3e8e0beb25
Merge pull request #3114 from BerriAI/litellm_fix_alerting_users
[FIX] Repeat Slack Alerts triggered for "User Crossed Budget"
2024-04-17 20:38:27 -07:00
Ishaan Jaff
9a5fd07f16 fix - user based alerting 2024-04-17 20:35:29 -07:00
Krrish Dholakia
18e3cf8bff fix(utils.py): support azure mistral function calling 2024-04-17 19:10:26 -07:00
Ishaan Jaff
c3dcbf2067 ui - new build 2024-04-17 18:22:23 -07:00
Ishaan Jaff
cd8d3d4788 ui - new build 2024-04-17 18:20:45 -07:00
Ishaan Jaff
7178da3144 bump: version 1.35.10 → 1.35.11 2024-04-17 18:20:31 -07:00
Ishaan Jaff
52d7fc22bb v0 add types of alerts to slack alerting 2024-04-17 18:16:19 -07:00
Ishaan Jaff
12a01ba096 litellm_add_proxy_base_url in slack alerts 2024-04-17 17:42:28 -07:00
Krrish Dholakia
bafb008b44 fix(_types.py): hash api key, if given
fixes https://github.com/BerriAI/litellm/issues/2114
2024-04-17 16:43:35 -07:00
Ishaan Jaff
336de9eeeb ui - don't include model/metrics in schema 2024-04-17 15:30:06 -07:00
Krrish Dholakia
2ffd057042 test(test_models.py): ensure only admin can call /health 2024-04-16 18:13:40 -07:00
Krrish Dholakia
dd0fddb2e1 build(ui): add ui updates 2024-04-16 17:56:35 -07:00
Krrish Dholakia
f4b595ce71 fix(utils.py): return vertex api base for request hanging alerts 2024-04-16 17:53:28 -07:00
Krish Dholakia
1cdf733341
Merge pull request #3081 from BerriAI/litellm_ui_fixes_2
build(ui): view_key_table.tsx
2024-04-16 17:19:41 -07:00
Krrish Dholakia
6dbd73b619 build(ui): view_key_table.tsx
show unmapped keys in default team
2024-04-16 17:16:08 -07:00
Ishaan Jaff
7e99854d05
Merge pull request #3079 from BerriAI/litellm_router_save_settings_ui
UI - Save / Edit router settings UI
2024-04-16 16:57:42 -07:00
Krrish Dholakia
fed8d61933 fix(_types.py): add /v2/ routes to info 2024-04-16 16:25:24 -07:00
Krish Dholakia
8837b2d51b
Merge pull request #3077 from BerriAI/litellm_delete_deployment_fix
fix(proxy_server.py): ensure id used in delete deployment matches id used in litellm Router
2024-04-16 15:48:43 -07:00
Krrish Dholakia
23c2c5d3c7 fix(proxy_server.py): allow non-admins to access openai routes 2024-04-16 15:40:36 -07:00
Ishaan Jaff
ea830eeebd ui - new build 2024-04-16 15:38:07 -07:00
Ishaan Jaff
59b154f152 feat - update router settings on Admin UI 2024-04-16 15:36:26 -07:00
Krrish Dholakia
13cd252f3e fix(proxy_server.py): ensure id used in delete deployment matches id used in litellm Router 2024-04-16 15:17:18 -07:00