Commit graph

1591 commits

Author
SHA1 Message Date
Ishaan Jaff
eb882bb103 feat - cancel tasks when fast api request is cancelled 2024-05-14 19:58:51 -07:00
Ishaan Jaff
2a2791cab5 fix - return success spend reset 2024-05-14 11:38:39 -07:00
Ishaan Jaff
9fded05246 fix security for global_spend_reset 2024-05-14 11:04:50 -07:00
Ishaan Jaff
84629ef78a feat - reset spend per team, api_key 2024-05-13 18:23:23 -07:00
Ishaan Jaff
8e7f2e93af backend - show spend per tag by time 2024-05-13 17:13:48 -07:00
Ishaan Jaff
255c39a7c8 feat - /spend/report endpoint 2024-05-13 15:01:02 -07:00
Ishaan Jaff
d512e81a34 fix - show team based spend reports 2024-05-13 13:56:48 -07:00
Ishaan Jaff
471cb148cb Merge pull request #3609 from BerriAI/litellm_send_daily_spend_report
[Feat] send weekly spend reports by Team/Tag
2024-05-13 12:45:37 -07:00
Ishaan Jaff
9da8443fd7 fix - only schedule spend alerting when db is not none 2024-05-13 12:30:54 -07:00
Ishaan Jaff
77191ff98a test - weekly / monthly spend report alerts on /health/services 2024-05-13 10:50:26 -07:00
Ishaan Jaff
fa94632bc2 schedule weekly/monthly spend reports 2024-05-13 10:44:19 -07:00
Ishaan Jaff
09c064c94c feat - show monthly spend reports 2024-05-13 10:10:44 -07:00
Ishaan Jaff
63e4176502 feat - _get_weekly_spend_reports 2024-05-13 09:26:51 -07:00
Krrish Dholakia
5f5fdb439b fix(proxy_server.py): return 'allowed-model-region' in headers 2024-05-13 08:48:16 -07:00
Ishaan Jaff
e82c00622e Merge pull request #3587 from BerriAI/litellm_proxy_use_batch_completions_model_csv
[Feat] Use csv values for proxy batch completions (OpenAI Python compatible)
2024-05-13 07:55:12 -07:00
Krrish Dholakia
2e89f56507 fix(proxy_server.py): linting fix 2024-05-11 22:05:01 -07:00
Ishaan Jaff
a844423e00 fix - use csv list for batch completions 2024-05-11 14:24:48 -07:00
Ishaan Jaff
b71f35de72 Merge pull request #3585 from BerriAI/litellm_router_batch_comp
[Litellm Proxy + litellm.Router] - Pass the same message/prompt to N models
2024-05-11 13:51:45 -07:00
Ishaan Jaff
c49033a687 support batch /chat/completions on proxy 2024-05-11 13:24:25 -07:00
Krish Dholakia
30117317a4 Merge branch 'main' into feat/add-azure-content-filter 2024-05-11 09:30:38 -07:00
Krrish Dholakia
f8db5414a7 fix(proxy_server.py): check + get end-user obj even for master key calls
fixes issue where region-based routing wasn't working for end-users if master key was given
2024-05-10 16:54:51 -07:00
Ishaan Jaff
3b2e295337 fix - proxy_server.py 2024-05-10 13:47:35 -07:00
Nick Wong
a1821428d1 added changes from upstream
Merge branch 'main' into fix/error-on-get-user-role
2024-05-09 16:14:14 -07:00
Nick Wong
ea0856408b removed extra default dict return, which causes error if user_role is a string 2024-05-09 16:13:26 -07:00
Krrish Dholakia
8bb710b9c1 feat(proxy_server.py): expose new /team/list endpoint
Closes https://github.com/BerriAI/litellm/issues/3523
2024-05-09 13:21:00 -07:00
Krrish Dholakia
29ef2261f9 fix(auth_checks.py): fix 'get_end_user_object'
await cache get
2024-05-09 13:05:56 -07:00
Krish Dholakia
64ca2fde53 Merge branch 'main' into litellm_region_based_routing 2024-05-08 22:19:51 -07:00
Krrish Dholakia
0ea8222508 feat(router.py): enable filtering model group by 'allowed_model_region' 2024-05-08 22:10:17 -07:00
Krrish Dholakia
7c99808529 feat(proxy_server.py): add CRUD endpoints for 'end_user' management
allow admin to specify region + default models for end users
2024-05-08 18:50:36 -07:00
Ishaan Jaff
eeece6e90b Merge branch 'main' into ui_show_spend_end_user 2024-05-08 18:29:25 -07:00
Ishaan Jaff
41fd17dc3d feat - get price by end_user 2024-05-08 18:19:27 -07:00
Ishaan Jaff
bfa693e829 fix /global/spend/end_users 2024-05-08 17:03:38 -07:00
Krrish Dholakia
5f93cae3ff feat(proxy_server.py): return litellm version in response headers 2024-05-08 16:00:08 -07:00
Krish Dholakia
269d6cb16f Merge pull request #3437 from msabramo/add-engines-model-chat-completions-endpoint
Add `/engines/{model}/chat/completions` endpoint
2024-05-08 14:30:39 -07:00
Krish Dholakia
1eb75273cf Merge branch 'main' into litellm_ui_fixes_6 2024-05-07 22:01:04 -07:00
Krrish Dholakia
ae442f895b feat(ui/model_dashboard.tsx): show if model is config or db model 2024-05-07 21:34:18 -07:00
Krrish Dholakia
b9ec7acb08 feat(model_dashboard.tsx): allow user to edit input cost per token for model on ui
also contains fixes for `/model/update`
2024-05-07 20:57:21 -07:00
Krrish Dholakia
1882ee1c4c feat(ui/model_dashboard.tsx): show if model is config or db model 2024-05-07 18:29:14 -07:00
Krish Dholakia
8b87500620 Merge pull request #3506 from BerriAI/litellm_reintegrate_langfuse_url_slack_alert
feat(slack_alerting.py): reintegrate langfuse trace url for slack alerts
2024-05-07 15:03:29 -07:00
Krrish Dholakia
17a7d4f602 fix(proxy_server.py): return budget duration in user response object 2024-05-07 13:47:32 -07:00
Krrish Dholakia
f503929918 fix(proxy_server.py): fix /v1/models bug where it would return empty list
handle 'all-team-models' being set for a given key
2024-05-07 13:43:15 -07:00
Krrish Dholakia
e5c57cdddb feat(slack_alerting.py): reintegrate langfuse trace url for slack alerts
this ensures langfuse trace url returned in llm api exception err
2024-05-07 12:58:49 -07:00
Ishaan Jaff
8608795fc5 fix don't let slack alert block /model/new 2024-05-06 20:47:29 -07:00
Ishaan Jaff
43cd3f05d2 fix - /model/new 2024-05-06 20:45:17 -07:00
Krish Dholakia
185581d5a9 Merge branch 'main' into litellm_slack_daily_reports 2024-05-06 19:31:20 -07:00
Krrish Dholakia
3cada96f22 refactor(proxy_server.py): show ttl's on a top-level enum
Addresses - https://github.com/BerriAI/litellm/issues/2649#issuecomment-2097203372
2024-05-06 18:43:42 -07:00
Krrish Dholakia
2170e1723d feat(proxy_server.py): schedule slack daily report if enabled
if user enabled daily_reports, send them a slack report every 12 hours
2024-05-06 18:25:48 -07:00
Ishaan Jaff
881313f335 feat - send alert on adding new model 2024-05-06 15:45:07 -07:00
Ishaan Jaff
e5488a94dd fix - add better debugging on num_callbacks test 2024-05-06 13:42:20 -07:00
Ishaan Jaff
5ffe9a8008 fix - select startTime and endTime on UI 2024-05-03 21:20:19 -07:00