litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-27 03:34:10 +00:00

Author	SHA1	Message	Date
Ishaan Jaff	ac71a89b0e	ui - new build	2024-05-15 21:04:09 -07:00
Ishaan Jaff	370049c549	Merge pull request #3661 from BerriAI/litellm_enable_alerts_completions_endpoint [FIX] raise alerts for exceptions on `/completions` endpoint	2024-05-15 20:46:02 -07:00
Krish Dholakia	25e4b34574	Merge pull request #3660 from BerriAI/litellm_proxy_ui_general_settings feat(proxy_server.py): Enabling Admin to control general settings on proxy ui	2024-05-15 20:36:42 -07:00
Krrish Dholakia	594ca947c8	fix(parallel_request_limiter.py): fix max parallel request limiter on retries	2024-05-15 20:16:11 -07:00
Ishaan Jaff	8e3aa14287	fix revert 3600	2024-05-15 19:40:34 -07:00
Krrish Dholakia	153ce0d085	build(ui/general_settings.tsx): support updating global max parallel requests on the ui	2024-05-15 19:26:57 -07:00
Marc Abramowitz	ddf77656fb	Allow non-admins to use OpenAI routes The problem we were having is non-admin users trying to use `/engines/{model}/chat/completions` were getting an HTTP 401 error. ```shell $ curl -sSL 'http://localhost:4000/engines/gpt-35-turbo-0125/chat/completions' \ --header "Authorization: Bearer ${LITELLM_KEY}" \ --header 'Content-Type: application/json' \ --data ' { "model": "gpt-35-turbo-0125", "messages": [ { "role": "user", "content": "Write a poem about LiteLLM" } ] }' \ \| jq '.' { "error": { "message": "Authentication Error, Only proxy admin can be used to generate, delete, update info for new keys/users/teams. Route=/engines/gpt-35-turbo-0125/chat/completions. Your role=unknown. Your user_id=someone@company.com", "type": "auth_error", "param": "None", "code": 401 } } ``` This seems to be related to code in `user_api_key_auth` that checks that the URL matches a list of routes that are allowed for non-admin users, where the list of routes is in `LiteLLMRoutes.openai_routes.value`. The problem is that the route `/engines/{model}/chat/completions` is not in that list and furthermore, that wouldn't even work if it were, because the comparison is done with `request.url.path` and that will have the actual model name in it (e.g.: `gpt-35-turbo-0125`), rather than `{model}`. I added a new list `LiteLLMRoutes.openai_route_names` and added the route names to that list. Then I added a check in `user_api_key_auth` to see if the route name is in the list of route names.	2024-05-15 17:36:11 -07:00
Ishaan Jaff	89dd1ffee8	fix deep copy logic in during_call_hook	2024-05-15 17:34:54 -07:00
Ishaan Jaff	1262c5cb39	fix - /organization/new logic	2024-05-15 17:27:46 -07:00
Ishaan Jaff	9f43761000	fix - show alerts on /completions endpoint	2024-05-15 15:45:21 -07:00
Krrish Dholakia	6a357b4275	feat(proxy_server.py): Enabling Admin to control general settings on proxy ui	2024-05-15 15:26:57 -07:00
Ishaan Jaff	c00d232900	Merge pull request #3655 from BerriAI/litellm_only_show_debug_info_server_side Fix Proxy Server - only show API base, Model server log exceptions, not on client side	2024-05-15 14:20:14 -07:00
Ishaan Jaff	1a11b151b7	fix - show litellm debug info in errors	2024-05-15 14:18:12 -07:00
Ishaan Jaff	f6e342303e	show debug info on slack alerts	2024-05-15 13:23:44 -07:00
Ishaan Jaff	74f093bb4b	Merge pull request #3653 from BerriAI/litellm_fix_text_completions [Fix] - Alerting on `/completions` - don't raise hanging request alert for /completions	2024-05-15 11:28:55 -07:00
Krrish Dholakia	dba713ea43	fix(router.py): add validation for how router fallbacks are setup prevent user errors	2024-05-15 10:44:16 -07:00
Ishaan Jaff	2e81347607	fix - don't raise hanging request alert for /completions	2024-05-15 10:27:02 -07:00
Krrish Dholakia	5dcf3d672c	feat(proxy_server.py): new `/end_user/info` endpoint get spend for a specific end-user	2024-05-15 09:50:52 -07:00
Krrish Dholakia	51a02de4cf	refactor(proxy_server.py): update doc string for /user/update	2024-05-15 08:25:14 -07:00
Krrish Dholakia	8117af664c	fix(huggingface_restapi.py): fix task extraction from model name	2024-05-15 07:28:19 -07:00
Ishaan Jaff	e7af8d61cd	fix check_request_disconnected = None case	2024-05-14 20:38:32 -07:00
Ishaan Jaff	4466982507	feat - cancel tasks when fast api request is cancelled	2024-05-14 19:58:51 -07:00
Krrish Dholakia	9eee2f3889	docs(prod.md): add 'disable load_dotenv' tutorial to docs	2024-05-14 19:13:22 -07:00
Krrish Dholakia	1ab4974773	fix: disable 'load_dotenv' for prod environments	2024-05-14 19:09:36 -07:00
Krrish Dholakia	1db1af1154	fix(types): fix typing	2024-05-14 17:09:36 -07:00
Krrish Dholakia	888c53e774	fix(proxy/_types.py): fix linting errors	2024-05-14 17:02:11 -07:00
Ishaan Jaff	aa1615c757	Merge pull request #3626 from BerriAI/litellm_reset_spend_per_team_api_key feat - reset spend per team, api_key [Only Master Key]	2024-05-14 11:49:07 -07:00
Ishaan Jaff	787c02c8db	fix - return success spend reset	2024-05-14 11:38:39 -07:00
Ishaan Jaff	7e56e27226	fix security for global_spend_reset	2024-05-14 11:04:50 -07:00
Krish Dholakia	2c867ea9a5	Merge pull request #3600 from msabramo/msabramo/fix-pydantic-warnings Update pydantic code to fix warnings	2024-05-13 22:00:39 -07:00
Ishaan Jaff	3686789c36	feat - reset spend per team, api_key	2024-05-13 18:23:23 -07:00
Ishaan Jaff	f19e6724ee	ui - new build	2024-05-13 17:48:24 -07:00
Ishaan Jaff	6ea5a276f8	backend - show spend per tag by time	2024-05-13 17:13:48 -07:00
Krrish Dholakia	c7b3193944	fix(proxy/_types.py): allow jwt admin to access `/team/list` route	2024-05-13 16:07:31 -07:00
Ishaan Jaff	ea9b4dc439	Merge pull request #3619 from BerriAI/litellm_show_spend_reports [Feat] - `/global/spend/report`	2024-05-13 16:06:02 -07:00
Ishaan Jaff	eb2d6ba20a	ui - new build	2024-05-13 15:56:59 -07:00
Krrish Dholakia	b4a8665d11	fix(utils.py): fix custom pricing when litellm model != response obj model name	2024-05-13 15:25:35 -07:00
Ishaan Jaff	12cf9d71c7	feat - /spend/report endpoint	2024-05-13 15:01:02 -07:00
Ishaan Jaff	9d4b727913	fix - show team based spend reports	2024-05-13 13:56:48 -07:00
Ishaan Jaff	21845bc061	Merge pull request #3609 from BerriAI/litellm_send_daily_spend_report [Feat] send weekly spend reports by Team/Tag	2024-05-13 12:45:37 -07:00
Ishaan Jaff	40b2f33a80	fix - only schedule spend alerting when db is not none	2024-05-13 12:30:54 -07:00
Marc Abramowitz	1efa2e65ac	Merge branch 'main' into msabramo/fix-pydantic-warnings	2024-05-13 11:34:06 -07:00
Marc Abramowitz	73541b1f17	Merge branch 'msabramo/pydantic_replace_root_validator_with_model_validator' into msabramo/fix-pydantic-warnings	2024-05-13 11:25:55 -07:00
Ishaan Jaff	aac81c59b5	test - weekly / monthly spend report alerts on /health/services	2024-05-13 10:50:26 -07:00
Ishaan Jaff	197eb44832	fix scheduling spend reports	2024-05-13 10:45:22 -07:00
Ishaan Jaff	4a679bb640	schedule weekly/monthly spend reports	2024-05-13 10:44:19 -07:00
Marc Abramowitz	4ccdbfd67d	model_config = ConfigDict instead of model_config = dict	2024-05-13 10:38:01 -07:00
Marc Abramowitz	abd90e6296	Revert root_validator => model_validator change I have it in a separate PR: https://github.com/BerriAI/litellm/pull/3611	2024-05-13 10:33:04 -07:00
Marc Abramowitz	ad8c3ac2c3	Change pydantic root_validator to model_validator pydantic v1 uses `root_validator` and pydantic v2 uses `model_validator`. pydantic v2 emits a warning when `root_validator` is used. E.g.: ``` litellm/proxy/_types.py:225 /Users/abramowi/Code/OpenSource/litellm/litellm/proxy/_types.py:225: PydanticDeprecatedSince20: Pydantic V1 style `@root_validator` validators are deprecated. You should migrate to Pydantic V2 style `@model_validator` validators, see the migration guide for more details. Deprecated in Pydantic V2.0 to be removed in V3.0. See Pydantic V2 Migration Guide at https://errors.pydantic.dev/2.7/migration/ @root_validator(pre=True) ``` This change eliminates those warnings with pydantic v2, while retaining compatibility with pydantic v1. pydantic 2.7.1 before ``` $ env -i PATH=$PATH poetry run pytest litellm/tests/test_proxy_server.py ... litellm/proxy/_types.py:225 /Users/abramowi/Code/OpenSource/litellm/litellm/proxy/_types.py:225: PydanticDeprecatedSince20: Pydantic V1 style `@root_validator` validators are deprecated. You should migrate to Pydantic V2 style `@model_validator` validators, see the migration guide for more details. Deprecated in Pydantic V2.0 to be removed in V3.0. See Pydantic V2 Migration Guide at https://errors.pydantic.dev/2.7/migration/ @root_validator(pre=True) ... ========================== 10 passed, 2 skipped, 39 warnings in 8.67s =========================== ``` pydantic 2.7.1 after ``` $ env -i PATH=$PATH poetry run pytest litellm/tests/test_proxy_server.py ... ========================== 10 passed, 2 skipped, 27 warnings in 9.85s =========================== ``` pydantic 1.10.5 after ``` $ poetry run pip install 'pydantic<2' ... Successfully installed pydantic-1.10.15 $ env -i PATH=$PATH poetry run pytest litellm/tests/test_proxy_server.py ... =========================== 10 passed, 2 skipped, 1 warning in 8.13s ============================ ```	2024-05-13 10:23:44 -07:00
Ishaan Jaff	2cd584a3ad	feat - send_monthly_spend_report	2024-05-13 10:17:40 -07:00

1 2 3 4 5 ...

2190 commits