Commit graph

317 commits

Author SHA1 Message Date
Krish Dholakia
33ead69c0a
Support checking provider /models endpoints on proxy /v1/models endpoint (#9958)
* feat(utils.py): support global flag for 'check_provider_endpoints'

enables setting this for `/models` on proxy

* feat(utils.py): add caching to 'get_valid_models'

Prevents checking endpoint repeatedly

* fix(utils.py): ensure mutations don't impact cached results

* test(test_utils.py): add unit test to confirm cache invalidation logic

* feat(utils.py): get_valid_models - support passing litellm params dynamically

Allows for checking endpoints based on received credentials

* test: update test

* feat(model_checks.py): pass router credentials to get_valid_models - ensures it checks correct credentials

* refactor(utils.py): refactor for simpler functions

* fix: fix linting errors

* fix(utils.py): fix test

* fix(utils.py): set valid providers to custom_llm_provider, if given

* test: update test

* fix: fix ruff check error
2025-04-14 23:23:20 -07:00
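The caching work above (cache `get_valid_models`, keep caller mutations from leaking back into the cache) follows a common pattern. Below is a minimal sketch, assuming a TTL cache keyed per provider/credential; `fetch_models_from_provider` and the key scheme are hypothetical stand-ins, not LiteLLM's actual code:

```python
import time
from copy import deepcopy

# Hypothetical illustration of caching a provider /models lookup.
_MODEL_CACHE: dict[str, tuple[float, list[str]]] = {}
_CACHE_TTL_SECONDS = 300


def fetch_models_from_provider(provider: str, api_key: str) -> list[str]:
    """Placeholder for a real call to the provider's /models endpoint."""
    raise NotImplementedError


def get_valid_models_cached(provider: str, api_key: str) -> list[str]:
    # Hash the credential so the raw key isn't used as a dict key.
    cache_key = f"{provider}:{hash(api_key)}"
    now = time.time()

    hit = _MODEL_CACHE.get(cache_key)
    if hit is not None and now - hit[0] < _CACHE_TTL_SECONDS:
        # Return a copy so callers mutating the list don't corrupt the cache.
        return deepcopy(hit[1])

    models = fetch_models_from_provider(provider, api_key)
    _MODEL_CACHE[cache_key] = (now, models)
    return deepcopy(models)
```

Returning a copy is what makes the "mutations don't impact cached results" fix hold: callers can append or filter freely without corrupting later lookups.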
Krish Dholakia
00e49380df
Litellm UI qa 04 12 2025 p1 (#9955)
* fix(model_info_view.tsx): cleanup text

* fix(key_management_endpoints.py): fix filtering litellm-dashboard keys for internal users

* fix(proxy_track_cost_callback.py): prevent flooding spend logs with admin endpoint errors

* test: add unit testing for logic

* test(test_auth_exception_handler.py): add more unit testing

* fix(router.py): correctly handle retrieving model info on get_model_group_info

fixes issue where model hub was showing None prices

* fix: fix linting errors
2025-04-12 19:30:48 -07:00
Krish Dholakia
8ee32291e0
Squashed commit of the following: (#9709)
commit b12a9892b7
Author: Krrish Dholakia <krrishdholakia@gmail.com>
Date:   Wed Apr 2 08:09:56 2025 -0700

    fix(utils.py): don't modify openai_token_counter

commit 294de31803
Author: Krrish Dholakia <krrishdholakia@gmail.com>
Date:   Mon Mar 24 21:22:40 2025 -0700

    fix: fix linting error

commit cb6e9fbe40
Author: Krrish Dholakia <krrishdholakia@gmail.com>
Date:   Mon Mar 24 19:52:45 2025 -0700

    refactor: complete migration

commit bfc159172d
Author: Krrish Dholakia <krrishdholakia@gmail.com>
Date:   Mon Mar 24 19:09:59 2025 -0700

    refactor: refactor more constants

commit 43ffb6a558
Author: Krrish Dholakia <krrishdholakia@gmail.com>
Date:   Mon Mar 24 18:45:24 2025 -0700

    fix: test

commit 04dbe4310c
Author: Krrish Dholakia <krrishdholakia@gmail.com>
Date:   Mon Mar 24 18:28:58 2025 -0700

    refactor: move more constants into constants.py

commit 3c26284aff
Author: Krrish Dholakia <krrishdholakia@gmail.com>
Date:   Mon Mar 24 18:14:46 2025 -0700

    refactor: migrate hardcoded constants out of __init__.py

commit c11e0de69d
Author: Krrish Dholakia <krrishdholakia@gmail.com>
Date:   Mon Mar 24 18:11:21 2025 -0700

    build: migrate all constants into constants.py

commit 7882bdc787
Author: Krrish Dholakia <krrishdholakia@gmail.com>
Date:   Mon Mar 24 18:07:37 2025 -0700

    build: initial test banning hardcoded numbers in repo
2025-04-02 21:24:54 -07:00
Ishaan Jaff
01bd418e0b remove deprecated logic 2025-04-01 07:55:28 -07:00
Ishaan Jaff
f8ce30763e fix check_if_token_is_service_account 2025-04-01 07:36:35 -07:00
Krish Dholakia
9b7ebb6a7d
build(pyproject.toml): add new dev dependencies - for type checking (#9631)
* build(pyproject.toml): add new dev dependencies - for type checking

* build: reformat files to fit black

* ci: reformat to fit black

* ci(test-litellm.yml): make tests run clear

* build(pyproject.toml): add ruff

* fix: fix ruff checks

* build(mypy/): fix mypy linting errors

* fix(hashicorp_secret_manager.py): fix passing cert for tls auth

* build(mypy/): resolve all mypy errors

* test: update test

* fix: fix black formatting

* build(pre-commit-config.yaml): use poetry run black

* fix(proxy_server.py): fix linting error

* fix: fix ruff safe representation error
2025-03-29 11:02:13 -07:00
Ishaan Jaff
7142b0b610 refactor PrismaDBExceptionHandler 2025-03-26 19:22:24 -07:00
Ishaan Jaff
23aa7f81b5 fix ProxyException 2025-03-26 15:53:33 -07:00
Ishaan Jaff
ff33ed020c fix auth checks 2025-03-26 15:45:58 -07:00
Ishaan Jaff
8bd2081dec fix get_key_object 2025-03-26 15:41:40 -07:00
Ishaan Jaff
f8caebc54b is_database_connection_error 2025-03-26 14:55:40 -07:00
Ishaan Jaff
ce49e27217 fixes for auth checks 2025-03-25 15:44:13 -07:00
Ishaan Jaff
4c87084ff7 UserAPIKeyAuthExceptionHandler 2025-03-25 14:07:14 -07:00
Krrish Dholakia
f0d047a0e4 fix: fix linting error 2025-03-21 18:23:58 -07:00
Krrish Dholakia
8b0c75d88f fix: fix linting error 2025-03-21 18:08:00 -07:00
Krrish Dholakia
6034187aa5 fix: remove unused import 2025-03-21 17:25:57 -07:00
Krrish Dholakia
4b49dab717 fix: remove unused import 2025-03-21 17:22:58 -07:00
Krrish Dholakia
95ef5f1009 refactor(user_api_key_auth.py): move is_route_allowed to inside common_checks
ensures consistent behaviour inside api key + jwt routes
2025-03-21 17:21:07 -07:00
Ishaan Jaff
f5ef0c3cb7 fix code quality checks 2025-03-18 22:34:43 -07:00
Ishaan Jaff
de97cda445 refactor create_litellm_proxy_request_started_spen 2025-03-18 16:12:16 -07:00
Krrish Dholakia
26226d475f feat(proxy_server.py): support retrieving models for a team, if user is a member - via /models?team_id
Allows user to see team models on UI when creating a key
2025-03-14 19:34:06 -07:00
Steve Farthing
dbfb7ebdaf
Merge branch 'main' into stevefarthing/bing-search-pass-thru 2025-03-11 08:06:56 -04:00
Krish Dholakia
c93a5e2301
Merge pull request #9047 from BerriAI/litellm_dev_03_06_2025_p4
feat(handle_jwt.py): support multiple jwt url's
2025-03-10 22:37:35 -07:00
Ishaan Jaff
aa5ac6ba3d can_team_access_model 2025-03-10 20:03:19 -07:00
Ishaan Jaff
0d6df360bf test_can_team_access_model fix 2025-03-10 19:09:50 -07:00
Ishaan Jaff
9dcc25d63b Merge branch 'main' into litellm_fix_team_model_access_checks 2025-03-10 19:05:11 -07:00
Krish Dholakia
c58941d49c
Merge branch 'main' into litellm_dev_03_06_2025_p4 2025-03-10 18:41:10 -07:00
Krish Dholakia
5591354309
Support master key rotations (#9041)
* feat(key_management_endpoints.py): adding support for rotating master key

* feat(key_management_endpoints.py): support decryption-re-encryption of models in db, when master key rotated

* fix(user_api_key_auth.py): raise valid token is None error earlier

enables easier debugging with api key hash in error message

* feat(key_management_endpoints.py): rotate any env vars

* fix(key_management_endpoints.py): uncomment check

* fix: fix linting error
2025-03-06 23:13:30 -08:00
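The rotation commits describe decrypting stored model credentials and env vars with the old master key and re-encrypting them with the new one. A minimal sketch of that decrypt/re-encrypt pattern using Fernet symmetric keys; the helper and key handling are illustrative, not the proxy's implementation:

```python
from cryptography.fernet import Fernet

# Illustrative only: rotate every stored secret from an old master key to a
# new one by decrypting with the old key and re-encrypting with the new.
def rotate_encrypted_values(
    encrypted_values: dict[str, bytes], old_key: bytes, new_key: bytes
) -> dict[str, bytes]:
    old_cipher = Fernet(old_key)
    new_cipher = Fernet(new_key)
    return {
        name: new_cipher.encrypt(old_cipher.decrypt(token))
        for name, token in encrypted_values.items()
    }


# Example usage with freshly generated keys.
old_key, new_key = Fernet.generate_key(), Fernet.generate_key()
stored = {"AZURE_API_KEY": Fernet(old_key).encrypt(b"sk-example")}
rotated = rotate_encrypted_values(stored, old_key, new_key)
assert Fernet(new_key).decrypt(rotated["AZURE_API_KEY"]) == b"sk-example"
```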
Krrish Dholakia
805679becc feat(handle_jwt.py): support multiple jwt url's 2025-03-06 23:05:54 -08:00
Ishaan Jaff
8d6815ce98 Revert "(UI) - Security Improvement, move to JWT Auth for Admin UI Sessions (#8995)"
This reverts commit 01a44a4e47.
2025-03-05 08:49:20 -08:00
Ishaan Jaff
01a44a4e47
(UI) - Security Improvement, move to JWT Auth for Admin UI Sessions (#8995)
* (UI) - Improvements to session handling logic  (#8970)

* add cookieUtils

* use utils for clearing cookies

* on logout use clearTokenCookies

* ui use correct clearTokenCookies

* navbar show userEmail on UserID page

* add timestamp on token cookie

* update generate_authenticated_redirect_response

* use common getAuthToken

* fix clearTokenCookies

* fixes for get auth token

* fix invitation link sign in logic

* Revert "fix invitation link sign in logic"

This reverts commit 30e5308cb3.

* fix getAuthToken

* update setAuthToken

* fix ui session handling

* fix ui session handler

* bug fix stop generating LiteLLM Virtual keys for access

* working JWT insert into cookies

* use central place to build UI JWT token

* add _validate_ui_token

* fix ui session handler

* fix fetchWithCredentials

* check allowed routes for ui session tokens

* expose validate_session endpoint

* validate session endpoint

* call sso/session/validate

* getUISessionDetails

* ui move to getUISessionDetails

* /sso/session/validate

* fix cookie utils

* use getUISessionDetails

* use ui_session_id

* "/spend/logs/ui" in spend_tracking_routes

* working sign in JWT flow for proxy admin

* allow proxy admin to access ui routes

* use check_route_access

* update types

* update login method

* fixes to ui session handler

* working flow for admin and internal users

* fixes for invite links

* use JWTs for SSO sign in

* fix /invitation/new flow

* fix code quality checks

* fix _get_ui_session_token_from_cookies

* /organization/list

* ui sso sign in

* TestUISessionHandler

* TestUISessionHandler
2025-03-04 21:48:23 -08:00
Ishaan Jaff
f85d5afd58 Merge branch 'main' into litellm_fix_team_model_access_checks 2025-03-01 17:36:45 -08:00
Ishaan Jaff
5508239375
patch - auth checks for model access (#8924) 2025-03-01 10:11:44 -08:00
Krish Dholakia
c1527ebf52
UI - Allow admin to control default model access for internal users (#8912)
* fix(create_user_button.tsx): allow admin to set models user has access to, on invite

Enables controlling model access on invite

* feat(auth_checks.py): enforce 'no-model-access' special model name on backend

prevent user from calling models if default key has no model access

* fix(chat_ui.tsx): allow user to input custom model

* fix(chat_ui.tsx): pull available models based on models key has access to

* style(create_user_button.tsx): move default model inside 'personal key creation' accordion

* fix(chat_ui.tsx): fix linting error

* test(test_auth_checks.py): add unit-test for special model name

* docs(internal_user_endpoints.py): update docstring

* fix test_moderations_bad_model

* Litellm dev 02 27 2025 p6 (#8891)

* fix(http_parsing_utils.py): orjson can throw errors on some emoji's in text, default to json.loads

* fix(sagemaker/handler.py): support passing model id on async streaming

* fix(litellm_pre_call_utils.py): Fixes https://github.com/BerriAI/litellm/issues/7237

* Fix calling claude via invoke route + response_format support for claude on invoke route (#8908)

* fix(anthropic_claude3_transformation.py): fix amazon anthropic claude 3 tool calling transformation on invoke route

move to using anthropic config as base

* fix(utils.py): expose anthropic config via providerconfigmanager

* fix(llm_http_handler.py): support json mode on async completion calls

* fix(invoke_handler/make_call): support json mode for anthropic called via bedrock invoke

* fix(anthropic/): handle 'response_format: {"type": "text"}` + migrate amazon claude 3 invoke config to inherit from anthropic config

Prevents error when passing in 'response_format: {"type": "text"}

* test: fix test

* fix(utils.py): fix base invoke provider check

* fix(anthropic_claude3_transformation.py): don't pass 'stream' param

* fix: fix linting errors

* fix(converse_transformation.py): handle response_format type=text for converse

* converse_transformation: pass 'description' if set in response_format (#8907)

* test(test_bedrock_completion.py): e2e test ensuring tool description is passed in

* fix(converse_transformation.py): pass description, if set

* fix(transformation.py): Fixes https://github.com/BerriAI/litellm/issues/8767#issuecomment-2689887663

* Fix bedrock passing `response_format: {"type": "text"}` (#8900)

* fix(converse_transformation.py): ignore type: text, value in response_format

no-op for bedrock

* fix(converse_transformation.py): handle adding response format value to tools

* fix(base_invoke_transformation.py): fix 'get_bedrock_invoke_provider' to handle cross-region-inferencing models

* test(test_bedrock_completion.py): add unit testing for bedrock invoke provider logic

* test: update test

* fix(exception_mapping_utils.py): add context window exceeded error handling for databricks provider route

* fix(fireworks_ai/): support passing tools + response_format together

* fix: cleanup

* fix(base_invoke_transformation.py): fix imports

* (Feat) - Show Error Logs on LiteLLM UI  (#8904)

* fix test_moderations_bad_model

* use async_post_call_failure_hook

* basic logging errors in DB

* show status on ui

* show status on ui

* ui show request / response side by side

* stash fixes

* working, track raw request

* track error info in metadata

* fix showing error / request / response logs

* show traceback on error viewer

* ui with traceback of error

* fix async_post_call_failure_hook

* fix(http_parsing_utils.py): orjson can throw errors on some emoji's in text, default to json.loads

* test_get_error_information

* fix code quality

* rename proxy track cost callback test

* _should_store_errors_in_spend_logs

* feature flag error logs

* Revert "_should_store_errors_in_spend_logs"

This reverts commit 7f345df477.

* Revert "feature flag error logs"

This reverts commit 0e90c022bb.

* test_spend_logs_payload

* fix OTEL log_db_metrics

* fix import json

* fix ui linting error

* test_async_post_call_failure_hook

* test_chat_completion_bad_model_with_spend_logs

---------

Co-authored-by: Krrish Dholakia <krrishdholakia@gmail.com>

* ui new build

* test_chat_completion_bad_model_with_spend_logs

* docs(release_cycle.md): document release cycle

* bump: version 1.62.0 → 1.62.1

---------

Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>
2025-02-28 23:23:03 -08:00
Krish Dholakia
740bd7e9ce
(security fix) - Enforce model access restrictions on Azure OpenAI route (#8888)
* fix(user_api_key_auth.py): Fixes https://github.com/BerriAI/litellm/issues/8780

security fix - enforce model access checks on azure routes

* test(test_user_api_key_auth.py): add unit testing

* test(test_openai_endpoints.py): add e2e test to ensure azure routes also run through model validation checks
2025-02-27 21:24:58 -08:00
Ishaan Jaff
81039d8faf
(Bug fix) - allow using Assistants GET, DELETE on /openai pass through routes (#8818)
* test_openai_assistants_e2e_operations

* test openai assistants pass through

* fix GET request on pass through handler

* _make_non_streaming_http_request

* _is_assistants_api_request

* test_openai_assistants_e2e_operations

* test_openai_assistants_e2e_operations

* openai_proxy_route

* docs openai pass through

* docs openai pass through

* docs openai pass through

* test pass through handler

* Potential fix for code scanning alert no. 2240: Incomplete URL substring sanitization

Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>

---------

Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>
2025-02-25 19:19:00 -08:00
Ishaan Jaff
b6d6e270b4 can_team_access_model 2025-02-25 14:51:57 -08:00
Ishaan Jaff
eeee61db65 can_team_access_model 2025-02-25 14:50:10 -08:00
Ishaan Jaff
300d7825f5
(Observability) - Add more detailed dd tracing on Proxy Auth, Bedrock Auth (#8693)
* add dd tracer

* fix dd tracing

* add @tracer.wrap() on def user_api_key_auth

* add async_function_with_retries

* remove dead code

* add tracer.wrap on base aws llm

* add tracer.wrap on base aws llm

* fix print verbose

* fix dd tracing

* trace base aws llm

* fix test base aws llm

* fix converse transform

* test base aws llm

* BASE_AWS_LLM_PATH

* BASE_AWS_LLM_PATH

* test dd tracing
2025-02-20 18:00:41 -08:00
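`@tracer.wrap()` is Datadog's ddtrace decorator for emitting a span around a function call. A standalone example of the kind of instrumentation these commits add to the auth path; the service and resource names here are illustrative, not the proxy's actual decorator arguments:

```python
from ddtrace import tracer


# Each call to this function shows up as a span in Datadog APM.
@tracer.wrap(service="litellm-proxy", resource="user_api_key_auth")
def user_api_key_auth_example(api_key: str) -> dict:
    # ... actual auth checks would go here ...
    return {"api_key": api_key, "valid": True}
```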
Krrish Dholakia
7bfd816d3b build: merge commit 1b15568af7
Author: Krrish Dholakia <krrishdholakia@gmail.com>
Date:   Mon Feb 17 21:37:36 2025 -0800

    fix(proxy/_types.py): fix linting error

commit dc4d5cffa6
Author: Krrish Dholakia <krrishdholakia@gmail.com>
2025-02-17 21:56:00 -08:00
Krish Dholakia
57e5ec07cc
Improved wildcard route handling on /models and /model_group/info (#8473)
* fix(model_checks.py): update returning known model from wildcard to filter based on given model prefix

ensures wildcard route - `vertex_ai/gemini-*` just returns known vertex_ai/gemini- models

* test(test_proxy_utils.py): add unit testing for new 'get_known_models_from_wildcard' helper

* test(test_models.py): add e2e testing for `/model_group/info` endpoint

* feat(prometheus.py): support tracking total requests by user_email on prometheus

adds initial support for tracking total requests by user_email

* test(test_prometheus.py): add testing to ensure user email is always tracked

* test: update testing for new prometheus metric

* test(test_prometheus_unit_tests.py): add user email to total proxy metric

* test: update tests

* test: fix spend tests

* test: fix test

* fix(pagerduty.py): fix linting error
2025-02-11 19:37:43 -08:00
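For the wildcard filtering described above (a `vertex_ai/gemini-*` route should only surface matching known models), a small illustrative helper in the spirit of `get_known_models_from_wildcard`; the helper name and model list are hypothetical:

```python
from fnmatch import fnmatch

# Filter the known model list down to entries matching a wildcard route.
def known_models_from_wildcard(wildcard: str, known_models: list[str]) -> list[str]:
    return [m for m in known_models if fnmatch(m, wildcard)]


known = [
    "vertex_ai/gemini-1.5-pro",
    "vertex_ai/gemini-1.5-flash",
    "vertex_ai/claude-3-sonnet",
    "openai/gpt-4o",
]
print(known_models_from_wildcard("vertex_ai/gemini-*", known))
# ['vertex_ai/gemini-1.5-pro', 'vertex_ai/gemini-1.5-flash']
```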
Ishaan Jaff
81109893ec
(round 4 fixes) - Team model alias setting (#8474)
* update team info endpoint

* clean up model alias

* fix model alias

* fix model alias card

* clean up naming on docs

* fix model alias card

* fix _model_in_team_aliases

* team alias - fix litellm.model_alias_map

* fix _update_model_if_team_alias_exists

* fix test_aview_spend_per_user

* Test model alias functionality with teams:

* complete e2e test

* test_update_model_if_team_alias_exists
2025-02-11 16:40:01 -08:00
Ishaan Jaff
5cd20d2abc
(UI) allow adding model aliases for teams (#8471)
* update team info endpoint

* clean up model alias

* fix model alias

* fix model alias card

* clean up naming on docs

* fix model alias card

* fix _model_in_team_aliases

* fix key_model_access_denied

* test_can_key_call_model_with_aliases

* fix test_aview_spend_per_user
2025-02-11 16:18:43 -08:00
Krish Dholakia
9c4c7813fb
Allow org admin to create teams on UI (#8407)
* fix(client_initialization_utils.py): handle custom llm provider set with valid value not from model name

* fix(handle_jwt.py): handle groups not existing in jwt token

if user not in group, this won't exist

* fix(handle_jwt.py): add new `enforce_team_based_model_access` flag to jwt auth

allows proxy admin to enforce user can only call model if team has access

* feat(navbar.tsx): expose new dropdown in navbar - allow org admin to create teams within org context

* fix(navbar.tsx): remove non-functional cogicon

* fix(proxy/utils.py): include user-org memberships in `/user/info` response

return orgs user is a member of and the user role within org

* feat(organization_endpoints.py): allow internal user to query `/organizations/list` and get all orgs they belong to

enables org admin to select org they belong to, to create teams

* fix(navbar.tsx): show change in ui when org switcher clicked

* feat(page.tsx): update user role based on org they're in

allows org admin to create teams in the org context

* feat(teams.tsx): working e2e flow for allowing org admin to add new teams

* style(navbar.tsx): clarify switching orgs on UI is in BETA

* fix(organization_endpoints.py): handle getting but not setting members

* test: fix test

* fix(client_initialization_utils.py): revert custom llm provider handling fix - causing unintended issues

* docs(token_auth.md): cleanup docs
2025-02-09 00:07:15 -08:00
Krish Dholakia
d720744656
Litellm dev 02 06 2025 p3 (#8343)
* feat(handle_jwt.py): initial commit to allow scope based model access

* feat(handle_jwt.py): allow model access based on token scopes

allow admin to control model access from IDP

* test(test_jwt.py): add unit testing for scope based model access

* docs(token_auth.md): add scope based model access to docs

* docs(token_auth.md): update docs

* docs(token_auth.md): update docs

* build: add gemini commercial rate limits

* fix: fix linting error
2025-02-06 23:15:33 -08:00
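Scope-based model access maps IDP-issued JWT scopes to the models they may call. A sketch of the check with hypothetical scope names and mapping (in practice the mapping would be admin-configured, not hardcoded):

```python
# Illustrative: a request is allowed only if one of the token's scopes
# grants the requested model.
SCOPE_TO_MODELS = {
    "litellm.models.gpt4": ["gpt-4o", "gpt-4o-mini"],
    "litellm.models.claude": ["claude-3-5-sonnet"],
}


def can_scope_access_model(token_scopes: list[str], requested_model: str) -> bool:
    return any(
        requested_model in SCOPE_TO_MODELS.get(scope, []) for scope in token_scopes
    )


# A JWT's "scope" claim is typically a space-delimited string.
scopes = "openid litellm.models.gpt4".split()
assert can_scope_access_model(scopes, "gpt-4o")
assert not can_scope_access_model(scopes, "claude-3-5-sonnet")
```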
Krish Dholakia
b4e5c0de69
Improve rpm check on keys (#8301)
* fix(parallel_request_limiter.py): initial commit that solves the rpm limit check on keys

Fixes https://github.com/BerriAI/litellm/issues/6938

* fix(parallel_request_limiter.py): simpler approach - just increment RPM in pre call hook instead of on success

* fix(parallel_request_limiter.py): pass testing

* fix: fix linting error

* fix(parallel_request_limiter.py): fix parallel request check for keys
2025-02-05 20:23:08 -08:00
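The key idea in the rpm fix is to count the request in the pre-call hook rather than on success, so a burst of parallel requests can't all pass the check before any of them has been counted. An illustrative in-memory version; a real proxy would back this with a shared cache such as Redis:

```python
import time
from collections import defaultdict

# Per-key, per-minute request counters (sketch only).
_rpm_counters: dict[tuple[str, int], int] = defaultdict(int)


def pre_call_rpm_check(api_key: str, rpm_limit: int) -> None:
    window = int(time.time() // 60)  # current minute bucket
    key = (api_key, window)
    if _rpm_counters[key] >= rpm_limit:
        raise RuntimeError("RPM limit exceeded for this key")
    _rpm_counters[key] += 1  # count the request up front, not on success
```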
Krish Dholakia
4e34fc3bf8
[BETA] Support OIDC role based access to proxy (#8260)
* feat(proxy/_types.py): add new jwt field params

allows users + services to auth into proxy

* feat(handle_jwt.py): allow team role proxy access

allows proxy admin to set allowed team roles

* fix(proxy/_types.py): add 'routes' to role based permissions

allow proxy admin to restrict what routes a team can access easily

* feat(handle_jwt.py): support more flexible role based route access

v2 on role based 'allowed_routes'

* test(test_jwt.py): add unit test for rbac for proxy routes

* feat(handle_jwt.py): ensure cost tracking always works for any jwt request with `enforce_rbac=True`

* docs(token_auth.md): add documentation on controlling model access via OIDC Roles

* test: increase time delay before retrying

* test: handle model overloaded for test
2025-02-04 21:59:39 -08:00
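Role-based route access as described here boils down to a role-to-allowed-routes mapping checked on each request. A small illustrative check, with hypothetical role names and route patterns:

```python
import fnmatch

# Illustrative mapping of a team role to the proxy routes it may call.
ROLE_ALLOWED_ROUTES = {
    "proxy_admin": ["*"],
    "team": ["/chat/completions", "/embeddings", "/models"],
}


def is_route_allowed_for_role(role: str, route: str) -> bool:
    patterns = ROLE_ALLOWED_ROUTES.get(role, [])
    return any(fnmatch.fnmatch(route, pattern) for pattern in patterns)


assert is_route_allowed_for_role("team", "/chat/completions")
assert not is_route_allowed_for_role("team", "/key/generate")
```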
Steve Farthing
9724ee94df Feedback 2025-02-04 21:11:19 -05:00
Ishaan Jaff
c0f3100934
[Bug Fix] - /vertex_ai/ was not detected as llm_api_route on pass through but vertex-ai was (#8186)
* fix mapped_pass_through_routes

* fix route checks

* update test_is_llm_api_route
2025-02-01 17:26:08 -08:00
Krish Dholakia
8d0db8b379
build(schema.prisma): add new sso_user_id to LiteLLM_UserTable (#8167)
* build(schema.prisma): add new `sso_user_id` to LiteLLM_UserTable

easier way to store sso id for existing user

Allows existing user added to team, to login via SSO

* test(test_auth_checks.py): add unit testing for fuzzy user object get

* fix(handle_jwt.py): fix merge conflicts
2025-01-31 23:04:05 -08:00