Commit graph

4171 commits

Author SHA1 Message Date
Ishaan Jaff
03553e00f0 (UI + SpendLogs) - Store SpendLogs in UTC Timezone, Fix filtering logs by start/end time (#8190)
* fix request_id field

* spend logs store time in UTC

* fix ui_view_spend_logs

* UI make time filter queries in UTC

* fix time filters

* fix TimeCellProps

* ui use UTC for filtering time
2025-02-01 17:26:18 -08:00
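
A minimal sketch (not the actual LiteLLM code; names assumed) of the idea in the commit above: store SpendLogs timestamps in UTC and normalize any user-supplied start/end filter to UTC before querying.

```python
from datetime import datetime, timezone

def to_utc_iso(local_dt: datetime) -> str:
    """Normalize a possibly-naive local datetime to a UTC ISO-8601 string."""
    if local_dt.tzinfo is None:
        local_dt = local_dt.astimezone()  # treat naive values as local time
    return local_dt.astimezone(timezone.utc).isoformat()

# e.g. a UI filter of "2025-02-01 17:00" local time becomes a UTC timestamp
# that can be compared directly against UTC-stored SpendLogs rows.
print(to_utc_iso(datetime(2025, 2, 1, 17, 0)))
```
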
Ishaan Jaff
eaa436aaef [Bug Fix] - /vertex_ai/ was not detected as llm_api_route on pass through but vertex-ai was (#8186)
* fix mapped_pass_through_routes

* fix route checks

* update test_is_llm_api_route
2025-02-01 17:26:08 -08:00
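
A rough, illustrative sketch of the route check being fixed here (function and prefix names are assumptions, not LiteLLM's actual constants): both the `/vertex_ai/` and the `/vertex-ai/` spellings should be detected as LLM pass-through routes.

```python
LLM_PASS_THROUGH_PREFIXES = ("/vertex_ai/", "/vertex-ai/")  # both spellings accepted

def is_llm_api_route(route: str) -> bool:
    return route.startswith(LLM_PASS_THROUGH_PREFIXES)

assert is_llm_api_route("/vertex_ai/v1/projects/p/locations/us/publishers/google/models/gemini-pro:generateContent")
assert is_llm_api_route("/vertex-ai/v1/projects/p/locations/us/publishers/google/models/gemini-pro:generateContent")
```
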
Ishaan Jaff
8fdab540e4 ui new build 2025-02-01 11:41:30 -08:00
Krish Dholakia
e56061c122 test: add more unit testing for team member endpoints (#8170)
* test: add more unit testing for team member add

* fix(team_endpoints.py): add validation check to prevent same user from being added to team again

prevents duplicates

* fix(team_endpoints.py): raise error if `/team/member_delete` called on member that's not in team

prevents calling delete on the same member multiple times

* test: update initial tests

* test: fix test

* test: update test to handle no member duplication
2025-02-01 11:23:00 -08:00
Krish Dholakia
87ec637f7d Litellm dev contributor prs 01 31 2025 (#8168)
* Add O3-Mini for Azure and Remove Vision Support (#8161)

* Azure released O3-mini at the same time as OpenAI, so support is added here. Confirmed to work with Sweden Central.

* [FIX] replace cgi for python 3.13 with email.Message as suggested in PEP 594 (#8160)

* Update model_prices_and_context_window.json (#8120)

codestral2501 pricing on vertex_ai

* Fix/db view names (#8119)

* Fix case-sensitive DB view name

* Fix case-sensitive DB view names

* Added quotes to check query as well

* Added quotes to create view query

* test: handle server error for flaky test

vertex ai has unstable endpoints

---------

Co-authored-by: Wanis Elabbar <70503629+elabbarw@users.noreply.github.com>
Co-authored-by: Honghua Dong <dhh1995@163.com>
Co-authored-by: superpoussin22 <vincent.nadal@orange.fr>
Co-authored-by: Miguel Armenta <37154380+ma-armenta@users.noreply.github.com>
2025-02-01 09:05:20 -08:00
Krish Dholakia
9ee32ace6d build(schema.prisma): add new sso_user_id to LiteLLM_UserTable (#8167)
* build(schema.prisma): add new `sso_user_id` to LiteLLM_UserTable

easier way to store the SSO id for an existing user

Allows an existing user added to a team to log in via SSO

* test(test_auth_checks.py): add unit testing for fuzzy user object get

* fix(handle_jwt.py): fix merge conflicts
2025-01-31 23:04:05 -08:00
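
An illustrative sketch (assumed data shapes, not the real Prisma/DB layer) of the "fuzzy user object get" mentioned above: look the user up by their primary user_id first, then fall back to the new sso_user_id column so an existing user can sign in via SSO.

```python
from typing import Optional

def fuzzy_get_user(identifier: str, db_rows: list) -> Optional[dict]:
    # First pass: exact match on the primary user id.
    for row in db_rows:
        if row.get("user_id") == identifier:
            return row
    # Second pass: match on the SSO id stored in the new column.
    for row in db_rows:
        if row.get("sso_user_id") == identifier:
            return row
    return None

users = [{"user_id": "u-123", "sso_user_id": "okta|abc", "teams": ["team-1"]}]
assert fuzzy_get_user("okta|abc", users)["user_id"] == "u-123"
```
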
Krish Dholakia
a008a2d4f4 Litellm dev 01 31 2025 p2 (#8164)
* docs(token_auth.md): clarify title

* refactor(handle_jwt.py): add jwt auth manager + refactor to handle groups

allows a user to call a model if they belong to a group with access to that model

* refactor(handle_jwt.py): refactor to first check if service call then check user call

* feat(handle_jwt.py): new `enforce_team_access` param

only allows a user to call a model if a team they belong to has model access

allows controlling user model access by team

* fix(handle_jwt.py): fix error string, remove unnecessary param

* docs(token_auth.md): add controlling model access for jwt tokens via teams to docs

* test: fix tests post refactor

* fix: fix linting errors

* fix: fix linting error

* test: fix import error
2025-01-31 22:52:35 -08:00
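
A simplified sketch of the access rule described above (data shapes assumed): with enforce_team_access-style behavior, a user may call a model only if at least one team they belong to grants access to it.

```python
def team_grants_model(team_models: list, requested: str) -> bool:
    # A team grants access via an explicit model name or a wildcard entry.
    return "*" in team_models or requested in team_models

def user_can_call_model(user_teams: list, requested_model: str) -> bool:
    return any(team_grants_model(t.get("models", []), requested_model) for t in user_teams)

teams = [{"team_id": "eng", "models": ["gpt-4o", "claude-3-5-sonnet"]}]
assert user_can_call_model(teams, "gpt-4o")
assert not user_can_call_model(teams, "o3-mini")
```
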
Krish Dholakia
16b5de07af Doc updates + management endpoint fixes (#8138)
* Litellm dev 01 29 2025 p4 (#8107)

* fix(key_management_endpoints.py): always get db team

Fixes https://github.com/BerriAI/litellm/issues/7983

* test(test_key_management.py): add unit test enforcing check_db_only is always true on key generate checks

* test: fix test

* test: skip gemini thinking

* Litellm dev 01 29 2025 p3 (#8106)

* fix(__init__.py): reduces size of __init__.py and reduces scope for errors by using correct param

* refactor(__init__.py): refactor init by cleaning up redundant params

* refactor(__init__.py): move more constants into constants.py

cleanup root

* refactor(__init__.py): more cleanup

* feat(__init__.py): expose new 'disable_hf_tokenizer_download' param

enables hf model usage in offline env

* docs(config_settings.md): document new disable_hf_tokenizer_download param

* fix: fix linting error

* fix: fix unsafe comparison

* test: fix test

* docs(public_teams.md): add doc showing how to expose public teams for users to join

* docs: add beta disclaimer on public teams

* test: update tests
2025-01-30 22:56:41 -08:00
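
A hedged usage sketch for the new `disable_hf_tokenizer_download` setting, based only on the commit message (the exact attribute location is an assumption): it should keep token counting from downloading HuggingFace tokenizers, which matters in offline environments.

```python
import litellm

litellm.disable_hf_tokenizer_download = True  # assumed module-level flag from this PR

# Token counting then relies on a bundled/default tokenizer instead of a download.
print(litellm.token_counter(model="gpt-4o", text="hello world"))
```
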
Krish Dholakia
2eee7f978f Litellm dev 01 30 2025 p2 (#8134)
* feat(lowest_tpm_rpm_v2.py): fix redis cache check to use >= instead of >

makes it consistent

* test(test_custom_guardrails.py): add more unit testing on default on guardrails

ensure they run even if the user-sent guardrail list is empty

* docs(quick_start.md): clarify that default-on guardrails run even if the user's guardrail list contains other guardrails

* refactor(litellm_logging.py): refactor no-log to helper util

allows for more consistent behavior

* feat(litellm_logging.py): add event hook to verbose logs

* fix(litellm_logging.py): add unit testing to ensure `litellm.disable_no_log_param` is respected

* docs(logging.md): document how to disable 'no-log' param

* test: fix test to handle feb

* test: cleanup old bedrock model

* fix: fix router check
2025-01-30 22:18:53 -08:00
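
An illustrative sketch (names made up) of the "default on" behavior the tests above cover: guardrails marked default_on always run, even when the request's own guardrail list is empty or names other guardrails.

```python
def guardrails_to_run(configured: list, requested: list) -> list:
    default_on = [g["name"] for g in configured if g.get("default_on")]
    return sorted(set(default_on) | set(requested))

configured = [
    {"name": "pii-mask", "default_on": True},
    {"name": "prompt-injection", "default_on": False},
]
assert guardrails_to_run(configured, []) == ["pii-mask"]
assert guardrails_to_run(configured, ["prompt-injection"]) == ["pii-mask", "prompt-injection"]
```
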
Ishaan Jaff
44d6c436a7 ui new build 2025-01-30 21:19:39 -08:00
Ishaan Jaff
fa1c42378f (Refactor / QA) - Use LoggingCallbackManager to append callbacks and ensure no duplicate callbacks are added (#8112)
* LoggingCallbackManager

* add logging_callback_manager

* use logging_callback_manager

* add add_litellm_failure_callback

* use add_litellm_callback

* use add_litellm_async_success_callback

* add_litellm_async_failure_callback

* linting fix

* fix logging callback manager

* test_duplicate_multiple_loggers_test

* use _reset_all_callbacks

* fix testing with dup callbacks

* test_basic_image_generation

* reset callbacks for tests

* fix check for _add_custom_logger_to_list

* fix test_amazing_sync_embedding

* fix _get_custom_logger_key

* fix batches testing

* fix _reset_all_callbacks

* fix _check_callback_list_size

* add callback_manager_test

* fix test gemini-2.0-flash-thinking-exp-01-21
2025-01-30 19:35:50 -08:00
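
A minimal sketch of the idea behind a logging callback manager (class and method names here are illustrative, not LiteLLM's): route every callback registration through one helper so the same logger is never appended twice.

```python
from typing import Callable, List, Union

class CallbackManager:
    def __init__(self) -> None:
        self.success_callbacks: List[Union[str, Callable]] = []

    def add_success_callback(self, cb: Union[str, Callable]) -> None:
        if cb not in self.success_callbacks:  # skip duplicates
            self.success_callbacks.append(cb)

mgr = CallbackManager()
mgr.add_success_callback("langfuse")
mgr.add_success_callback("langfuse")  # ignored - already registered
assert mgr.success_callbacks == ["langfuse"]
```
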
Ishaan Jaff
ee7254488b (UI) Fix SpendLogs page - truncate bedrock models + show end_user (#8118)
* ui spend logs table truncate bedrock page

* ui - show user / internal user fields
2025-01-30 13:59:13 -08:00
Krish Dholakia
177f565d6f Litellm dev 01 29 2025 p2 (#8102)
* docs: cleanup doc

* feat(bedrock/): initial commit adding bedrock/converse_like/<model> route support

allows routing to a converse like endpoint

Resolves https://github.com/BerriAI/litellm/issues/8085

* feat(bedrock/chat/converse_transformation.py): make converse config base config compatible

enables new 'converse_like' route

* feat(converse_transformation.py): enables using the proxy with converse like api endpoint

Resolves https://github.com/BerriAI/litellm/issues/8085
2025-01-29 20:53:37 -08:00
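
A hedged usage sketch for the route named in this commit, `bedrock/converse_like/<model>`: point it at any Converse-compatible endpoint. The api_base, api_key, and model values below are placeholders.

```python
import litellm

response = litellm.completion(
    model="bedrock/converse_like/my-custom-model",       # route named in the commit
    api_base="https://example.com/converse-compatible",  # assumed custom endpoint
    api_key="sk-placeholder",
    messages=[{"role": "user", "content": "Hello"}],
)
print(response.choices[0].message.content)
```
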
Ishaan Jaff
dee1c00b92 ui new build 2025-01-29 18:02:30 -08:00
Ishaan Jaff
9d8769fa1c (Feat) pass through vertex - allow using credentials defined on litellm router for vertex pass through (#8100)
* test_add_vertex_pass_through_deployment

* VertexPassThroughRouter

* fix use_in_pass_through

* VertexPassThroughRouter

* fix vertex_credentials

* allow using _initialize_deployment_for_pass_through

* test_add_vertex_pass_through_deployment

* _set_default_vertex_config

* fix verbose_proxy_logger

* fix use_in_pass_through

* fix _get_token_and_url

* test_get_vertex_location_from_url

* test_get_vertex_credentials_none

* run pt unit testing again

* fix add_vertex_credentials

* test_adding_deployments.py

* rename file
2025-01-29 17:54:02 -08:00
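
A hedged sketch of the feature above: define Vertex credentials on a router deployment and mark it for pass-through use. The field names follow the commit message (`use_in_pass_through`, `vertex_credentials`), but their exact placement in the config is an assumption.

```python
from litellm import Router

router = Router(
    model_list=[
        {
            "model_name": "gemini-1.5-pro",
            "litellm_params": {
                "model": "vertex_ai/gemini-1.5-pro",
                "vertex_project": "my-gcp-project",                     # placeholder
                "vertex_location": "us-central1",
                "vertex_credentials": "/path/to/service-account.json",  # placeholder
                "use_in_pass_through": True,                            # flag named in this PR
            },
        }
    ]
)
```
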
Ishaan Jaff
a23e292128 (UI) - View Logs Page - Refinement (#8087)
* working refetch interval

* ui show provider logo in SpendLogs Table

* fix padding

* improve time range filter

* ui fix diff minutes

* fix refresh button placement
2025-01-29 08:46:20 -08:00
Ishaan Jaff
86f85f54ea ui new build 2025-01-28 22:15:08 -08:00
Ishaan Jaff
05111a1f1c (fix) - proxy reliability, ensure duplicate callbacks are not added to proxy (#8067)
* refactor _add_callbacks_from_db_config

* fix check for _custom_logger_exists_in_litellm_callbacks

* move loc of test utils

* run ci/cd again

* test_add_custom_logger_callback_to_specific_event_with_duplicates_callbacks

* fix _custom_logger_class_exists_in_success_callbacks

* unit testing for test_add_callbacks_from_db_config

* test_custom_logger_exists_in_callbacks_individual_functions

* fix config.yml

* fix test test_stream_chunk_builder_openai_audio_output_usage - use direct dict comparison
2025-01-28 21:01:56 -08:00
Ishaan Jaff
a9e6c09776 (beta ui - spend logs view fixes & Improvements 1) (#8062)
* ui 1 - show correct msg on no logs

* fix dup country col

* backend - allow filtering by team_id and api_key

* fix ui_view_spend_logs

* ui update query params

* working team id and key hash filters

* fix filter ref - don't hold on to them as they are

* fix _model_custom_llm_provider_matches_wildcard_pattern

* fix test test_stream_chunk_builder_openai_audio_output_usage - use direct dict comparison
2025-01-28 20:34:22 -08:00
Ishaan Jaff
42467c1d2f ui new build 2025-01-28 18:25:45 -08:00
Krish Dholakia
78da805f89 Litellm dev 01 27 2025 p3 (#8047)
* docs(reliability.md): add doc on disabling fallbacks per request

* feat(litellm_pre_call_utils.py): support reading request timeout from request headers - new `x-litellm-timeout` param

Allows setting dynamic model timeouts from vercel's AI sdk

* test(test_proxy_server.py): add simple unit test for reading request timeout

* test(test_fallbacks.py): add e2e test to confirm timeout passed in request headers is correctly read

* feat(main.py): support passing metadata to openai in preview

Resolves https://github.com/BerriAI/litellm/issues/6022#issuecomment-2616119371

* fix(main.py): fix passing openai metadata

* docs(request_headers.md): document new request headers

* build: Merge branch 'main' into litellm_dev_01_27_2025_p3

* test: loosen test
2025-01-28 18:01:27 -08:00
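
A hedged example of the new request header named above: the `x-litellm-timeout` header sets a per-request timeout when calling the proxy. The proxy URL, key, and model are placeholders.

```python
import openai

client = openai.OpenAI(base_url="http://localhost:4000", api_key="sk-1234")

resp = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": "ping"}],
    extra_headers={"x-litellm-timeout": "5"},  # timeout in seconds, read by the proxy
)
print(resp.choices[0].message.content)
```
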
Krish Dholakia
e092635838 Bedrock document processing fixes (#8005)
* refactor(factory.py): refactor async bedrock message transformation to use async get request for image url conversion

improve latency of bedrock call

* test(test_bedrock_completion.py): add unit testing to ensure async image url get called for async bedrock call

* refactor(factory.py): refactor bedrock translation to use BedrockImageProcessor

reduces duplicate code

* fix(factory.py): fix bug not allowing pdf's to be processed

* fix(factory.py): fix bedrock converse document understanding with image url

* docs(bedrock.md): clarify all bedrock document types are supported

* refactor: cleanup redundant test + unused imports

* perf: improve perf with reusable clients

* test: fix test
2025-01-28 17:48:32 -08:00
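
A small illustrative sketch (not LiteLLM's factory code) of the async transformation described above: fetch an image/document URL with an async HTTP client and base64-encode it for Bedrock, rather than blocking the event loop with a synchronous request.

```python
import asyncio
import base64

import httpx

async def image_url_to_base64(url: str) -> str:
    async with httpx.AsyncClient() as client:
        resp = await client.get(url)
        resp.raise_for_status()
        return base64.b64encode(resp.content).decode("utf-8")

# asyncio.run(image_url_to_base64("https://example.com/doc.pdf"))
```
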
Krish Dholakia
169171268f feat(handle_jwt.py): initial commit adding custom RBAC support on jwt… (#8037)
* feat(handle_jwt.py): initial commit adding custom RBAC support on jwt auth

allows admin to define user role field and allowed roles which map to 'internal_user' on litellm

* fix(auth_checks.py): ensure user allowed to access model, when calling via personal keys

Fixes https://github.com/BerriAI/litellm/issues/8029

* feat(handle_jwt.py): support role based access with model permission control on proxy

Allows the admin to grant users roles on the IDP (e.g. Azure AD/Keycloak), and users can immediately start calling models

* docs(rbac): add docs on rbac for model access control

make it clear how admin can use roles to control model access on proxy

* fix: fix linting errors

* test(test_user_api_key_auth.py): add unit testing to ensure rbac role is correctly enforced

* test(test_user_api_key_auth.py): add more testing

* test(test_users.py): add unit testing to ensure user model access is always checked for new keys

Resolves https://github.com/BerriAI/litellm/issues/8029

* test: fix unit test

* fix(dot_notation_indexing.py): fix typing to work with python 3.8
2025-01-28 16:27:06 -08:00
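
An illustrative sketch of the custom-RBAC idea (claim path and role names are made up): the admin names the JWT claim that holds the user's roles and declares which of those roles map to an internal user on the proxy.

```python
JWT_ROLE_FIELD = "resource_access.litellm.roles"    # admin-configured claim path (example)
ALLOWED_INTERNAL_ROLES = {"litellm-user", "ml-engineer"}

def roles_from_claims(claims: dict, field_path: str) -> list:
    node = claims
    for part in field_path.split("."):
        node = node.get(part, {}) if isinstance(node, dict) else {}
    return node if isinstance(node, list) else []

def is_internal_user(claims: dict) -> bool:
    return any(r in ALLOWED_INTERNAL_ROLES for r in roles_from_claims(claims, JWT_ROLE_FIELD))

claims = {"resource_access": {"litellm": {"roles": ["ml-engineer"]}}}
assert is_internal_user(claims)
```
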
Ishaan Jaff
8f849c011d ui new build 2025-01-27 18:35:04 -08:00
Ishaan Jaff
d0cf0a55bb (UI) - allow assigning wildcard models to a team / key (#8041)
* fix message.error

* fix add return_wildcard_routes

* ui edit modelAvailableCall

* fetchAvailableModelsForTeamOrKey

* ui set all models for a team

* ui define common helpers

* edit create key button

* fix viewing model display names

* fix editing team models

* update gitignore

* add jest testing for ui

* Revert "add jest testing for ui"

This reverts commit 98f9a3ebfd.
2025-01-27 18:06:22 -08:00
Ishaan Jaff
18808a5d64 (UI) - Adding new models enhancement - show provider logo (#8033)
* ui allow wildcard models

* ui show model dashboard

* add advanced settings in card

* fix button

* ui - add provider logos on admin ui
2025-01-27 13:15:42 -08:00
Krish Dholakia
e96788ac0b Litellm dev 01 25 2025 p4 (#8006)
* feat(main.py): use asyncio.sleep for mock_Timeout=true on async request

adds unit testing to ensure the proxy does not fail if specific OpenAI requests hang (e.g. the recent o1 outage)

* fix(streaming_handler.py): fix deepseek r1 return reasoning content on streaming

Fixes https://github.com/BerriAI/litellm/issues/7942

* Revert "fix(streaming_handler.py): fix deepseek r1 return reasoning content on streaming"

This reverts commit 7a052a64e3.

* fix(deepseek-r-1): return reasoning_content as a top-level param

ensures compatibility with existing tools that use it

* fix: fix linting error
2025-01-26 08:01:05 -08:00
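
A hedged usage sketch of the deepseek-r1 fix above: the model's reasoning is surfaced as a top-level `reasoning_content` field alongside the normal content (field name taken from the commit message; assumes DeepSeek's `deepseek-reasoner` model id).

```python
import litellm

resp = litellm.completion(
    model="deepseek/deepseek-reasoner",
    messages=[{"role": "user", "content": "What is 17 * 23?"}],
)
msg = resp.choices[0].message
print(getattr(msg, "reasoning_content", None))  # reasoning trace, if returned
print(msg.content)                              # final answer
```
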
Krish Dholakia
96488ae118 Fix custom pricing - separate provider info from model info (#7990)
* fix(utils.py): initial commit fixing custom cost tracking

refactors out provider-specific model info from `get_model_info` - this was causing custom costs to be registered incorrectly

* fix(utils.py): cleanup `_supports_factory` to check provider info, if model info is None

some providers support features like vision across all models

* fix(utils.py): refactor to use _supports_factory

* test: update testing

* fix: fix linting errors

* test: fix testing
2025-01-25 21:49:28 -08:00
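
A hedged sketch of custom pricing using `litellm.register_model`: register per-token costs for a model so cost tracking uses them instead of the provider defaults. The model name and cost values below are placeholders.

```python
import litellm

litellm.register_model(
    {
        "my-org/custom-model": {
            "input_cost_per_token": 0.0000005,   # placeholder pricing
            "output_cost_per_token": 0.0000015,
            "litellm_provider": "openai",
            "mode": "chat",
        }
    }
)
print(litellm.model_cost["my-org/custom-model"]["input_cost_per_token"])
```
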
Ishaan Jaff
d35a5c6bae (QA / testing) - Add e2e tests for key model access auth checks (#8000)
* fix _model_matches_any_wildcard_pattern_in_list

* test key model access checks

* add key_model_access_denied to ProxyErrorTypes

* update auth checks

* test_model_access_update

* test_team_model_access_patterns

* fix _team_model_access_check

* fix config used for otel testing

* test fix test_call_with_invalid_model

* fix model access check tests

* test_team_access_groups

* test _model_matches_any_wildcard_pattern_in_list
2025-01-25 17:15:11 -08:00
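
An illustrative sketch of the wildcard model-access check these tests exercise (fnmatch is used here for brevity; it is not necessarily what LiteLLM uses): a key or team model list may contain patterns like `openai/*`, and a request is allowed if any pattern matches.

```python
from fnmatch import fnmatch

def model_matches_any_pattern(requested: str, allowed: list) -> bool:
    return any(fnmatch(requested, pattern) for pattern in allowed)

assert model_matches_any_pattern("openai/gpt-4o", ["openai/*"])
assert not model_matches_any_pattern("anthropic/claude-3-5-sonnet", ["openai/*"])
```
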
Ishaan Jaff
b0cfdd3411 fix check on guardrails (#8008) 2025-01-25 17:14:35 -08:00
Krish Dholakia
26a4958be5 Litellm dev 01 25 2025 p2 (#8003)
* fix(base_utils.py): support nested json schema passed in for anthropic calls

* refactor(base_utils.py): refactor ref parsing to prevent infinite loop

* test(test_openai_endpoints.py): refactor anthropic test to use bedrock

* fix(langfuse_prompt_management.py): add unit test for sync langfuse calls

Resolves https://github.com/BerriAI/litellm/issues/7938#issuecomment-2613293757
2025-01-25 16:50:57 -08:00
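
A simplified sketch of the ref-parsing concern above: when expanding `$ref` entries in a nested JSON schema, track the refs already being expanded so a self-referencing schema cannot cause an infinite loop. Illustrative code, not LiteLLM's implementation.

```python
def expand_refs(node, defs, seen=None):
    seen = seen or set()
    if isinstance(node, dict):
        ref = node.get("$ref")
        if ref is not None:
            if ref in seen:            # already expanding this ref -> stop recursion
                return {}
            seen = seen | {ref}
            node = defs.get(ref.split("/")[-1], {})
        return {k: expand_refs(v, defs, seen) for k, v in node.items()}
    if isinstance(node, list):
        return [expand_refs(v, defs, seen) for v in node]
    return node

schema = {"type": "object", "properties": {"child": {"$ref": "#/$defs/Node"}}}
defs = {"Node": {"type": "object", "properties": {"child": {"$ref": "#/$defs/Node"}}}}
print(expand_refs(schema, defs))  # terminates instead of recursing forever
```
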
Ishaan Jaff
fe24e729a9 (Feat) set guardrails per team (#7993)
* _add_guardrails_from_key_or_team_metadata

* e2e test test_guardrails_with_team_controls

* add try/except on team new

* test_guardrails_with_team_controls

* test_guardrails_with_api_key_controls
2025-01-25 10:41:11 -08:00
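
A hedged sketch of team-level guardrail controls via the proxy's `/team/new` endpoint. Whether the guardrail list lives at the top level or under `metadata` may differ from this illustration; the URL and admin key are placeholders.

```python
import requests

resp = requests.post(
    "http://localhost:4000/team/new",
    headers={"Authorization": "Bearer sk-1234"},   # proxy admin key (placeholder)
    json={
        "team_alias": "prod-team",
        "metadata": {"guardrails": ["pii-mask"]},  # guardrails applied to this team's requests
    },
    timeout=10,
)
print(resp.json())
```
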
Ishaan Jaff
9e64c7ca0c (Prometheus) - emit key budget metrics on startup (#8002)
* add UI_SESSION_TOKEN_TEAM_ID

* add type KeyListResponseObject

* add _list_key_helper

* _initialize_api_key_budget_metrics

* key / budget metrics

* init key budget metrics on startup

* test_initialize_api_key_budget_metrics

* fix linting

* test_list_key_helper

* test_initialize_remaining_budget_metrics_exception_handling
2025-01-25 10:37:52 -08:00
Ishaan Jaff
96888afa17 (QA / testing) - Add unit testing for key model access checks (#7999)
* fix _model_matches_any_wildcard_pattern_in_list

* fix docstring
2025-01-25 10:01:35 -08:00
Krish Dholakia
82ba5b29f3 Litellm dev 01 24 2025 p4 (#7992)
* feat(team_endpoints.py): new `/teams/available` endpoint - allows proxy admin to expose available teams for users to join on UI

* build(ui/): available_teams.tsx

allow user to join available teams on UI

makes it easier to onboard new users to teams

* fix(navbar.tsx): cleanup title

* fix(team_endpoints.py): fix linting error

* test: update groq model in test

* build(model_prices_and_context_window.json): update groq 3.3 model with 'supports function calling'
2025-01-24 21:29:37 -08:00
Krish Dholakia
e01c9c1fc6 fix(spend_tracking_utils.py): revert api key pass through fix (#7977)
* fix(spend_tracking_utils.py): revert api key pass through fix

* fix: fix linting error

* fix(spend_tracking_utils.py): add noqa - refactor post fixing standard logging payload on pass-through endpoints

* test(test_groq.py): bump groq model

* fix: fix positioning of noqa
2025-01-24 21:04:36 -08:00
Ishaan Jaff
67e9dbcc98 ui new build 2025-01-24 21:03:10 -08:00
Ishaan Jaff
31bd6149fa (Feat) - Add GCS Pub/Sub Logging integration for sending DB SpendLogs to BigQuery (#7976)
* add pub_sub

* fix custom batch logger for GCS PUB/SUB

* GCS_PUBSUB_PROJECT_ID

* e2e gcs pub sub

* add gcs pub sub

* fix logging

* add GcsPubSubLogger

* fix pub sub

* add pub sub

* docs gcs pub / sub

* docs on pub sub controls

* test_gcs_pub_sub

* fix publish_message

* test_async_gcs_pub_sub

* test_async_gcs_pub_sub
2025-01-24 20:57:20 -08:00
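
An illustrative sketch of the underlying mechanism (not LiteLLM's integration code): publish SpendLogs rows to a Pub/Sub topic, from which BigQuery can ingest them. Requires `google-cloud-pubsub`; project and topic names are placeholders.

```python
import json

from google.cloud import pubsub_v1

PROJECT_ID = "my-gcp-project"     # placeholder, cf. GCS_PUBSUB_PROJECT_ID
TOPIC_ID = "litellm-spend-logs"   # placeholder topic

def publish_spend_logs(rows: list) -> None:
    publisher = pubsub_v1.PublisherClient()
    topic_path = publisher.topic_path(PROJECT_ID, TOPIC_ID)
    for row in rows:
        publisher.publish(topic_path, data=json.dumps(row).encode("utf-8")).result()

publish_spend_logs([{"request_id": "abc", "spend": 0.0021, "model": "gpt-4o"}])
```
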
Ishaan Jaff
3099ddfb7c (Testing) e2e testing for team budget enforcement checks (#7988)
* test_team_and_key_budget_enforcement

* test_team_budget_update

* test_gemini_pro_json_schema_httpx_content_policy_error
2025-01-24 18:18:12 -08:00
Krrish Dholakia
2646b20c92 fix(langsmith.py): add /api/v1 to langsmith base url
ensures it works with self-hosted LangSmith
2025-01-24 17:58:42 -08:00
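
A small sketch of the LangSmith base-URL fix: self-hosted deployments expose the API under `/api/v1`, so the logger appends it when missing. The environment variable name used here is an assumption.

```python
import os

def langsmith_api_base() -> str:
    base = os.getenv("LANGSMITH_BASE_URL", "https://api.smith.langchain.com").rstrip("/")
    return base if base.endswith("/api/v1") else base + "/api/v1"

print(langsmith_api_base())
```
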
Ishaan Jaff
1b4c1cca52 Revert "test_team_and_key_budget_enforcement"
This reverts commit 9d44f51847.
2025-01-24 15:32:41 -08:00
Ishaan Jaff
171c4012b1 test_team_and_key_budget_enforcement 2025-01-24 15:31:48 -08:00
Ishaan Jaff
d1bc955d97 (Feat) - allow setting default_on guardrails (#7973)
* test_default_on_guardrail

* update debug on custom guardrail

* refactor guardrails init

* guardrail registry

* allow switching guardrails default_on

* fix circle import issue

* fix bedrock applying guardrails where content is a list

* fix unused import

* docs default on guardrail

* docs fix per api key
2025-01-24 10:14:05 -08:00
Krish Dholakia
e6e4da75d7 Ollama ssl verify = False + Spend Logs reliability fixes (#7931)
* fix(http_handler.py): support passing ssl verify dynamically and using the correct httpx client based on passed ssl verify param

Fixes https://github.com/BerriAI/litellm/issues/6499

* feat(llm_http_handler.py): support passing `ssl_verify=False` dynamically in call args

Closes https://github.com/BerriAI/litellm/issues/6499

* fix(proxy/utils.py): prevent bad logs from breaking all cost tracking + reset list regardless of success/failure

prevents malformed logs from causing all spend tracking to break since they're constantly retried

* test(test_proxy_utils.py): add test to ensure bad log is dropped

* test(test_proxy_utils.py): ensure in-memory spend logs reset after bad log error

* test(test_user_api_key_auth.py): add unit test to ensure end user id as str works

* fix(auth_utils.py): ensure extracted end user id is always a str

prevents db cost tracking errors

* test(test_auth_utils.py): ensure get end user id from request body always returns a string

* test: update tests

* test: skip bedrock test- behaviour now supported

* test: fix testing

* refactor(spend_tracking_utils.py): reduce size of get_logging_payload

* test: fix test

* bump: version 1.59.4 → 1.59.5

* Revert "bump: version 1.59.4 → 1.59.5"

This reverts commit 1182b46b2e.

* fix(utils.py): fix spend logs retry logic

* fix(spend_tracking_utils.py): fix get tags

* fix(spend_tracking_utils.py): fix end user id spend tracking on pass-through endpoints
2025-01-23 23:05:41 -08:00
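
A hedged usage sketch of the ssl_verify change above: disable certificate verification for a single call, e.g. an Ollama server behind a self-signed cert (only do this for trusted, internal endpoints). The model and api_base are placeholders.

```python
import litellm

resp = litellm.completion(
    model="ollama/llama3",
    api_base="https://ollama.internal:11434",  # placeholder self-hosted endpoint
    messages=[{"role": "user", "content": "hello"}],
    ssl_verify=False,                          # per-call flag described in this PR
)
print(resp.choices[0].message.content)
```
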
Krish Dholakia
fe460f19f5 Add datadog health check support + fix bedrock converse cost tracking w/ region name specified (#7958)
* fix(bedrock/converse_handler.py): fix bedrock region name on async calls

* fix(utils.py): fix split model handling

Fixes bedrock cost calculation when region name is given

* feat(_health_endpoints.py): support health checking datadog integration

Closes https://github.com/BerriAI/litellm/issues/7921
2025-01-23 22:17:09 -08:00
Ishaan Jaff
c0e83ab377 ui new build - 2025-01-23 21:11:23 -08:00
Krish Dholakia
7cced62815 Litellm dev 01 23 2025 p2 (#7962)
* fix(ui/): revert user team key view

* fix(view_key_table.tsx): fix default team view - show all personal keys

* fix(navbar.tsx): fix custom logo

Fixes https://github.com/BerriAI/litellm/issues/7895

---------

Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>
2025-01-23 21:02:15 -08:00
Ishaan Jaff
40edee3fd6 fix LiteLLM_ManagementEndpoint_MetadataFields 2025-01-23 20:59:38 -08:00
Ishaan Jaff
36a7954045 (UI) Set guardrails on Team Create and Edit page (#7963)
* team edit guardrail

* LiteLLM_TeamTable
2025-01-23 20:57:34 -08:00
Ishaan Jaff
7564190036 (Feat) allow setting guardrails on a team on the API (#7959)
* allow setting guardrails on a team

* test set guardrails on team

* set guardrails on a team

* fix LiteLLM_ManagementEndpoint_MetadataFields_Premium
2025-01-23 20:26:51 -08:00