Commit graph

1677 commits

Author SHA1 Message Date
Krish Dholakia
e26d7df91b
Litellm dev 02 10 2025 p2 (#8443)
* Fixed issue #8246 (#8250)

* Fixed issue #8246

* Added unit tests for discard() and for remove_callback_from_list_by_object()

* fix(openai.py): support dynamic passing of organization param to openai

handles scenario where client-side org id is passed to openai

---------

Co-authored-by: Erez Hadad <erezh@il.ibm.com>
2025-02-10 17:53:46 -08:00
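
A minimal sketch of the client-side call this enables, assuming `litellm.completion` forwards an `organization` kwarg to the OpenAI SDK (kwarg name taken from the commit message, not a confirmed snippet from this PR):

    import litellm

    # Org id supplied per request, instead of via the OPENAI_ORGANIZATION env var.
    response = litellm.completion(
        model="gpt-4o-mini",
        messages=[{"role": "user", "content": "hi"}],
        organization="org-EXAMPLE",  # assumed kwarg, per the commit message
    )
    print(response.choices[0].message.content)
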
Krish Dholakia
47f46f92c8
Litellm dev 02 10 2025 p1 (#8438)
* fix(azure/chat/gpt_transformation.py): fix str compare to use int - ensure correct api version check is done

Resolves https://github.com/BerriAI/litellm/issues/8241#issuecomment-2647142891

* test(test_azure_openai.py): add better testing
2025-02-10 16:25:04 -08:00
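
The class of bug fixed here is easy to reproduce: lexicographic comparison of numeric version parts goes wrong once digit widths differ. A standalone illustration (not the actual gpt_transformation.py code):

    # String compare is wrong once digit widths differ:
    assert "9" > "10"            # lexicographic - incorrect result
    assert int("9") < int("10")  # numeric - correct

    def is_version_at_least(api_version: str, year: int, month: int) -> bool:
        """Compare an Azure-style api version like '2024-08-01-preview'
        numerically instead of as raw strings."""
        parts = api_version.split("-")
        return (int(parts[0]), int(parts[1])) >= (year, month)

    assert is_version_at_least("2024-10-01-preview", 2024, 8)
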
Krish Dholakia
9c4c7813fb
Allow org admin to create teams on UI (#8407)
* fix(client_initialization_utils.py): handle custom llm provider set with valid value not from model name

* fix(handle_jwt.py): handle groups not existing in jwt token

if user not in group, this won't exist

* fix(handle_jwt.py): add new `enforce_team_based_model_access` flag to jwt auth

allows proxy admin to enforce user can only call model if team has access

* feat(navbar.tsx): expose new dropdown in navbar - allow org admin to create teams within org context

* fix(navbar.tsx): remove non-functional cogicon

* fix(proxy/utils.py): include user-org memberships in `/user/info` response

return orgs user is a member of and the user role within org

* feat(organization_endpoints.py): allow internal user to query `/organizations/list` and get all orgs they belong to

enables org admin to select org they belong to, to create teams

* fix(navbar.tsx): show change in ui when org switcher clicked

* feat(page.tsx): update user role based on org they're in

allows org admin to create teams in the org context

* feat(teams.tsx): working e2e flow for allowing org admin to add new teams

* style(navbar.tsx): clarify switching orgs on UI is in BETA

* fix(organization_endpoints.py): handle getting but not setting members

* test: fix test

* fix(client_initialization_utils.py): revert custom llm provider handling fix - causing unintended issues

* docs(token_auth.md): cleanup docs
2025-02-09 00:07:15 -08:00
Krish Dholakia
e4411e4815
Allow editing model api key + provider on UI (#8406)
* fix(parallel_request_limiter.py): add back parallel request information to max parallel request limiter

Resolves https://github.com/BerriAI/litellm/issues/8392

* test: mark flaky test to handle time based tracking issues

* feat(model_management_endpoints.py): expose new patch `/model/{model_id}/update` endpoint

Allows updating specific values of a model in db - makes it easy for admin to know this by calling it a PATCH

* feat(edit_model_modal.tsx): allow user to update llm provider + api key on the ui

* fix: fix linting error
2025-02-08 23:50:47 -08:00
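
A hedged sketch of calling the new patch endpoint; the payload shape and auth header follow the proxy's usual conventions and are assumptions, not taken from this commit:

    import httpx

    PROXY_BASE = "http://localhost:4000"  # assumed proxy address
    MASTER_KEY = "sk-1234"                # placeholder admin key
    model_id = "<model-id-from-/model/info>"

    # PATCH = partial update: only the fields sent are changed.
    resp = httpx.patch(
        f"{PROXY_BASE}/model/{model_id}/update",
        headers={"Authorization": f"Bearer {MASTER_KEY}"},
        json={"litellm_params": {"custom_llm_provider": "azure", "api_key": "new-key"}},  # payload shape assumed
    )
    resp.raise_for_status()
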
Ishaan Jaff
88093608e3
(Bug Fix) - Bedrock completions with aws_region_name (#8384)
* test_bedrock_completion_with_region_name

* test_bedrock_base_model_helper

* test_bedrock_base_model_helper

* fix aws_bedrock_runtime_endpoint

* test_dynamic_aws_params_propagation

* test_dynamic_aws_params_propagation
2025-02-08 16:33:17 -08:00
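
For context, the aws params under test can be passed per request; a sketch using the documented `aws_region_name` kwarg (model id illustrative):

    import litellm

    # Region supplied per call, rather than via the AWS_REGION_NAME env var.
    response = litellm.completion(
        model="bedrock/anthropic.claude-3-sonnet-20240229-v1:0",
        messages=[{"role": "user", "content": "hi"}],
        aws_region_name="eu-west-1",
    )
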
Ishaan Jaff
12ac414839
(Feat) - Allow calling Nova models on /bedrock/invoke/ (#8397)
* add nova to BEDROCK_INVOKE_PROVIDERS_LITERAL

* BedrockInvokeNovaRequest

* nova + invoke config

* add AmazonInvokeNovaConfig

* AmazonInvokeNovaConfig

* run transform_request for invoke/nova models

* AmazonInvokeNovaConfig

* rename invoke tests

* fix linting error

* TestBedrockInvokeNovaJson

* TestBedrockInvokeNovaJson

* add converse_chunk_parser

* test_nova_invoke_remove_empty_system_messages

* test_nova_invoke_streaming_chunk_parsing
2025-02-08 13:03:05 -08:00
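
Per the PR title, Nova models become callable on the invoke route; a sketch assuming the `bedrock/invoke/<model>` naming used for other invoke models:

    import litellm

    # Route through /bedrock/invoke/ instead of the converse API.
    response = litellm.completion(
        model="bedrock/invoke/amazon.nova-lite-v1:0",  # prefix + model id assumed
        messages=[{"role": "user", "content": "hi"}],
    )
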
Krish Dholakia
c83498f0cd
Litellm dev 02 07 2025 p3 (#8387)
* add back streaming for base o3 (#8361)

* test(base_llm_unit_tests.py): add base test for o-series models - ensure streaming always works

* fix(base_llm_unit_tests.py): fix test for o series models

* refactor: move test

---------

Co-authored-by: Matteo Boschini <12133566+mbosc@users.noreply.github.com>
2025-02-07 22:45:45 -08:00
Ishaan Jaff
b242c66a3b
(Feat) - Add /bedrock/invoke support for all Anthropic models (#8383)
* use anthropic transformation for bedrock/invoke

* use anthropic transforms for bedrock invoke claude

* TestBedrockInvokeClaudeJson

* add AmazonAnthropicClaudeStreamDecoder

* pass bedrock_invoke_provider to make_call

* fix _get_base_bedrock_model

* fix get_bedrock_route

* fix bedrock routing

* fixes for bedrock invoke

* test_all_model_configs

* fix AWSEventStreamDecoder linting

* fix code qa

* test_bedrock_get_base_model

* test_get_model_info_bedrock_models

* test_bedrock_base_model_helper

* test_bedrock_route_detection
2025-02-07 22:41:11 -08:00
Krish Dholakia
1dd3713f1a
Anthropic Citations API Support (#8382)
* test(test_anthropic_completion.py): add test ensuring anthropic structured output response is consistent

Resolves https://github.com/BerriAI/litellm/issues/8291

* feat(anthropic.py): support citations api with new user document message format

Resolves https://github.com/BerriAI/litellm/issues/7970

* fix(anthropic/chat/transformation.py): return citations as a provider-specific-field

Resolves https://github.com/BerriAI/litellm/issues/7970

* feat(anthropic/chat/handler.py): add streaming citations support

Resolves https://github.com/BerriAI/litellm/issues/7970

* fix(handler.py): fix code qa error

* fix(handler.py): only set provider specific fields if non-empty dict

* docs(anthropic.md): add citations api to anthropic docs
2025-02-07 22:27:01 -08:00
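
A sketch of the user document message format this citations support targets, following Anthropic's published citations shape; the `provider_specific_fields` location for returned citations comes from the commit messages above:

    import litellm

    messages = [{
        "role": "user",
        "content": [
            {
                "type": "document",
                "source": {"type": "text", "media_type": "text/plain",
                           "data": "The grass is green. The sky is blue."},
                "citations": {"enabled": True},
            },
            {"type": "text", "text": "What color is the grass?"},
        ],
    }]

    response = litellm.completion(
        model="anthropic/claude-3-5-sonnet-20241022",
        messages=messages,
    )
    # Citations are surfaced as a provider-specific field on the message.
    print(response.choices[0].message.provider_specific_fields)
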
Krish Dholakia
b5850b6b65
Handle azure deepseek reasoning response (#8288) (#8366)
* Handle azure deepseek reasoning response (#8288)

* Handle deepseek reasoning response

* Add helper method + unit test

* Fix: Follow infinity api url format (#8346)

* Follow infinity api url format

* Update test_infinity.py

* fix(infinity/transformation.py): fix linting error

---------

Co-authored-by: vibhavbhat <vibhavb00@gmail.com>
Co-authored-by: Hao Shan <53949959+haoshan98@users.noreply.github.com>
2025-02-07 17:45:51 -08:00
Krish Dholakia
f651d51f26
Litellm dev 02 07 2025 p2 (#8377)
* fix(caching_routes.py): mask redis password on `/cache/ping` route

* fix(caching_routes.py): fix linting error

* fix(caching_routes.py): fix linting error on caching routes

* fix: fix test - ignore mask_dict - has a breakpoint

* fix(azure.py): add timeout param + elapsed time in azure timeout error

* fix(http_handler.py): add elapsed time to http timeout request

makes it easier to debug how long request took before failing
2025-02-07 17:30:38 -08:00
Krish Dholakia
5d170162d3
fix(nvidia_nim/embed.py): add 'dimensions' support (#8302)
* fix(nvidia_nim/embed.py): add 'dimensions' support

Fixes https://github.com/BerriAI/litellm/issues/8238

* fix(proxy_Server.py): initialize router redis cache if setup on proxy

Fixes https://github.com/BerriAI/litellm/issues/6602

* test: add unit testing for new helper function
2025-02-07 16:19:32 -08:00
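
The 'dimensions' param is the standard OpenAI-style embedding size control; a sketch assuming it now passes through for nvidia_nim (model name illustrative):

    import litellm

    response = litellm.embedding(
        model="nvidia_nim/nvidia/nv-embedqa-e5-v5",  # illustrative model name
        input=["hello world"],
        dimensions=512,  # forwarded to the NIM endpoint per this fix
    )
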
Krish Dholakia
6b8b49451f
Fix azure max retries error (#8340)
* fix(azure.py): ensure max_retries=0 is respected

Fixes https://github.com/BerriAI/litellm/issues/6129

* fix(test_openai.py): add unit test to ensure openai sdk calls always respect max_retries = 0

* test(test_azure_openai.py): add unit testing for azure_text/ route

* fix(azure.py): fix passing max retries on streaming

* fix(azure.py): fix azure max retries on async completion + streaming

* fix(completion/handler.py): fix azure text async completion + streaming

* test(test_azure_openai.py): ensure azure openai max retries always respected

* test(test_azure_o_series.py): add testing to ensure max retries always respected

* Added gemini providers for 2.0-flash and 2.0-flash lite (#8321)

* Update model_prices_and_context_window.json

added gemini providers for 2.0-flash and 2.0-flash lite

* Update model_prices_and_context_window.json

fixed URL

---------

Co-authored-by: Krish Dholakia <krrishdholakia@gmail.com>

* Convert tool use arguments to string before counting tokens (#6989)

In at least some cases, `messages["tool_calls"]["function"]["arguments"]` is a dict, not a string. To tokenize it properly it needs to be a string; when it is already a string this is a no-op, which is also fine (see the sketch after this entry).

* build(model_prices_and_context_window.json): add gemini 2.0 flash lite pricing

* build(model_prices_and_context_window.json): add gemini commercial rate limits

* fix(utils.py): fix linting error

* refactor(utils.py): refactor to maintain function size

---------

Co-authored-by: Bardia Khosravi <bardiakhosravi95@gmail.com>
Co-authored-by: Josh Morrow <josh@jcmorrow.com>
2025-02-06 23:20:48 -08:00
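
A minimal sketch of the stringify-before-count step from the tool-use-arguments commit above (#6989); the helper name is a stand-in, not litellm's actual token counter:

    import json

    def _tool_call_args_as_text(arguments) -> str:
        # Providers sometimes hand back arguments as a parsed dict;
        # tokenizers need text, so serialize first. Already-string
        # input passes through unchanged (a no-op).
        if isinstance(arguments, str):
            return arguments
        return json.dumps(arguments)

    assert _tool_call_args_as_text('{"city": "SF"}') == '{"city": "SF"}'
    assert _tool_call_args_as_text({"city": "SF"}) == '{"city": "SF"}'
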
Krish Dholakia
d720744656
Litellm dev 02 06 2025 p3 (#8343)
* feat(handle_jwt.py): initial commit to allow scope based model access

* feat(handle_jwt.py): allow model access based on token scopes

allow admin to control model access from IDP

* test(test_jwt.py): add unit testing for scope based model access

* docs(token_auth.md): add scope based model access to docs

* docs(token_auth.md): update docs

* docs(token_auth.md): update docs

* build: add gemini commercial rate limits

* fix: fix linting error
2025-02-06 23:15:33 -08:00
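
A loose sketch of the idea behind scope-based model access; the scope format and mapping here are hypothetical, not taken from handle_jwt.py:

    # Hypothetical scope -> model mapping; names/format are illustrative only.
    SCOPE_MODEL_MAP = {
        "litellm.api.gpt-4o": ["gpt-4o", "gpt-4o-mini"],
        "litellm.api.claude": ["claude-3-5-sonnet-20241022"],
    }

    def models_allowed_by_scopes(jwt_payload: dict) -> set:
        scopes = jwt_payload.get("scope", "").split()  # OAuth scopes are space-delimited
        allowed = set()
        for scope in scopes:
            allowed.update(SCOPE_MODEL_MAP.get(scope, []))
        return allowed

    assert "gpt-4o" in models_allowed_by_scopes({"scope": "litellm.api.gpt-4o"})
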
Ishaan Jaff
65c91cbbbc
(QA+UI) - e2e flow for adding assembly ai passthrough endpoints (#8337)
* add initial test for assembly ai

* start using PassthroughEndpointRouter

* migrate to llm passthrough endpoints

* add assembly ai as a known provider

* fix PassthroughEndpointRouter

* fix set_pass_through_credentials

* working EU request to assembly ai pass through endpoint

* add e2e test assembly

* test_assemblyai_routes_with_bad_api_key

* clean up pass through endpoint router

* e2e testing for assembly ai pass through

* test assembly ai e2e testing

* delete assembly ai models

* fix code quality

* ui working assembly ai api base flow

* fix install assembly ai

* update model call details with kwargs for pass through logging

* fix tracking assembly ai model in response

* _handle_assemblyai_passthrough_logging

* fix test_initialize_deployment_for_pass_through_unsupported_provider

* TestPassthroughEndpointRouter

* _get_assembly_transcript

* fix assembly ai pt logging tests

* fix assemblyai_proxy_route

* fix _get_assembly_region_from_url
2025-02-06 18:27:54 -08:00
Ishaan Jaff
5dcb87a88b
(bug fix router.py) - safely handle choices=[] on llm responses (#8342)
* test fix test_router_with_empty_choices

* fix _should_raise_content_policy_error
2025-02-06 18:22:08 -08:00
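
The defensive pattern is simple; a sketch of the kind of guard the fix implies (helper name hypothetical):

    from litellm import ModelResponse

    def first_choice_content(response: ModelResponse):
        # Some providers can return choices=[] on filtered or failed
        # generations; indexing [0] would raise IndexError, so guard first.
        if not response.choices:
            return None
        return response.choices[0].message.content
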
Krish Dholakia
f031926b82
fix(utils.py): handle key error in msg validation (#8325)
* fix(utils.py): handle key error in msg validation

* Support running Aim Guard during LLM call (#7918)

* support running Aim Guard during LLM call

* Rename header

* adjust docs and fix type annotations

* fix(timeout.md): doc fix for openai example on dynamic timeouts

---------

Co-authored-by: Tomer Bin <117278227+hxtomer@users.noreply.github.com>
2025-02-06 18:13:46 -08:00
Ishaan Jaff
b535c9bdc0
(Bug Fix - Langfuse) - fix for when model response has choices=[] (#8339)
* refactor _get_langfuse_input_output_content

* test_langfuse_logging_completion_with_malformed_llm_response

* fix _get_langfuse_input_output_content

* fixes for langfuse linting

* unit testing for get chat/text content for langfuse

* fix _should_raise_content_policy_error
2025-02-06 18:02:26 -08:00
Krish Dholakia
b4e5c0de69
Improve rpm check on keys (#8301)
* fix(parallel_request_limiter.py): initial commit that solves the rpm limit check on keys

Fixes https://github.com/BerriAI/litellm/issues/6938

* fix(parallel_request_limiter.py): simpler approach - just increment RPM in pre call hook instead of on success

* fix(parallel_request_limiter.py): pass testing

* fix: fix linting error

* fix(parallel_request_limiter.py): fix parallel request check for keys
2025-02-05 20:23:08 -08:00
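
The second commit describes the approach: count the request at pre-call time rather than on success, so parallel requests can't slip under the limit. A generic sketch, with an in-memory counter standing in for the proxy's cache:

    import time
    from collections import defaultdict

    _rpm_counter = defaultdict(int)

    def pre_call_rpm_check(api_key: str, rpm_limit: int) -> None:
        # Key on api key + current minute window.
        window = f"{api_key}:{int(time.time() // 60)}"
        _rpm_counter[window] += 1  # increment BEFORE the call, not on success
        if _rpm_counter[window] > rpm_limit:
            raise RuntimeError("RPM limit exceeded for key")
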
Krish Dholakia
443ae55904
Azure OpenAI improvements - o3 native streaming, improved tool call + response format handling (#8292)
* fix(convert_dict_to_response.py): only convert if response is the response_format tool call passed in

Fixes https://github.com/BerriAI/litellm/issues/8241

* fix(gpt_transformation.py): makes sure response format / tools conversion doesn't remove previous tool calls

* refactor(gpt_transformation.py): refactor out json schema conversion to base config

keeps logic consistent across providers

* fix(o_series_transformation.py): support o3 mini native streaming

Fixes https://github.com/BerriAI/litellm/issues/8274

* fix(gpt_transformation.py): remove unused variables

* test: update test
2025-02-05 19:38:58 -08:00
Ishaan Jaff
818792228c
(Refactor) - migrate bedrock invoke to BaseLLMHTTPHandler class (#8290)
* initial transform for invoke

* invoke transform_response

* working - able to make request

* working get_complete_url

* working - invoke now runs on llm_http_handler

* fix unused imports

* track litellm overhead ms

* working stream request

* sign_request transform

* sign_request update

* use has_async_custom_stream_wrapper property

* use get_async_custom_stream_wrapper in base llm http handler

* fix make_call in invoke handler

* fix invoke with streaming get_async_custom_stream_wrapper

* working bedrock async streaming with invoke

* fix make call handler for bedrock

* test_all_model_configs

* fix test_bedrock_custom_prompt_template

* sync streaming for bedrock invoke

* fix _add_stream_param_to_request_body

* test_async_text_completion_bedrock

* fix transform_request

* fix get_supported_openai_params

* fix test supports tool choice

* fix test_supports_tool_choice

* add unit test coverage for bedrock invoke transform

* fix location of transformation files

* update import loc

* fix bedrock invoke unit tests

* fix import for max completion tokens
2025-02-05 18:58:55 -08:00
Ishaan Jaff
6cef115bb0
(Security fix) - remove code block that inserts master key hash into DB (#8268)
* remove code block upserting master key hash to db

* run test to check if key upserted into db

* run ci/cd again

* litellm_proxy_security_tests

* litellm_proxy_security_tests

* run prisma entrypoint

* ci/cd run again

* fix test master key not in db
2025-02-05 17:25:42 -08:00
Krish Dholakia
3c813b3a87
Fix deepseek calling - refactor to use base_llm_http_handler (#8266)
* refactor(deepseek/): move deepseek to base llm http handler

Fixes https://github.com/BerriAI/litellm/issues/8128#issuecomment-2635430457

* fix(gpt_transformation.py): support stream parsing for gpt-like calls

* test(test_deepseek_completion.py): add async streaming test

* fix(gpt_transformation.py): fix import

* fix(gpt_transformation.py): return full api base and content type
2025-02-04 22:30:00 -08:00
Krish Dholakia
4e34fc3bf8
[BETA] Support OIDC role based access to proxy (#8260)
* feat(proxy/_types.py): add new jwt field params

allows users + services to auth into proxy

* feat(handle_jwt.py): allow team role proxy access

allows proxy admin to set allowed team roles

* fix(proxy/_types.py): add 'routes' to role based permissions

allow proxy admin to restrict what routes a team can access easily

* feat(handle_jwt.py): support more flexible role based route access

v2 on role based 'allowed_routes'

* test(test_jwt.py): add unit test for rbac for proxy routes

* feat(handle_jwt.py): ensure cost tracking always works for any jwt request with `enforce_rbac=True`

* docs(token_auth.md): add documentation on controlling model access via OIDC Roles

* test: increase time delay before retrying

* test: handle model overloaded for test
2025-02-04 21:59:39 -08:00
Ishaan Jaff
3a6349d871
(Feat) - Add support for structured output on bedrock/nova models + add util litellm.supports_tool_choice (#8264)
* fix supports_tool_choice

* TestBedrockNovaJson

* use supports_tool_choice

* fix supports_tool_choice

* add supports_tool_choice param

* script to add fields to model cost map

* test_supports_tool_choice

* test_supports_tool_choice

* fix supports tool choice check

* test_supports_tool_choice_simple_tests

* fix supports_tool_choice check

* fix supports_tool_choice bedrock

* test_supports_tool_choice

* test_supports_tool_choice

* fix bedrock/eu-west-3/mistral.mistral-large-2402-v1:0

* ci/cd run again

* test_supports_tool_choice_simple_tests

* TestGoogleAIStudioGemini temp - remove to run ci/cd

* test_aaalangfuse_logging_metadata

* TestGoogleAIStudioGemini

* test_check_provider_match

* remove add param to map
2025-02-04 21:47:16 -08:00
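
Usage of the new util is presumably a single lookup against the model cost map; a sketch assuming the signature matches litellm's other `supports_*` helpers:

    import litellm

    model = "bedrock/amazon.nova-lite-v1:0"  # illustrative
    # Gate tool_choice on the model cost map entry added by this PR.
    if litellm.supports_tool_choice(model=model):
        print(f"{model} accepts tool_choice")
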
Ishaan Jaff
06a9829749 all e2e langfuse tests now run on test_langfuse_e2e_test.p 2025-02-04 21:22:51 -08:00
Ishaan Jaff
b965d9bd9a
Fix passing top_k parameter for Bedrock Anthropic models (#8131) (#8269)
* Fix Bedrock Anthropic topK bug

* Remove extra import

* Add unit test + make tests mocked

* Fix camel case

* Fix tests to remove exception handling

Co-authored-by: vibhavbhat <vibhavb00@gmail.com>
2025-02-04 21:16:21 -08:00
Ishaan Jaff
7e1b79d446
(Bug fix) - Langfuse / Callback settings stored in DB (#8251)
* fix _decrypt_and_set_db_env_variables

* fix proxy config

* test callbacks in DB

* test langfuse callbacks in db

* test_e2e_langfuse_callbacks_in_db

* proxy_store_model_in_db_tests

* fix proxy_store_model_in_db_tests

* proxy_store_model_in_db_tests

* fix store_model_db_config.yaml

* fix check_langfuse_request

* fix test langfuse base url

* ci/cd run again
2025-02-04 21:09:37 -08:00
Ishaan Jaff
1d5370b9e6
(feat) - track org_id in SpendLogs (#8253)
* track org id in spend logs

* read org id from team table

* show user_api_key_org_id in spend logs

* test_spend_logs_payload

* test_spend_logs_with_org_id

* test_spend_logs_with_org_id
2025-02-04 21:08:05 -08:00
Krish Dholakia
8d4ad47ec3
fix(prometheus.py): fix setting key budget metrics (#8234)
* fix(prometheus.py): fix setting key budget metrics

ensures custom metadata works with key budget metric

this is a patch. root cause pr is written in a separate branch

* test: fix test
2025-02-04 19:15:50 -08:00
Krish Dholakia
df93debbc7
Internal User Endpoint - vulnerability fix + response type fix (#8228)
* fix(key_management_endpoints.py): fix vulnerability where a user could update another user's keys

Resolves https://github.com/BerriAI/litellm/issues/8031

* test(key_management_endpoints.py): return consistent 403 forbidden error when modifying key that doesn't belong to user

* fix(internal_user_endpoints.py): return model max budget in internal user create response

Fixes https://github.com/BerriAI/litellm/issues/7047

* test: fix test

* test: update test to handle gemini token counter change

* fix(factory.py): fix bedrock http:// handling

* docs: fix typo in lm_studio.md (#8222)

* test: fix testing

* test: fix test

---------

Co-authored-by: foreign-sub <51928805+foreign-sub@users.noreply.github.com>
2025-02-04 06:41:14 -08:00
Krish Dholakia
c17342ac5b
fix(openai/): allows 'reasoning_effort' param to be passed correctly (#8227)
* fix(openai/): allows 'reasoning_effort' param to be passed correctly

Fixes https://github.com/BerriAI/litellm/issues/8217

* test: update test to handle gemini token counter change

* fix(factory.py): fix bedrock http:// handling

* test: fix test

* test: update testing for new openai sdk
2025-02-03 22:39:10 -08:00
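
For reference, the param in question is OpenAI's o-series reasoning knob; a minimal call sketch:

    import litellm

    response = litellm.completion(
        model="o3-mini",
        messages=[{"role": "user", "content": "Prove sqrt(2) is irrational."}],
        reasoning_effort="high",  # "low" | "medium" | "high"
    )
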
Ishaan Jaff
915cc064c5 fix test test_is_assemblyai_route 2025-02-03 21:58:32 -08:00
Ishaan Jaff
8fd60a420d
(Feat) - New pass through add assembly ai passthrough endpoints (#8220)
* add assembly ai pass through request

* fix assembly pass through

* fix test_assemblyai_basic_transcribe

* fix assemblyai auth check

* test_assemblyai_transcribe_with_non_admin_key

* working assembly ai test

* working assembly ai proxy route

* use helper func to pass through logging

* clean up logging assembly ai

* test: update test to handle gemini token counter change

* fix(factory.py): fix bedrock http:// handling

* add unit testing for assembly pt handler

* docs assembly ai pass through endpoint

* fix proxy_pass_through_endpoint_tests

* fix standard_passthrough_logging_object

* fix ASSEMBLYAI_API_KEY

* test test_assemblyai_proxy_route_basic_post

* test_assemblyai_proxy_route_get_transcript

* fix is_assemblyai_route

* test_is_assemblyai_route

---------

Co-authored-by: Krrish Dholakia <krrishdholakia@gmail.com>
2025-02-03 21:54:32 -08:00
Krrish Dholakia
7ddb034b31 test: update test to handle gemini token counter change 2025-02-03 18:12:53 -08:00
Krish Dholakia
c8494abdea
test(base_llm_unit_tests.py): add test to ensure drop params is respe… (#8224)
* test(base_llm_unit_tests.py): add test to ensure drop params is respected

* fix(types/prometheus.py): use typing_extensions for python3.8 compatibility

* build: add cherry picked commits
2025-02-03 16:04:44 -08:00
Krish Dholakia
97b8de17ab
LiteLLM Minor Fixes & Improvements (01/16/2025) - p2 (#7828)
* fix(vertex_ai/gemini/transformation.py): handle 'http://' image urls

* test: add base test for `http:` url's

* fix(factory.py/get_image_details): follow redirects

allows http calls to work

* fix(codestral/): fix stream chunk parsing on last chunk of stream

* Azure ad token provider (#6917)

* Update azure.py

Added optional parameter `azure_ad_token_provider` (see the sketch after this entry)

* Added parameter to main.py

* Found token provider arg location

* Fixed embeddings

* Fixed ad token provider

---------

Co-authored-by: Krish Dholakia <krrishdholakia@gmail.com>

* fix: fix linting errors

* fix(main.py): leave out o1 route for azure ad token provider, for now

get v0 out for sync azure gpt route to begin with

* test: skip http:// test for fireworks ai

model does not support it

* refactor: cleanup dead code

* fix: revert http:// url passthrough for gemini

google ai studio raises errors

* test: fix test

---------

Co-authored-by: bahtman <anton@baht.dk>
2025-02-02 23:17:50 -08:00
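
A sketch of the Entra ID flow the azure ad token provider commit (#6917) enables, using the standard azure-identity helper; the `azure_ad_token_provider` kwarg name is taken from the commit and assumed here:

    import litellm
    from azure.identity import DefaultAzureCredential, get_bearer_token_provider

    token_provider = get_bearer_token_provider(
        DefaultAzureCredential(),
        "https://cognitiveservices.azure.com/.default",
    )

    response = litellm.completion(
        model="azure/my-gpt4-deployment",        # illustrative deployment name
        messages=[{"role": "user", "content": "hi"}],
        azure_ad_token_provider=token_provider,  # replaces a static api_key
        api_base="https://my-endpoint.openai.azure.com",
    )
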
Krish Dholakia
6834c5ecaf
Easier user onboarding via SSO (#8187)
* fix(ui_sso.py): use common `get_user_object` logic across jwt + ui sso auth

Allows finding users by their email, and attaching the sso user id to the user if found

* Improve Team Management flow on UI  (#8204)

* build(teams.tsx): refactor teams page to make it easier to add members to a team

make a row in table clickable -> allows user to add users to team they intended

* build(teams.tsx): make it clear user should click on team id to view team details

simplifies team management by putting team details on separate page

* build(team_info.tsx): separately show user id and user email

make it easy for user to understand the information they're seeing

* build(team_info.tsx): add back in 'add member' button

* build(team_info.tsx): working team member update on team_info.tsx

* build(team_info.tsx): enable team member delete on ui

allow user to delete accidental adds

* build(internal_user_endpoints.py): expose new endpoint for ui to allow filtering on user table

allows proxy admin to quickly find user they're looking for

* feat(team_endpoints.py): expose new team filter endpoint for ui

allows proxy admin to easily find team they're looking for

* feat(user_search_modal.tsx): allow admin to filter on users when adding new user to teams

* test: mark flaky test

* test: mark flaky test

* fix(exception_mapping_utils.py): fix anthropic text route error

* fix(ui_sso.py): handle situation when user not in db
2025-02-02 23:02:33 -08:00
Krish Dholakia
1105e35538
Complete o3 model support (#8183)
* fix(o_series_transformation.py): add 'reasoning_effort' as o series model param

Closes https://github.com/BerriAI/litellm/issues/8182

* fix(main.py): ensure `reasoning_effort` is a mapped openai param

* refactor(azure/): rename o1_[x] files to o_series_[x]

* refactor(base_llm_unit_tests.py): refactor testing for o series reasoning effort

* test(test_azure_o_series.py): have azure o series tests correctly inherit from base o series model tests

* feat(base_utils.py): support translating 'developer' role to 'system' role for non-openai providers

Makes it easy to switch from openai to anthropic

* fix: fix linting errors

* fix(base_llm_unit_tests.py): fix test

* fix(main.py): add missing param
2025-02-02 22:36:37 -08:00
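
The 'developer' role translation means the same message list can be reused across providers; a sketch under that assumption (requires provider keys to actually run):

    import litellm

    messages = [
        {"role": "developer", "content": "Always answer in one sentence."},
        {"role": "user", "content": "What is LiteLLM?"},
    ]

    # OpenAI o-series accepts "developer" natively; for non-openai providers
    # litellm translates it to "system" per this change.
    for model in ["o1", "anthropic/claude-3-5-sonnet-20241022"]:
        litellm.completion(model=model, messages=messages)
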
Krish Dholakia
e4566d7b1c
fix(main.py): fix passing openrouter specific params (#8184)
* fix(main.py): fix passing openrouter specific params

Fixes https://github.com/BerriAI/litellm/issues/8130

* test(test_get_model_info.py): add check for region name w/ cris model

Resolves https://github.com/BerriAI/litellm/issues/8115
2025-02-02 22:23:14 -08:00
Ishaan Jaff
c0f3100934
[Bug Fix] - /vertex_ai/ was not detected as llm_api_route on pass through but vertex-ai was (#8186)
* fix mapped_pass_through_routes

* fix route checks

* update test_is_llm_api_route
2025-02-01 17:26:08 -08:00
Krish Dholakia
9e65f867ab
test: add more unit testing for team member endpoints (#8170)
* test: add more unit testing for team member add

* fix(team_endpoints.py): add validation check to prevent same user from being added to team again

prevents duplicates

* fix(team_endpoints.py): raise error if `/team/member_delete` called on member that's not in team

prevent being able to call delete on same member multiple times

* test: update initial tests

* test: fix test

* test: update test to handle no member duplication
2025-02-01 11:23:00 -08:00
Krish Dholakia
23f458d2da
Improved O3 + Azure O3 support (#8181)
All checks were successful
Read Version from pyproject.toml / read-version (push) Successful in 13s
* fix: support azure o3 model family for fake streaming workaround (#8162)

* fix: support azure o3 model family for fake streaming workaround

* refactor: rename helper to is_o_series_model for clarity

* update function calling parameters for o3 models (#8178)

* refactor(o1_transformation.py): refactor o1 config to be o series config, expand o series model check to o3

ensures max_tokens is correctly translated for o3

* feat(openai/): refactor o1 files to be 'o_series' files

expands naming to cover o3

* fix(azure/chat/o1_handler.py): azure openai is an instance of openai - was causing resets

* test(test_azure_o_series.py): assert stream faked for azure o3 mini

Resolves https://github.com/BerriAI/litellm/pull/8162

* fix(o1_transformation.py): fix o1 transformation logic to handle explicit o1_series routing

* docs(azure.md): update doc with `o_series/` model name

---------

Co-authored-by: byrongrogan <47910641+byrongrogan@users.noreply.github.com>
Co-authored-by: Low Jian Sheng <15527690+lowjiansheng@users.noreply.github.com>
2025-02-01 09:52:28 -08:00
Krish Dholakia
91ed05df29
Litellm dev contributor prs 01 31 2025 (#8168)
* Add O3-Mini for Azure and Remove Vision Support (#8161)

* Azure released O3-mini at the same time as OAI, so I've added support here. Confirmed to work with Sweden Central.

* [FIX] replace cgi with email.Message for Python 3.13, as suggested in PEP 594 (#8160) (see the sketch after this entry)

* Update model_prices_and_context_window.json (#8120)

codestral2501 pricing on vertex_ai

* Fix/db view names (#8119)

* Fix to case sensitive DB Views name

* Fix to case sensitive DB View names

* Added quotes to check query as well

* Added quotes to create view query

* test: handle server error for flaky test

vertex ai has unstable endpoints

---------

Co-authored-by: Wanis Elabbar <70503629+elabbarw@users.noreply.github.com>
Co-authored-by: Honghua Dong <dhh1995@163.com>
Co-authored-by: superpoussin22 <vincent.nadal@orange.fr>
Co-authored-by: Miguel Armenta <37154380+ma-armenta@users.noreply.github.com>
2025-02-01 09:05:20 -08:00
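
For the cgi removal (#8160), PEP 594's suggested replacement for `cgi.parse_header` is the email package; a standalone sketch of that pattern:

    from email.message import Message

    def parse_content_type(header_value: str):
        # email.message.Message replaces the removed cgi.parse_header
        # (the cgi module is gone in Python 3.13, per PEP 594).
        msg = Message()
        msg["content-type"] = header_value
        return msg.get_content_type(), msg.get_param("charset")

    assert parse_content_type("text/html; charset=utf-8") == ("text/html", "utf-8")
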
Krish Dholakia
8d0db8b379
build(schema.prisma): add new sso_user_id to LiteLLM_UserTable (#8167)
* build(schema.prisma): add new `sso_user_id` to LiteLLM_UserTable

easier way to store sso id for existing user

Allows existing user added to team, to login via SSO

* test(test_auth_checks.py): add unit testing for fuzzy user object get

* fix(handle_jwt.py): fix merge conflicts
2025-01-31 23:04:05 -08:00
Krish Dholakia
2147cad307
Litellm dev 01 31 2025 p2 (#8164)
* docs(token_auth.md): clarify title

* refactor(handle_jwt.py): add jwt auth manager + refactor to handle groups

allows user to call model if user belongs to group with model access

* refactor(handle_jwt.py): refactor to first check if service call then check user call

* feat(handle_jwt.py): new `enforce_team_access` param

only allows user to call model if a team they belong to has model access

allows controlling user model access by team

* fix(handle_jwt.py): fix error string, remove unnecessary param

* docs(token_auth.md): add controlling model access for jwt tokens via teams to docs

* test: fix tests post refactor

* fix: fix linting errors

* fix: fix linting error

* test: fix import error
2025-01-31 22:52:35 -08:00
Ishaan Jaff
9ff27809b2
(Feat) add bedrock/deepseek custom import models (#8132)
* add support for using llama spec with bedrock

* fix get_bedrock_invoke_provider

* add support for using bedrock provider in mappings

* working request

* test_bedrock_custom_deepseek

* test_bedrock_custom_deepseek

* fix _get_model_id_for_llama_like_model

* test_bedrock_custom_deepseek

* doc DeepSeek-R1-Distill-Llama-70B

* test_bedrock_custom_deepseek
2025-01-31 18:40:44 -08:00
Ishaan Jaff
2cf0daa31c
(Fixes) OpenAI Streaming Token Counting + Fixes usage track when litellm.turn_off_message_logging=True (#8156)
* working streaming usage tracking

* fix test_async_chat_openai_stream_options

* fix await asyncio.sleep(1)

* test_async_chat_azure

* fix s3 logging

* fix get_stream_options

* fix get_stream_options

* fix streaming handler

* test_stream_token_counting_with_redaction

* fix codeql concern
2025-01-31 15:06:37 -08:00
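
The streaming usage fixes revolve around OpenAI's `stream_options`; a sketch of the call shape whose final chunk carries usage:

    import litellm

    response = litellm.completion(
        model="gpt-4o-mini",
        messages=[{"role": "user", "content": "hi"}],
        stream=True,
        stream_options={"include_usage": True},  # usage arrives on the final chunk
    )
    for chunk in response:
        if getattr(chunk, "usage", None):
            print(chunk.usage)
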
Krish Dholakia
de261e2120
Doc updates + management endpoint fixes (#8138)
* Litellm dev 01 29 2025 p4 (#8107)

* fix(key_management_endpoints.py): always get db team

Fixes https://github.com/BerriAI/litellm/issues/7983

* test(test_key_management.py): add unit test enforcing check_db_only is always true on key generate checks

* test: fix test

* test: skip gemini thinking

* Litellm dev 01 29 2025 p3 (#8106)

* fix(__init__.py): reduces size of __init__.py and reduces scope for errors by using correct param

* refactor(__init__.py): refactor init by cleaning up redundant params

* refactor(__init__.py): move more constants into constants.py

cleanup root

* refactor(__init__.py): more cleanup

* feat(__init__.py): expose new 'disable_hf_tokenizer_download' param

enables hf model usage in an offline env (see the sketch after this entry)

* docs(config_settings.md): document new disable_hf_tokenizer_download param

* fix: fix linting error

* fix: fix unsafe comparison

* test: fix test

* docs(public_teams.md): add doc showing how to expose public teams for users to join

* docs: add beta disclaimer on public teams

* test: update tests
2025-01-30 22:56:41 -08:00
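
The offline-env param from the p3 commit above is presumably a module-level flag; a sketch under that assumption:

    import litellm

    # Avoid network fetches of HuggingFace tokenizers in air-gapped envs;
    # token counting falls back to a locally available tokenizer.
    litellm.disable_hf_tokenizer_download = True
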
Krish Dholakia
69a6da4727
Litellm dev 01 30 2025 p2 (#8134)
* feat(lowest_tpm_rpm_v2.py): fix redis cache check to use >= instead of >

makes it consistent

* test(test_custom_guardrails.py): add more unit testing on default on guardrails

ensure it runs if user sent guardrail list is empty

* docs(quick_start.md): clarify default on guardrails run even if user guardrails list contains other guardrails

* refactor(litellm_logging.py): refactor no-log to helper util

allows for more consistent behavior

* feat(litellm_logging.py): add event hook to verbose logs

* fix(litellm_logging.py): add unit testing to ensure `litellm.disable_no_log_param` is respected

* docs(logging.md): document how to disable 'no-log' param

* test: fix test to handle feb

* test: cleanup old bedrock model

* fix: fix router check
2025-01-30 22:18:53 -08:00