litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-26 11:14:04 +00:00

Author	SHA1	Message	Date
Krish Dholakia	a65bfab697	Fix calling claude via invoke route + response_format support for claude on invoke route (#8908 ) * fix(anthropic_claude3_transformation.py): fix amazon anthropic claude 3 tool calling transformation on invoke route move to using anthropic config as base * fix(utils.py): expose anthropic config via providerconfigmanager * fix(llm_http_handler.py): support json mode on async completion calls * fix(invoke_handler/make_call): support json mode for anthropic called via bedrock invoke * fix(anthropic/): handle 'response_format: {"type": "text"}` + migrate amazon claude 3 invoke config to inherit from anthropic config Prevents error when passing in 'response_format: {"type": "text"} * test: fix test * fix(utils.py): fix base invoke provider check * fix(anthropic_claude3_transformation.py): don't pass 'stream' param * fix: fix linting errors * fix(converse_transformation.py): handle response_format type=text for converse	2025-02-28 17:56:26 -08:00
Krish Dholakia	8f86959c32	Litellm dev 02 27 2025 p6 (#8891) * fix(http_parsing_utils.py): orjson can throw errors on some emoji's in text, default to json.loads * fix(sagemaker/handler.py): support passing model id on async streaming * fix(litellm_pre_call_utils.py): Fixes https://github.com/BerriAI/litellm/issues/7237	2025-02-28 14:34:17 -08:00
Krish Dholakia	887c66c6b7	Show 'user_email' on key table on UI (#8887) * refactor(internal_user_endpoints.py): refactor `/user/list` to accept 'user_ids' and use prisma for db calls enables bulk search from UI * fix(internal_user_endpoints.py): fix linting errors * fix(all_keys_table.tsx): show user email on create key table make it easier for admin to know which key is associated to which user * docs(internal_user_endpoints.py): improve docstring * fix: sync schema with main * fix(columns.tsx): display SSO ID on Internal User Table make it easy to identify what the SSO ID for a user is * fix(columns.tsx): add tooltip to header help user understand what SSO ID means * style: add more tooltips in the management flows make it easier to understand what you're seeing * style(all_keys_table.tsx): replace 'Not Set' with '-' reduces words on table * fix(internal_user_endpoints.py): fix user ids check * test: fix test * fix(internal_user_endpoints.py): maintain returning key count in `/user/list`	2025-02-27 21:56:14 -08:00
Krish Dholakia	740bd7e9ce	(security fix) - Enforce model access restrictions on Azure OpenAI route (#8888 ) * fix(user_api_key_auth.py): Fixes https://github.com/BerriAI/litellm/issues/8780 security fix - enforce model access checks on azure routes * test(test_user_api_key_auth.py): add unit testing * test(test_openai_endpoints.py): add e2e test to ensure azure routes also run through model validation checks	2025-02-27 21:24:58 -08:00
Ishaan Jaff	51590742bd	ui new build	2025-02-27 20:09:44 -08:00
Ishaan Jaff	51a6a219cd	(Improvements) use `/openai/` pass through with OpenAI Ruby for Assistants API (#8884 ) * add ruby assistants testing * _join_url_paths * run ruby tests on ci/cd * TestBaseOpenAIPassThroughHandler * _join_url_paths * fix _join_url_paths * Install Ruby and Bundler * Install Ruby and Bundler	2025-02-27 20:01:16 -08:00
Ishaan Jaff	378e3d9e4d	(Proxy improvement) - Raise `BadRequestError` when unknown model passed in request (#8886 ) * fix safe access model in request body * litellm.BadRequestError * don't pass model in request body * test_chat_completion_bad_model	2025-02-27 19:30:57 -08:00
Krish Dholakia	2d2d1b9df5	Add `created_by` and `updated_by` fields to Keys table (#8885 ) * fix(proxy/_types.py): return created_by and updated_by on /key/list enables better trail of who made a key * fix(all_keys_table.tsx): add created by to key table allows easier tracking of who generated the key * fix(key_management_endpoints.py): track 'created_by' and 'updated_by' fields enable easier tracking of who created proxy keys	2025-02-27 18:12:58 -08:00
Krish Dholakia	91cdc01149	Allow team/org filters to be searchable on the Create Key Page (#8881 ) * fix(filtercomponent): always show apply filters button fix hiding behavior * style(create_key_button.tsx): style improvements on create key modal remove the numbering * feat(filter.tsx): allow searching team by team alias * style(filter.tsx): style improvements to ensure dropdown + custom value works as expected * style(filter.tsx): add explicit button allowing reset filters * fix(filter.tsx): fix linting error * feat(all_keys_table.tsx): show team alias on keys table * style(all_keys_table.tsx): enforce length constraints on table make it easier to see all columns	2025-02-27 18:11:03 -08:00
Ishaan Jaff	1e7b9cf767	(fix) Pass through spend tracking - ensure `custom_llm_provider` is tracked for Vertex, Google AI Studio, Anthropic (#8882 ) * fix track custom llm provider on pass through routes * fix use correct provider for google ai studio * fix tracking custom llm provider on pass through route * ui fix get provider logo * update tests to track custom llm provider * test_anthropic_streaming_with_headers * Potential fix for code scanning alert no. 2263: Incomplete URL substring sanitization Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> --------- Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>	2025-02-27 17:09:43 -08:00
Ishaan Jaff	047d1b1208	(Bug Fix) - Accurate token counting for `/anthropic/` API Routes on LiteLLM Proxy (#8880 ) * fix _create_anthropic_response_logging_payload * fix - pass through don't create standard logging payload * fix logged key hash * test_init_kwargs_for_pass_through_endpoint_basic * test_unit_test_anthropic_pass_through * fix anthropic pass through logging handler * test_stream_token_counting_anthropic_with_include_usage * convert_str_chunk_to_generic_chunk * _build_complete_streaming_response * test_anthropic_basic_completion_with_headers * test_anthropic_streaming_with_headers * improve test for pass through token counting	2025-02-27 15:43:03 -08:00
Ishaan Jaff	24df2331ec	(fix) Anthropic pass through cost tracking (#8874 ) * fix _create_anthropic_response_logging_payload * fix - pass through don't create standard logging payload * fix logged key hash * test_init_kwargs_for_pass_through_endpoint_basic * test_unit_test_anthropic_pass_through * fix anthropic pass through logging handler	2025-02-27 15:42:43 -08:00
Krrish Dholakia	88ef3b41b6	docs(bedrock.md): cleanup doc	2025-02-27 12:35:03 -08:00
Krrish Dholakia	5b804e5d9b	fix(main.py): pass 'thinking' param on async completion call	2025-02-26 23:16:39 -08:00
Krish Dholakia	88eedb22b9	vertex ai anthropic thinking param support (#8853 ) * fix(vertex_llm_base.py): handle credentials passed in as dictionary * fix(router.py): support vertex credentials as json dict * test(test_vertex.py): allows easier testing mock anthropic thinking response for vertex ai * test(vertex_ai_partner_models/): don't remove "@" from model breaks anthropic cost calculation * test: move testing * fix: fix linting error * fix: fix linting error * fix(vertex_ai_partner_models/main.py): split @ for codestral model * test: fix test * fix: fix stripping "@" on mistral models * fix: fix test * test: fix test	2025-02-26 21:37:18 -08:00
Krish Dholakia	ab7c4d1a0e	Litellm dev bedrock anthropic 3 7 v2 (#8843 ) * feat(bedrock/converse/transformation.py): support claude-3-7-sonnet reasoning_Content transformation Closes https://github.com/BerriAI/litellm/issues/8777 * fix(bedrock/): support returning `reasoning_content` on streaming for claude-3-7 Resolves https://github.com/BerriAI/litellm/issues/8777 * feat(bedrock/): unify converse reasoning content blocks for consistency across anthropic and bedrock * fix(anthropic/chat/transformation.py): handle deepseek-style 'reasoning_content' extraction within transformation.py simpler logic * feat(bedrock/): fix streaming to return blocks in consistent format * fix: fix linting error * test: fix test * feat(factory.py): fix bedrock thinking block translation on tool calling allows passing the thinking blocks back to bedrock for tool calling * fix(types/utils.py): don't exclude provider_specific_fields on model dump ensures consistent responses * fix: fix linting errors * fix(convert_dict_to_response.py): pass reasoning_content on root * fix: test * fix(streaming_handler.py): add helper util for setting model id * fix(streaming_handler.py): fix setting model id on model response stream chunk * fix(streaming_handler.py): fix linting error * fix(streaming_handler.py): fix linting error * fix(types/utils.py): add provider_specific_fields to model stream response * fix(streaming_handler.py): copy provider specific fields and add them to the root of the streaming response * fix(streaming_handler.py): fix check * fix: fix test * fix(types/utils.py): ensure messages content is always openai compatible * fix(types/utils.py): fix delta object to always be openai compatible only introduce new params if variable exists * test: fix bedrock nova tests * test: skip flaky test * test: skip flaky test in ci/cd	2025-02-26 16:05:33 -08:00
Krrish Dholakia	fcf4ea3608	build: merge squashed commit Squashed commit of the following: commit `6678e15381` Author: Ishaan Jaff <ishaanjaffer0324@gmail.com> Date: Wed Feb 26 09:29:15 2025 -0800 test_prompt_caching commit `bd86e0ac47` Author: Ishaan Jaff <ishaanjaffer0324@gmail.com> Date: Wed Feb 26 08:57:16 2025 -0800 test_prompt_caching commit `2fc21ad51e` Author: Ishaan Jaff <ishaanjaffer0324@gmail.com> Date: Wed Feb 26 08:13:45 2025 -0800 test_aprompt_caching commit `d94cff55ff` Author: Ishaan Jaff <ishaanjaffer0324@gmail.com> Date: Wed Feb 26 08:13:12 2025 -0800 test_prompt_caching commit `49c5e7811e` Author: Ishaan Jaff <ishaanjaffer0324@gmail.com> Date: Wed Feb 26 07:43:53 2025 -0800 ui new build commit `cb8d5e5917` Author: Ishaan Jaff <ishaanjaffer0324@gmail.com> Date: Wed Feb 26 07:38:56 2025 -0800 (UI) - Create Key flow for existing users (#8844) * working create user button * working create user for a key flow * allow searching users * working create user + key * use clear sections on create key * better search for users * fix create key * ui fix create key button - make it neater / cleaner * ui fix all keys table commit `335ba30467` Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Wed Feb 26 08:53:17 2025 -0800 fix: fix file name commit `b8c5b31a4e` Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Tue Feb 25 22:54:46 2025 -0800 fix: fix utils commit `ac6e503461` Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Mon Feb 24 10:43:31 2025 -0800 fix(main.py): fix openai message for assistant msg if role is missing - openai allows this Fixes https://github.com/BerriAI/litellm/issues/8661 commit `de3989dbc5` Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Mon Feb 24 21:19:25 2025 -0800 fix(get_litellm_params.py): handle no-log being passed in via kwargs Fixes https://github.com/BerriAI/litellm/issues/8380	2025-02-26 09:39:27 -08:00
Ishaan Jaff	54f4e35a58	ui new build	2025-02-26 07:43:53 -08:00
Ishaan Jaff	14d94dca12	ui new build	2025-02-25 20:03:03 -08:00
Ishaan Jaff	7021f2f244	(Bug fix) dd-trace used by default on litellm proxy (#8817 ) * fix _should_use_dd_tracer * fix _should_use_dd_tracer * _should_use_dd_tracer * _should_use_dd_tracer * _should_use_dd_tracer * _init_dd_tracer * _should_use_dd_tracer * fix should use dd-tracer * fix dd tracer	2025-02-25 19:54:22 -08:00
Ishaan Jaff	81039d8faf	(Bug fix) - allow using Assistants GET, DELETE on `/openai` pass through routes (#8818 ) * test_openai_assistants_e2e_operations * test openai assistants pass through * fix GET request on pass through handler * _make_non_streaming_http_request * _is_assistants_api_request * test_openai_assistants_e2e_operations * test_openai_assistants_e2e_operations * openai_proxy_route * docs openai pass through * docs openai pass through * docs openai pass through * test pass through handler * Potential fix for code scanning alert no. 2240: Incomplete URL substring sanitization Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> --------- Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>	2025-02-25 19:19:00 -08:00
Krish Dholakia	b829475587	Litellm dev 02 25 2025 p1 (#8816 ) * build(model_prices_and_context_window.json): add bedrock cross-region inferencing model information Closes https://github.com/BerriAI/litellm/issues/8801#issuecomment-2683438528 * build(model_prices_and_context_window.json): add claude sonnet `-latest` models to model cost map Closes https://github.com/BerriAI/litellm/discussions/8770#discussioncomment-12318880 * build(model_prices_and_context_window.json): add remaining anthropic `-latest` models to model cost map Closes https://github.com/BerriAI/litellm/discussions/8770#discussioncomment-12318880 * test: update test with new model	2025-02-25 15:20:39 -08:00
Ishaan Jaff	d963568970	(Bug fix) - running litellm proxy on wndows (#8735 ) * fix running litellm on windows * fix importing litellm * _init_hypercorn_server * linting fix * TestProxyInitializationHelpers * ci/cd run again * ci/cd run again	2025-02-25 15:19:19 -08:00
Ishaan Jaff	c0aec0cc5d	(Bug fix) - reading /parsing request body when on hypercorn (#8734 ) * _safe_get_request_parsed_body * use scope on hypercorn * test http parsing utils * ci/cd run again	2025-02-25 15:18:04 -08:00
Ishaan Jaff	b6d6e270b4	can_team_access_model	2025-02-25 14:51:57 -08:00
Ishaan Jaff	eeee61db65	can_team_access_model	2025-02-25 14:50:10 -08:00
Ishaan Jaff	e271f38356	ui new build	2025-02-24 23:51:00 -08:00
Ishaan Jaff	f8e43296fb	(UI) Fixes for managing Internal Users (#8786 ) * allow bulk adding internal users * allow sorting users by created at * cleanup * clean up user table * show total num users * show per user error when bulk adding users * fix - don't allow creating duplicate internal users in DB * ui flow fix for bulk adding users * allow adding user in multiple teams with models * correctly extract info * working invitation link * fix fill in table after bulk add * fix the results from creating new users in bulkd * bulk invite users * fix view user flow * fix ui type errors * fix type errors * fix type errors	2025-02-24 23:40:13 -08:00
Krish Dholakia	142b195784	Add anthropic thinking + reasoning content support (#8778 ) * feat(anthropic/chat/transformation.py): add anthropic thinking param support * feat(anthropic/chat/transformation.py): support returning thinking content for anthropic on streaming responses * feat(anthropic/chat/transformation.py): return list of thinking blocks (include block signature) allows usage in tool call responses * fix(types/utils.py): extract and map reasoning_content from anthropic as content str * test: add testing to ensure thinking_blocks are returned at the root * fix(anthropic/chat/handler.py): return thinking blocks on streaming - include signature * feat(factory.py): handle anthropic thinking blocks translation if in assistant response * test: handle openai internal instability * test: handle openai audio instability * ci: pin anthropic dep * test: handle openai audio instability * fix: fix linting error * refactor(anthropic/chat/transformation.py): refactor function to remain <50 LOC * fix: fix linting error * fix: fix linting error * fix: fix linting error * fix: fix linting error	2025-02-24 21:54:30 -08:00
Krish Dholakia	566d9354aa	fix(proxy/_types.py): fixes issue where internal user able to escalat… (#8740) * fix(proxy/_types.py): fixes issue where internal user able to escalate their role with ui key Fixes https://github.com/BerriAI/litellm/issues/8029 * style: cleanup * test: handle bedrock instability	2025-02-22 22:59:58 -08:00
Krish Dholakia	09462ba80c	Add cohere v2/rerank support (#8421 ) (#8605 ) * Add cohere v2/rerank support (#8421) * Support v2 endpoint cohere rerank * Add tests and docs * Make v1 default if old params used * Update docs * Update docs pt 2 * Update tests * Add e2e test * Clean up code * Use inheritence for new config * Fix linting issues (#8608) * Fix cohere v2 failing test + linting (#8672) * Fix test and unused imports * Fix tests * fix: fix linting errors * test: handle tgai instability * fix: skip service unavailable err * test: print logs for unstable test * test: skip unreliable tests --------- Co-authored-by: vibhavbhat <vibhavb00@gmail.com>	2025-02-22 22:25:29 -08:00
Krish Dholakia	c2aec21b4d	fix(amazon_deepseek_transformation.py): remove </think> from stream o… (#8717 ) * fix(amazon_deepseek_transformation.py): remove </think> from stream output - cleanup user facing stream * fix(key_managenet_endpoints.py): return `/key/list` sorted by created_at makes it easier to see created key * style: cleanup team table * feat(key_edit_view.tsx): support setting model specific tpm/rpm limits on keys	2025-02-22 21:46:55 -08:00
Ishaan Jaff	e67d72f660	ui new build	2025-02-21 19:31:59 -08:00
Ishaan Jaff	d23d4305fb	ui new build	2025-02-20 18:40:21 -08:00
Ishaan Jaff	55b938dd6e	(Infra/DB) - Allow running older litellm version when out of sync with current state of DB (#8695 ) * fix check migration * clean up should_update_prisma_schema * update test * db_migration_disable_update_check * Check container logs for expected message * db_migration_disable_update_check * test_check_migration_out_of_sync * test_should_update_prisma_schema * db_migration_disable_update_check * pip install aiohttp	2025-02-20 18:30:23 -08:00
Ishaan Jaff	300d7825f5	(Observability) - Add more detailed dd tracing on Proxy Auth, Bedrock Auth (#8693 ) * add dd tracer * fix dd tracing * add @tracer.wrap() on def user_api_key_auth * add async_function_with_retries * remove dead code * add tracer.wrap on base aws llm * add tracer.wrap on base aws llm * fix print verbose * fix dd tracing * trace base aws llm * fix test base aws llm * fix converse transform * test base aws llm * BASE_AWS_LLM_PATH * BASE_AWS_LLM_PATH * test dd tracing	2025-02-20 18:00:41 -08:00
Ishaan Jaff	ccfbb77b73	(Redis fix) - use mget_non_atomic (#8682 ) * use mget_nonatomic * redis cluster override MGET op * fix redis cluster + MGET * test redis cluster	2025-02-20 17:51:31 -08:00
Ishaan Jaff	bb6f43d12e	(Bug fix) - Cache Health not working when configured with prometheus service logger (#8687 ) * fix serialize on safe json dumps * test_non_standard_dict_keys_complex * ui fix HealthCheckCacheParams * fix HealthCheckCacheParams * fix code qa * test_cache_ping_failure * test_cache_ping_health_check_includes_only_cache_attributes * test_cache_ping_health_check_includes_only_cache_attributes	2025-02-20 13:41:56 -08:00
Krish Dholakia	982ee4b96b	LiteLLM Contributor PRs (02/18/2025). (#8643 ) * fix: prisma migration script logging #6991 (#8617) * fix: prisma migration script logging #6991 * chore: refactor usng proxy logger * remove noqa and move import below abs path * Fix(litellm-vetexai-gemini): adding the supported files as per gemini documentation (#8559) * bugfix(litellm-vetexai-gemini): adding the supported files as per gemini documentations * fix(files.py): correct file type constant from 'text' to 'TXT' * test: handle internal server error --------- Co-authored-by: Justin Law <81255462+justinthelaw@users.noreply.github.com> Co-authored-by: alymedhat10 <48028013+alymedhat10@users.noreply.github.com>	2025-02-19 21:52:46 -08:00
Krish Dholakia	cc77138b37	Add all `/key/generate` api params to UI + add metadata fields on team AND org add/update (#8667 ) * feat(create_key_button.tsx): initial commit using openapi.json to ensure all values via api are supported on ui for `/key/generate` Closes https://github.com/BerriAI/litellm/issues/7763 * style(create_key_button.tsx): put openapi settings inside 'advanced setting' accordion * fix(check_openapi_schema.tsx): style improvements for advanced settings * style(create_key_button.tsx): add tooltip explaining what the settings mean * fix(team_info.tsx): render metadata field on team update allow updating a team's metadata * fix(networking.tsx): add 'metadata' field to create team form * refactor: cleanup dead codeblock * fix(organization_endpoints.py): fix metadata param support on `/organization/new` * feat(organization_endpoints.py): support updating metadata for organization on api + ui * test: mark flaky test	2025-02-19 21:13:06 -08:00
Krrish Dholakia	9470f57e86	build: extract <think>..</think> block for amazon deepseek r1 and put in reasoning_content	2025-02-19 21:10:38 -08:00
Ishaan Jaff	1460a79ef9	ui new build	2025-02-19 20:09:51 -08:00
Ishaan Jaff	fff15543d9	(UI + Proxy) Cache Health Check Page - Cleanup/Improvements (#8665) * fixes for redis cache ping serialization * fix cache ping check * fix cache health check ui * working error details on ui * ui expand / collapse error * move cache health check to diff file * fix displaying error from cache health check * ui allow copying errors * ui cache health fixes * show redis details * clean up cache health page * ui polish fixes * fix error handling on cache health page * fix redis_cache_params on cache ping response * error handling * cache health ping response * fx error response from cache ping * parsedLitellmParams * fix cache health check * fix cache health page * cache safely handle json dumps issues * test caching routes * test_primitive_types * fix caching routes * litellm_mapped_tests * fix pytest-mock * fix _serialize * fix linting on safe dumps * test_default_max_depth * pip install "pytest-mock==3.12.0" * litellm_mapped_tests_coverage * add readme on new litellm test dir	2025-02-19 19:08:50 -08:00
Krrish Dholakia	39db3147e8	fix(spend_tracking_utils.py): move info to debug	2025-02-19 15:36:32 -08:00
Krish Dholakia	0ff56504d7	build: build ui (#8654 )	2025-02-18 22:56:01 -08:00
Krish Dholakia	0319559295	fix(team_endpoints.py): allow team member to view team info (#8644 ) * fix(team_endpoints.py): allow team member to view team info * test: handle model overloaded in tool calling test * test: handle internal server error	2025-02-18 22:28:57 -08:00
Ishaan Jaff	6cae24ed08	ui new build	2025-02-18 21:26:06 -08:00
Ishaan Jaff	889feb2ea8	patch on LiteLLM_AuditLogs	2025-02-18 21:13:14 -08:00
Ishaan Jaff	b88762b63c	(Polish/Fixes) - Fixes for Adding Team Specific Models (#8645 ) * refactor get model info for team models * allow adding a model to a team when creating team specific model * ui update selected Team on Team Dropdown * test_team_model_association * testing for team specific models * test_get_team_specific_model * test: skip on internal server error * remove model alias card on teams page * linting fix _get_team_specific_model * fix DeploymentTypedDict * fix linting error * fix code quality * fix model info checks --------- Co-authored-by: Krrish Dholakia <krrishdholakia@gmail.com>	2025-02-18 21:11:57 -08:00
Krish Dholakia	2b7755f8d8	Litellm dev 02 18 2025 p3 (#8640) * fix(team_endpoints.py): cleanup user <-> team association on team delete Fixes issue where user table still listed team membership post delete * test(test_team.py): update e2e test - ensure user/team membership is deleted on team delete * fix(base_invoke_transformation.py): fix deepseek r1 transformation remove deepseek name from model url * test(test_completion.py): assert model route not in url * feat(base_invoke_transformation.py): infer region name from model arn prevent errors due to different region name in env var vs. model arn, respect if explicitly set in call though * test: fix test * test: skip on internal server error	2025-02-18 19:14:20 -08:00


4695 commits