Commit graph

694 commits

Author SHA1 Message Date
Ishaan Jaff
9fdad1f133
Merge b1eca590d7 into b82af5b826 2025-04-24 01:01:43 -07:00
Ishaan Jaff
0c2f705417
[Feat] Add Responses API - Routing Affinity logic for sessions (#10193)
* test for test_responses_api_routing_with_previous_response_id

* test_responses_api_routing_with_previous_response_id

* add ResponsesApiDeploymentCheck

* ResponsesApiDeploymentCheck

* ResponsesApiDeploymentCheck

* fix ResponsesApiDeploymentCheck

* test_responses_api_routing_with_previous_response_id

* ResponsesApiDeploymentCheck

* test_responses_api_deployment_check.py

* docs routing affinity

* simplify ResponsesApiDeploymentCheck

* test response id

* fix code quality check
2025-04-21 20:00:27 -07:00
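
A minimal sketch of the session-affinity idea described in the entry above, assuming hypothetical method names (`record`, `filter_deployments`) rather than the actual `ResponsesApiDeploymentCheck` interface: remember which deployment produced each response id, and pin follow-up requests carrying `previous_response_id` to that deployment.

```python
# Hypothetical sketch of Responses API session affinity: pin follow-ups
# that carry previous_response_id to the deployment that served them.
class ResponsesApiDeploymentCheck:
    def __init__(self) -> None:
        self._response_to_deployment: dict[str, str] = {}

    def record(self, response_id: str, deployment_id: str) -> None:
        # called after a successful response, before returning to the client
        self._response_to_deployment[response_id] = deployment_id

    def filter_deployments(
        self, deployments: list[dict], previous_response_id: str | None
    ) -> list[dict]:
        if previous_response_id is None:
            return deployments  # no session yet, any deployment is fine
        pinned = self._response_to_deployment.get(previous_response_id)
        if pinned is None:
            return deployments  # unknown id, fall back to normal routing
        matches = [
            d for d in deployments
            if d.get("model_info", {}).get("id") == pinned
        ]
        return matches or deployments  # pinned deployment gone: route normally
```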
Krish Dholakia
0c3b7bb37d
fix(router.py): handle edge case where user sets 'model_group' inside… (#10191)
* fix(router.py): handle edge case where user sets 'model_group' inside 'model_info'

* fix(key_management_endpoints.py): security fix - return hashed token in 'token' field

Ensures that when a key is created on the UI, only the hashed token is shown

* test(test_key_management_endpoints.py): add unit test

* test: update test
2025-04-21 16:17:45 -07:00
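
The security fix above boils down to returning a one-way hash of the key instead of the raw secret. A sketch of that idea, assuming a SHA-256 hex digest (LiteLLM's actual hashing helper may differ):

```python
import hashlib

def hash_token(token: str) -> str:
    # Return only a one-way hash of the key in the 'token' field, so the
    # UI gets a stable identifier without ever re-exposing the secret.
    return hashlib.sha256(token.encode("utf-8")).hexdigest()
```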
Krish Dholakia
ce828408da
fix(proxy_server.py): pass llm router to get complete model list (#10176)
allows model auth to work
2025-04-19 22:27:49 -07:00
Ishaan Jaff
b0024bb229
[Bug Fix] Spend Tracking Bug Fix, don't modify in memory default litellm params (#10167)
* _update_kwargs_with_default_litellm_params

* test_update_kwargs_does_not_mutate_defaults_and_merges_metadata
2025-04-19 14:13:59 -07:00
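
The bug class fixed here is mutation of shared defaults. A sketch of the non-mutating merge the commit describes, with a hypothetical function name mirroring `_update_kwargs_with_default_litellm_params`:

```python
import copy

def update_kwargs_with_default_litellm_params(
    kwargs: dict, default_litellm_params: dict
) -> dict:
    # Merge per-request kwargs on top of a *copy* of the defaults, so the
    # shared in-memory defaults are never mutated between requests.
    merged = copy.deepcopy(default_litellm_params)
    # nested metadata should merge, not overwrite
    metadata = {**merged.get("metadata", {}), **kwargs.get("metadata", {})}
    merged.update(kwargs)
    merged["metadata"] = metadata
    return merged
```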
Krish Dholakia
00e49380df
Litellm UI qa 04 12 2025 p1 (#9955)
* fix(model_info_view.tsx): cleanup text

* fix(key_management_endpoints.py): fix filtering litellm-dashboard keys for internal users

* fix(proxy_track_cost_callback.py): prevent flooding spend logs with admin endpoint errors

* test: add unit testing for logic

* test(test_auth_exception_handler.py): add more unit testing

* fix(router.py): correctly handle retrieving model info on get_model_group_info

fixes issue where model hub was showing None prices

* fix: fix linting errors
2025-04-12 19:30:48 -07:00
Krish Dholakia
3ca82c22b6
Support CRUD endpoints for Managed Files (#9924)
* fix(openai.py): ensure openai file object shows up on logs

* fix(managed_files.py): return unified file id as b64 str

allows retrieving a file by id to work as expected

* fix(managed_files.py): apply decoded file id transformation

* fix: add unit test for file id + decode logic

* fix: initial commit for litellm_proxy support with CRUD Endpoints

* fix(managed_files.py): support retrieve file operation

* fix(managed_files.py): support for DELETE endpoint for files

* fix(managed_files.py): retrieve file content support

supports retrieve file content api from openai

* fix: fix linting error

* test: update tests

* fix: fix linting error

* fix(files/main.py): pass litellm params to azure route

* test: fix test
2025-04-11 21:48:27 -07:00
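
A sketch of the "unified file id as b64 str" mechanism the commit above describes, under the assumption that the unified id packs per-provider file ids into one opaque token (field layout here is illustrative, not LiteLLM's actual encoding):

```python
import base64
import json

def encode_unified_file_id(provider_file_ids: dict[str, str]) -> str:
    # Pack per-provider file ids into one URL-safe token the proxy can
    # hand back as a single "unified" file id.
    raw = json.dumps(provider_file_ids).encode("utf-8")
    return base64.urlsafe_b64encode(raw).decode("ascii")

def decode_unified_file_id(unified_id: str) -> dict[str, str]:
    # Retrieve/delete endpoints decode the token to find the real
    # provider-side id for the deployment handling the request.
    return json.loads(base64.urlsafe_b64decode(unified_id.encode("ascii")))

# e.g. one unified id covering an OpenAI and an Azure copy of the file:
unified = encode_unified_file_id({"openai": "file-abc", "azure": "assistant-xyz"})
assert decode_unified_file_id(unified)["openai"] == "file-abc"
```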
Ishaan Jaff
f9ce754817
[Feat] Add litellm.supports_reasoning() util to track if an llm supports reasoning (#9923)
* add supports_reasoning for xai models

* add "supports_reasoning": true for o1 series models

* add supports_reasoning util

* add litellm.supports_reasoning

* add supports reasoning for claude 3-7 models

* add deepseek as supports reasoning

* test_supports_reasoning

* add supports reasoning to model group info

* add supports_reasoning

* docs supports reasoning

* fix supports_reasoning test

* "supports_reasoning": false,

* fix test

* supports_reasoning
2025-04-11 17:56:04 -07:00
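
A usage sketch of the `litellm.supports_reasoning()` util added above, assuming the `reasoning_effort` completion parameter as the thing being gated:

```python
import litellm

kwargs = {}
# Only send reasoning-related params to models whose cost-map entry
# carries "supports_reasoning": true.
if litellm.supports_reasoning(model="anthropic/claude-3-7-sonnet-20250219"):
    kwargs["reasoning_effort"] = "low"

response = litellm.completion(
    model="anthropic/claude-3-7-sonnet-20250219",
    messages=[{"role": "user", "content": "Why is the sky blue?"}],
    **kwargs,
)
```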
Krish Dholakia
0415f1205e
Litellm dev 04 10 2025 p3 (#9903)
* feat(managed_files.py): encode file type in unified file id

simplify calling gemini models

* fix(common_utils.py): fix extracting file type from unified file id

* fix(litellm_logging.py): create standard logging payload for create file call

* fix: fix linting error
2025-04-11 09:29:42 -07:00
Krish Dholakia
9f27e8363f
Realtime API: Support 'base_model' cost tracking + show response in spend logs (if enabled) (#9897)
* refactor(litellm_logging.py): refactor realtime cost tracking to use common code as rest

Ensures basic features like base model just work

* feat(realtime/): support 'base_model' cost tracking on realtime api

Fixes issue where base model was not working on realtime

* fix: fix ruff linting error

* test: fix test
2025-04-10 21:24:45 -07:00
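
The core of 'base_model' cost tracking is a fallback in the cost lookup. A sketch, assuming `base_model` lives under the deployment's `model_info` as it does in proxy configs:

```python
def model_for_cost_tracking(model: str, litellm_params: dict) -> str:
    # If the deployment maps a custom name to a known base model
    # (e.g. "my-realtime" -> "gpt-4o-realtime-preview"), bill against
    # the base model so the cost-map lookup succeeds.
    return litellm_params.get("model_info", {}).get("base_model") or model
```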
Krish Dholakia
0dbd663877
fix(cost_calculator.py): handle custom pricing at deployment level fo… (#9855)
* fix(cost_calculator.py): handle custom pricing at deployment level for router

* test: add unit tests

* fix(router.py): show custom pricing on UI

check correct model str

* fix: fix linting error

* docs(custom_pricing.md): clarify custom pricing for proxy

Fixes https://github.com/BerriAI/litellm/issues/8573#issuecomment-2790420740

* test: update code qa test

* fix: cleanup traceback

* fix: handle litellm param custom pricing

* test: update test

* fix(cost_calculator.py): add router model id to list of potential model names

* fix(cost_calculator.py): fix router model id check

* fix: router.py - maintain older model registry approach

* fix: fix ruff check

* fix(router.py): router get deployment info

add custom values to mapped dict

* test: update test

* fix(utils.py): update only if value is non-null

* test: add unit test
2025-04-09 22:13:10 -07:00
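
Deployment-level custom pricing reduces to per-token rates configured on the deployment overriding the global model cost map. A sketch of that arithmetic, using the `input_cost_per_token` / `output_cost_per_token` litellm params:

```python
def deployment_cost(usage: dict, litellm_params: dict) -> float:
    # Per-token rates set on the router deployment take precedence over
    # the global model cost map.
    input_rate = litellm_params.get("input_cost_per_token", 0.0)
    output_rate = litellm_params.get("output_cost_per_token", 0.0)
    return (
        usage.get("prompt_tokens", 0) * input_rate
        + usage.get("completion_tokens", 0) * output_rate
    )

# e.g. 1000 prompt + 200 completion tokens at $2/M in, $8/M out:
cost = deployment_cost(
    {"prompt_tokens": 1000, "completion_tokens": 200},
    {"input_cost_per_token": 2e-06, "output_cost_per_token": 8e-06},
)  # -> 0.0036
```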
Krish Dholakia
c555c15ad7
fix(router.py): support reusable credentials via passthrough router (#9758)
* fix(router.py): support reusable credentials via passthrough router

enables reusable vertex credentials to be used in passthrough

* test: fix test

* test(test_router_adding_deployments.py): add unit testing
2025-04-04 18:40:14 -07:00
Krish Dholakia
6dda1ba6dd
LiteLLM Minor Fixes & Improvements (04/02/2025) (#9725)
* Add date picker to usage tab + Add reasoning_content token tracking across all providers on streaming (#9722)

* feat(new_usage.tsx): add date picker for new usage tab

allow user to look back on their usage data

* feat(anthropic/chat/transformation.py): report reasoning tokens in completion token details

allows usage tracking on how many reasoning tokens are actually being used

* feat(streaming_chunk_builder.py): return reasoning_tokens in anthropic/openai streaming response

allows tracking reasoning_token usage across providers

* Fix update team metadata + fix bulk adding models on Ui  (#9721)

* fix(handle_add_model_submit.tsx): fix bulk adding models

* fix(team_info.tsx): fix team metadata update

Fixes https://github.com/BerriAI/litellm/issues/9689

* (v0) Unified file id - allow calling multiple providers with same file id (#9718)

* feat(files_endpoints.py): initial commit adding 'target_model_names' support

allow developer to specify all the models they want to call with the file

* feat(files_endpoints.py): return unified files endpoint

* test(test_files_endpoints.py): add validation test - if invalid purpose submitted

* feat: more updates

* feat: initial working commit of unified file id translation

* fix: additional fixes

* fix(router.py): remove model replace logic in jsonl on acreate_file

enables file upload to work for chat completion requests as well

* fix(files_endpoints.py): remove whitespace around model name

* fix(azure/handler.py): return acreate_file with correct response type

* fix: fix linting errors

* test: fix mock test to run on github actions

* fix: fix ruff errors

* fix: fix file too large error

* fix(utils.py): remove redundant var

* test: modify test to work on github actions

* test: update tests

* test: more debug logs to understand ci/cd issue

* test: fix test for respx

* test: skip mock respx test

fails on ci/cd - not clear why

* fix: fix ruff check

* fix: fix test

* fix(model_connection_test.tsx): fix linting error

* test: update unit tests
2025-04-03 11:48:52 -07:00
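
The reasoning-token tracking in the PR above follows OpenAI's `usage.completion_tokens_details.reasoning_tokens` shape. A sketch of the stream-aggregation step, with a hypothetical helper name standing in for the actual chunk-builder code:

```python
def aggregate_reasoning_tokens(chunks: list[dict]) -> int:
    # Sum reasoning tokens reported across streaming chunks into the
    # final usage object, mirroring completion_tokens_details.
    total = 0
    for chunk in chunks:
        details = (chunk.get("usage") or {}).get("completion_tokens_details") or {}
        total += details.get("reasoning_tokens") or 0
    return total
```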
Krish Dholakia
8ee32291e0
Squashed commit of the following: (#9709)
commit b12a9892b7
Author: Krrish Dholakia <krrishdholakia@gmail.com>
Date:   Wed Apr 2 08:09:56 2025 -0700

    fix(utils.py): don't modify openai_token_counter

commit 294de31803
Author: Krrish Dholakia <krrishdholakia@gmail.com>
Date:   Mon Mar 24 21:22:40 2025 -0700

    fix: fix linting error

commit cb6e9fbe40
Author: Krrish Dholakia <krrishdholakia@gmail.com>
Date:   Mon Mar 24 19:52:45 2025 -0700

    refactor: complete migration

commit bfc159172d
Author: Krrish Dholakia <krrishdholakia@gmail.com>
Date:   Mon Mar 24 19:09:59 2025 -0700

    refactor: refactor more constants

commit 43ffb6a558
Author: Krrish Dholakia <krrishdholakia@gmail.com>
Date:   Mon Mar 24 18:45:24 2025 -0700

    fix: test

commit 04dbe4310c
Author: Krrish Dholakia <krrishdholakia@gmail.com>
Date:   Mon Mar 24 18:28:58 2025 -0700

    refactor: move more constants into constants.py

commit 3c26284aff
Author: Krrish Dholakia <krrishdholakia@gmail.com>
Date:   Mon Mar 24 18:14:46 2025 -0700

    refactor: migrate hardcoded constants out of __init__.py

commit c11e0de69d
Author: Krrish Dholakia <krrishdholakia@gmail.com>
Date:   Mon Mar 24 18:11:21 2025 -0700

    build: migrate all constants into constants.py

commit 7882bdc787
Author: Krrish Dholakia <krrishdholakia@gmail.com>
Date:   Mon Mar 24 18:07:37 2025 -0700

    build: initial test banning hardcoded numbers in repo
2025-04-02 21:24:54 -07:00
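
The "banning hardcoded numbers" test mentioned above can be approximated with an AST scan; the sketch below is one plausible shape, not the repo's actual check:

```python
import ast
import pathlib

ALLOWED = {0, 1, -1}  # trivial literals that are fine inline

def find_hardcoded_numbers(path: str) -> list[tuple[int, object]]:
    # Walk the AST and flag numeric literals that should live in
    # constants.py instead of being scattered through the module.
    tree = ast.parse(pathlib.Path(path).read_text())
    return [
        (node.lineno, node.value)
        for node in ast.walk(tree)
        if isinstance(node, ast.Constant)
        and isinstance(node.value, (int, float))
        and not isinstance(node.value, bool)  # True/False are ints in Python
        and node.value not in ALLOWED
    ]
```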
Krish Dholakia
9b7ebb6a7d
build(pyproject.toml): add new dev dependencies - for type checking (#9631)
* build(pyproject.toml): add new dev dependencies - for type checking

* build: reformat files to fit black

* ci: reformat to fit black

* ci(test-litellm.yml): make tests run clear

* build(pyproject.toml): add ruff

* fix: fix ruff checks

* build(mypy/): fix mypy linting errors

* fix(hashicorp_secret_manager.py): fix passing cert for tls auth

* build(mypy/): resolve all mypy errors

* test: update test

* fix: fix black formatting

* build(pre-commit-config.yaml): use poetry run black

* fix(proxy_server.py): fix linting error

* fix: fix ruff safe representation error
2025-03-29 11:02:13 -07:00
Krish Dholakia
122ee634f4
Merge pull request #9473 from BerriAI/litellm_dev_03_22_2025_p2
Litellm dev 03 22 2025 p2
2025-03-22 21:57:15 -07:00
Krrish Dholakia
59e14fc45c fix(router.py): fix get_model_list to return all wildcard models
enables viewing all wildcard models on `/model/info`
2025-03-22 15:39:23 -07:00
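
A sketch of the wildcard expansion this fix implies: entries like `openai/*` are matched against the known model names so they surface on `/model/info` like concrete deployments (`fnmatch` used here as an assumed matching strategy):

```python
import fnmatch

def expand_wildcard_models(model_list: list[dict], known_models: list[str]) -> list[str]:
    # Expand wildcard deployments (e.g. "openai/*") against the known
    # model names; concrete deployments pass through unchanged.
    names: list[str] = []
    for deployment in model_list:
        pattern = deployment["litellm_params"]["model"]
        if "*" in pattern:
            names.extend(m for m in known_models if fnmatch.fnmatch(m, pattern))
        else:
            names.append(pattern)
    return names
```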
Ishaan Jaff
3a454d00df Merge branch 'main' into litellm_web_search_2 2025-03-22 14:35:32 -07:00
Ishaan Jaff
7dd37a5b18 fix supports_web_search 2025-03-22 14:02:51 -07:00
Krrish Dholakia
b44b3bd36b feat(llm_passthrough_endpoints.py): base case passing for refactored vertex passthrough route 2025-03-22 11:06:52 -07:00
Krrish Dholakia
f089b1e23f feat(endpoints.py): support adding credentials by model id
Allows user to reuse existing model credentials
2025-03-14 12:32:32 -07:00
Ishaan Jaff
da2669154a _update_kwargs_with_default_litellm_params 2025-03-12 19:26:12 -07:00
Ishaan Jaff
bcf8ecc9fc _update_kwargs_with_default_litellm_params 2025-03-12 19:10:19 -07:00
Ishaan Jaff
9e821c915c _update_kwargs_with_default_litellm_params 2025-03-12 18:33:56 -07:00
Ishaan Jaff
c82ef41dc4 test_openai_responses_litellm_router_no_metadata 2025-03-12 18:18:07 -07:00
Ishaan Jaff
d808fa3c23 test_openai_responses_litellm_router 2025-03-12 16:13:48 -07:00
Ishaan Jaff
89d30d39f6 factory_function 2025-03-12 15:27:34 -07:00
Ishaan Jaff
32688df0c2 _generic_api_call_with_fallbacks 2025-03-12 15:26:37 -07:00
Krrish Dholakia
42af49cd87 fix: fix merge conflicts 2025-03-11 18:41:41 -07:00
Krish Dholakia
8dea6e91a6
Merge branch 'litellm_dev_03_10_2025_p3' into litellm_router_client_init_migration 2025-03-11 18:27:56 -07:00
Krrish Dholakia
e4fc6422e2 fix: fix max parallel requests client 2025-03-11 18:25:48 -07:00
Krrish Dholakia
2469072c50 fix: remove unused imports 2025-03-11 18:15:10 -07:00
Krrish Dholakia
58888f117c feat(azure.py): fix azure client init 2025-03-11 18:05:11 -07:00
Krrish Dholakia
f56c5ca380 feat: working e2e credential management - support reusing existing credentials 2025-03-10 19:29:24 -07:00
Krrish Dholakia
fdd5ba3084 feat(credential_accessor.py): support loading in credentials from credential_list
Resolves https://github.com/BerriAI/litellm/issues/9114
2025-03-10 17:15:58 -07:00
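
The credential-accessor idea above lets deployments reference a named credential instead of embedding secrets. A sketch using the `credential_name` / `credential_values` fields from proxy config, with lookup logic that is illustrative only:

```python
CREDENTIAL_LIST = [
    {
        "credential_name": "default_azure",
        "credential_values": {"api_key": "...", "api_base": "..."},
    },
]

def get_credential_values(credential_name: str) -> dict:
    # Resolve a deployment's named credential from the shared list at
    # request time, so the same secret is reused across deployments.
    for cred in CREDENTIAL_LIST:
        if cred["credential_name"] == credential_name:
            return cred["credential_values"]
    raise KeyError(f"unknown credential: {credential_name}")
```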
Krrish Dholakia
5458b08425 fix(router.py): comment out azure/openai client init - not necessary 2025-03-10 16:47:43 -07:00
Krish Dholakia
5591354309
Support master key rotations (#9041)
* feat(key_management_endpoints.py): adding support for rotating master key

* feat(key_management_endpoints.py): support decryption-re-encryption of models in db, when master key rotated

* fix(user_api_key_auth.py): raise valid token is None error earlier

enables easier debugging with api key hash in error message

* feat(key_management_endpoints.py): rotate any env vars

* fix(key_management_endpoints.py): uncomment check

* fix: fix linting error
2025-03-06 23:13:30 -08:00
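
Master key rotation amounts to a decrypt-with-old, re-encrypt-with-new pass over every stored secret. A sketch using Fernet as a stand-in for the proxy's actual encryption helpers:

```python
from cryptography.fernet import Fernet

def rotate_encrypted_values(
    rows: list[bytes], old_key: bytes, new_key: bytes
) -> list[bytes]:
    # Decrypt each stored secret with the outgoing master key and
    # re-encrypt it with the new one in a single pass over the DB rows.
    old, new = Fernet(old_key), Fernet(new_key)
    return [new.encrypt(old.decrypt(token)) for token in rows]
```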
Ishaan Jaff
2a377b161d _create_redis_cache 2025-03-06 21:15:48 -08:00
Ogun Oz
85d1427710
Fix: Create RedisClusterCache when startup nodes provided in cache args of router (#9010)
Co-authored-by: Ogün Öz <ogun.oz@cobrainer.com>
2025-03-06 17:14:32 -08:00
Ishaan Jaff
f47987e673
(Refactor) /v1/messages to follow simpler logic for Anthropic API spec (#9013)
* anthropic_messages_handler v0

* fix /messages

* working messages with router methods

* test_anthropic_messages_handler_litellm_router_non_streaming

* test_anthropic_messages_litellm_router_non_streaming_with_logging

* AnthropicMessagesConfig

* _handle_anthropic_messages_response_logging

* working with /v1/messages endpoint

* working /v1/messages endpoint

* refactor to use router factory function

* use aanthropic_messages

* use BaseConfig for Anthropic /v1/messages

* track api key, team on /v1/messages endpoint

* fix get_logging_payload

* BaseAnthropicMessagesTest

* align test config

* test_anthropic_messages_with_thinking

* test_anthropic_streaming_with_thinking

* fix - display anthropic url for debugging

* test_bad_request_error_handling

* test_anthropic_messages_router_streaming_with_bad_request

* fix ProxyException

* test_bad_request_error_handling_streaming

* use provider_specific_header

* test_anthropic_messages_with_extra_headers

* test_anthropic_messages_to_wildcard_model

* fix gcs pub sub test

* standard_logging_payload

* fix unit testing for anthropic /v1/messages support

* fix pass through anthropic messages api

* delete dead code

* fix anthropic pass through response

* revert change to spend tracking utils

* fix get_litellm_metadata_from_kwargs

* fix spend logs payload json

* proxy_pass_through_endpoint_tests

* TestAnthropicPassthroughBasic

* fix pass through tests

* test_async_vertex_proxy_route_api_key_auth

* _handle_anthropic_messages_response_logging

* vertex_credentials

* test_set_default_vertex_config

* test_anthropic_messages_litellm_router_non_streaming_with_logging

* test_ageneric_api_call_with_fallbacks_basic

* test__aadapter_completion
2025-03-06 00:43:08 -08:00
Krrish Dholakia
8ea3d4c046 build: merge litellm_dev_03_01_2025_p2 2025-03-03 23:05:41 -08:00
Krish Dholakia
2fc6262675
fix(route_llm_request.py): move to using common router, even for clie… (#8966)
* fix(route_llm_request.py): move to using common router, even for client-side credentials

ensures fallbacks / cooldown logic still works

* test(test_route_llm_request.py): add unit test for route request

* feat(router.py): generate unique model id when clientside credential passed in

Prevents cooldowns for api key 1 from impacting api key 2

* test(test_router.py): update testing to ensure original litellm params not mutated

* fix(router.py): upsert clientside call into llm router model list

enables cooldown logic to work accurately

* fix: fix linting error

* test(test_router_utils.py): add direct test for new util on router
2025-03-03 22:57:08 -08:00
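
The "generate unique model id when clientside credential passed in" step above is what isolates cooldowns per key. A sketch of a deterministic per-credential id, with a hypothetical helper name:

```python
import hashlib
import json

def clientside_model_id(model: str, client_api_key: str) -> str:
    # Each client-side key hashes to its own deployment id in the router,
    # so cooldowns triggered by api key 1 can't impact api key 2.
    digest = hashlib.sha256(
        json.dumps({"model": model, "api_key": client_api_key}, sort_keys=True).encode()
    ).hexdigest()
    return f"{model}:{digest[:16]}"
```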
Krrish Dholakia
4418e6dd14 build: merge branch 2025-03-02 08:31:57 -08:00
Ishaan Jaff
300d7825f5
(Observability) - Add more detailed dd tracing on Proxy Auth, Bedrock Auth (#8693)
* add dd tracer

* fix dd tracing

* add @tracer.wrap() on def user_api_key_auth

* add async_function_with_retries

* remove dead code

* add tracer.wrap on base aws llm

* add tracer.wrap on base aws llm

* fix print verbose

* fix dd tracing

* trace base aws llm

* fix test base aws llm

* fix converse transform

* test base aws llm

* BASE_AWS_LLM_PATH

* BASE_AWS_LLM_PATH

* test dd tracing
2025-02-20 18:00:41 -08:00
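
The `@tracer.wrap()` decorator referenced above is ddtrace's standard span wrapper. A minimal usage sketch (the auth function body here is elided, not LiteLLM's real signature):

```python
from ddtrace import tracer

@tracer.wrap()  # emits a Datadog span named after the function per call
async def user_api_key_auth(api_key: str) -> dict:
    ...
```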
Ishaan Jaff
b88762b63c
(Polish/Fixes) - Fixes for Adding Team Specific Models (#8645)
* refactor get model info for team models

* allow adding a model to a team when creating team specific model

* ui update selected Team on Team Dropdown

* test_team_model_association

* testing for team specific models

* test_get_team_specific_model

* test: skip on internal server error

* remove model alias card on teams page

* linting fix _get_team_specific_model

* fix DeploymentTypedDict

* fix linting error

* fix code quality

* fix model info checks

---------

Co-authored-by: Krrish Dholakia <krrishdholakia@gmail.com>
2025-02-18 21:11:57 -08:00
Krish Dholakia
2340f1b31f
Pass router tags in request headers - x-litellm-tags (#8609)
* feat(litellm_pre_call_utils.py): support `x-litellm-tags` request header

allow tag based routing + spend tracking via request headers

* docs(request_headers.md): document new `x-litellm-tags` for tag based routing and spend tracking

* docs(tag_routing.md): add to docs

* fix(utils.py): only pass str values for openai metadata param

* fix(utils.py): drop non-str values for metadata param to openai

preview-feature, otel span was being sent in
2025-02-18 08:26:22 -08:00
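
Parsing the `x-litellm-tags` header is a small comma-split; a sketch, assuming the comma-separated format the docs describe:

```python
def get_tags_from_headers(headers: dict[str, str]) -> list[str]:
    # "x-litellm-tags: team-a, prod" -> ["team-a", "prod"], feeding the
    # same tag-based routing / spend-tracking path as body metadata.
    raw = headers.get("x-litellm-tags", "")
    return [tag.strip() for tag in raw.split(",") if tag.strip()]
```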
Ishaan Jaff
8024300825
(UI) Improvements to Add Team Model Flow (#8603)
* ui - use common team dropdown component

* re-use team component

* rename org field on add model

* handle add model submit

* working view model_id and team_id on root models page

* cleaner

* show all fields

* working model info view

* working team info selector

* clean up team id

* new component for model dashboard

* ui show table with dropdown

* make public model names like email

* revert changes to litellm model name

* fix litellm model name

* ui fix public model

* fix mappings

* fix conditional text input

* fix message

* ui fix bulk add models

* _add_team_model_to_db

* move model mgmt helper funcs

* test_add_team_model_to_db

* ui - display model team model name

* fix add model tab

* fix remove redundant info tab on models page

* dont pass model mappings all the way through

* fix jarring model name when adding team models

* fix edit model button

* delete button on model info

* ui fix model dashboard

* fix DeploymentTypedDict

* _is_model_access_group_for_wildcard_route

* test _get_public_model_name

* ui fix viewing public model name

* fix linting error

* fix linting errors

* fix selectedModel logic
2025-02-17 18:37:14 -08:00
Ishaan Jaff
6b3bfa2b42
(Feat) - return x-litellm-attempted-fallbacks in responses from litellm proxy (#8558)
* add_fallback_headers_to_response

* test x-litellm-attempted-fallbacks

* unit test attempted fallbacks

* fix add_fallback_headers_to_response

* docs document response headers

* fix file name
2025-02-15 14:54:23 -08:00
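
A sketch of the `add_fallback_headers_to_response` idea above: surface how many fallback deployments were tried before the response succeeded (the counter plumbing is assumed, not shown):

```python
def add_fallback_headers_to_response(
    response_headers: dict[str, str], attempted_fallbacks: int
) -> dict[str, str]:
    # Clients can alert on this header to detect degraded routing even
    # when the request ultimately succeeded.
    response_headers["x-litellm-attempted-fallbacks"] = str(attempted_fallbacks)
    return response_headers
```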
Krish Dholakia
a9276f27f9
fix(main.py): fix key leak error when unknown provider given (#8556)
* fix(main.py): fix key leak error when unknown provider given

don't return passed in args if unknown route on embedding

* fix(main.py): remove instances of {args} being passed in exception

prevent potential key leaks

* test(code_coverage/prevent_key_leaks_in_codebase.py): ban usage of {args} in codebase

* fix: fix linting errors

* fix: remove unused variable
2025-02-15 14:02:55 -08:00
Krish Dholakia
f5841eb84d
fix(router.py): add more deployment timeout debug information for tim… (#8523)
* fix(router.py): add more deployment timeout debug information for timeout errors

helps understand why some calls under high traffic don't respect their model-specific timeouts

* test(test_convert_dict_to_response.py): unit test ensuring empty str is not converted to None

Addresses https://github.com/BerriAI/litellm/issues/8507

* fix(convert_dict_to_response.py): handle empty message str - don't return back as 'None'

Fixes https://github.com/BerriAI/litellm/issues/8507

* test(test_completion.py): add e2e test
2025-02-13 17:10:22 -08:00
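
The empty-string fix above is the classic falsy-check pitfall, shown here in miniature:

```python
raw = ""  # a legitimately empty assistant message
buggy = raw or None                       # "" is falsy, becomes None
fixed = raw if raw is not None else None  # "" stays ""
assert buggy is None and fixed == ""
```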