litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-27 19:54:13 +00:00

Author	SHA1	Message	Date
Ishaan Jaff	772c2b1fff	Revert "ui new build" This reverts commit `94563ab1e7`.	2025-03-04 13:28:54 -08:00
Krrish Dholakia	9baf4f7e56	fix: fix linting errors	2025-03-04 06:13:53 -08:00
Krish Dholakia	b5beed5812	Litellm dev 03 01 2025 p2 (#8944 ) All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 12s Details * test(test_router_tag_routing.py): add unit test for tag-based routing on embeddings * fix(router.py): pass request kwargs on async embeddings to async_get_available_deployment function * fix(router.py): require request kwargs to always be passed in ensures tag-based routing always works, across endpoints * feat(langfuse_prompt_management.py): support using prompt management per langfuse project with key/team based logging * fix: fix linting error * fix: fix test * fix: fix test * fix: fix test * fix: fix linting error	2025-03-03 23:06:11 -08:00
Krrish Dholakia	8ea3d4c046	build: merge litellm_dev_03_01_2025_p2	2025-03-03 23:05:41 -08:00
Krish Dholakia	2fc6262675	fix(route_llm_request.py): move to using common router, even for clie… (#8966 ) * fix(route_llm_request.py): move to using common router, even for client-side credentials ensures fallbacks / cooldown logic still works * test(test_route_llm_request.py): add unit test for route request * feat(router.py): generate unique model id when clientside credential passed in Prevents cooldowns for api key 1 from impacting api key 2 * test(test_router.py): update testing to ensure original litellm params not mutated * fix(router.py): upsert clientside call into llm router model list enables cooldown logic to work accurately * fix: fix linting error * test(test_router_utils.py): add direct test for new util on router	2025-03-03 22:57:08 -08:00
Ishaan Jaff	94563ab1e7	ui new build	2025-03-03 22:21:31 -08:00
Ishaan Jaff	c015fb34f1	(UI) - Improvements to session handling logic (#8970 ) * add cookieUtils * use utils for clearing cookies * on logout use clearTokenCookies * ui use correct clearTokenCookies * navbar show userEmail on UserID page * add timestamp on token cookie * update generate_authenticated_redirect_response * use common getAuthToken * fix clearTokenCookies * fixes for get auth token * fix invitation link sign in logic * Revert "fix invitation link sign in logic" This reverts commit `30e5308cb3`. * fix getAuthToken * update setAuthToken * fix ui session handling * fix ui session handler	2025-03-03 22:17:21 -08:00
Sunny Wan	f2c2266fd7	Merge branch 'BerriAI:main' into main	2025-03-03 21:37:43 -05:00
Sunny Wan	bdd03405fe	Removed unnecessary comments	2025-03-03 18:18:24 -05:00
Sunny Wan	4d61ac5f13	Added models to model_prices_and_context	2025-03-03 18:11:33 -05:00
Krish Dholakia	94d28d59e4	Fix deepseek 'reasoning_content' error (#8963 ) * fix(streaming_handler.py): fix deepseek reasoning content streaming Fixes https://github.com/BerriAI/litellm/issues/8939 * test(test_streaming_handler.py): add unit test to streaming handle 'is_chunk_non_empty' function ensures 'reasoning_content' is handled correctly	2025-03-03 14:34:10 -08:00
Sunny Wan	162ea295e9	[CHORE] Removed old code	2025-03-03 01:45:13 -05:00
Sunny Wan	61ee71745a	[CHORE] Added proper typing	2025-03-03 01:42:48 -05:00
Sunny Wan	4f63e9a182	Merge branch 'BerriAI:main' into main	2025-03-03 01:35:04 -05:00
Sunny Wan	b87704cc34	[CHORE] Fixed some style issues and leaks	2025-03-03 01:34:44 -05:00
Krrish Dholakia	db83cbe5c0	build(model_prices_and_context_window.json): add new tracking for anthropic via invoke - clarify it does not support pdf input unlike converse All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 32s Details	2025-03-02 22:31:31 -08:00
Sunny Wan	fd090c8043	[FEAT] Added snowflake completion provider	2025-03-03 01:20:00 -05:00
Michael Schmid	842d8dec09	quote DailyTagSpend in order to look for the right View (#8947 ) PostgreSQL treats unquoted identifiers as lowercase by default. In our query, we're using "DailyTagSpend" (with capital letters), but PostgreSQL will be looking for "dailytagspend" (all lowercase).	2025-03-02 21:36:55 -08:00
Krish Dholakia	54b7f17ca6	fix(proxy_server.py): fix setting router redis cache, if cache enable… (#8859 ) All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 14s Details * fix(proxy_server.py): fix setting router redis cache, if cache enabled on litellm_settings enables configurations like namespace to just work * fix(redis_cache.py): fix key for async increment, to use the set namespace prevents collisions if redis instance shared across environments * fix load tests on litellm release notes * fix caching on main branch (#8858) * fix(streaming_handler.py): fix is delta empty check to handle empty str * fix(streaming_handler.py): fix delta chunk on final response * [Bug]: Deepseek error on proxy after upgrading to 1.61.13-stable (#8860) * fix deepseek error * test_deepseek_provider_async_completion * fix get_complete_url * bump: version 1.61.17 → 1.61.18 * bump: version 1.61.18 → 1.61.19 * vertex ai anthropic thinking param support (#8853) * fix(vertex_llm_base.py): handle credentials passed in as dictionary * fix(router.py): support vertex credentials as json dict * test(test_vertex.py): allows easier testing mock anthropic thinking response for vertex ai * test(vertex_ai_partner_models/): don't remove "@" from model breaks anthropic cost calculation * test: move testing * fix: fix linting error * fix: fix linting error * fix(vertex_ai_partner_models/main.py): split @ for codestral model * test: fix test * fix: fix stripping "@" on mistral models * fix: fix test * test: fix test --------- Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>	2025-03-02 08:39:06 -08:00
Krrish Dholakia	a63eb58f1b	fix(anthropic/chat/transformation.py): fix headers to be a set avoid duplicates	2025-03-02 08:36:43 -08:00
Krrish Dholakia	4418e6dd14	build: merge branch	2025-03-02 08:31:57 -08:00
Ishaan Jaff	88b1e315c8	ui new build All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 14s Details	2025-03-01 17:52:14 -08:00
Ishaan Jaff	f85d5afd58	Merge branch 'main' into litellm_fix_team_model_access_checks	2025-03-01 17:36:45 -08:00
Ishaan Jaff	1008da7c41	(UI) Error Logs improvements - Store Raw proxy server request for success and failure (#8917 ) All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 13s Details * ui render error logs * add proxy_server_request to spend logs metadata * _add_proxy_server_request_to_metadata * proxy track cost callback fixes * _add_proxy_server_request_to_metadata * test spend logs * test_async_post_call_failure_hook	2025-03-01 16:26:47 -08:00
Ishaan Jaff	bc9b3e4847	(Bug fix) - don't log messages in `model_parameters` in StandardLoggingPayload (#8932 ) * define model param helper * use ModelParamHelper * get_standard_logging_model_parameters * fix code quality * get_standard_logging_model_parameters * StandardLoggingPayload * test_get_kwargs_for_cache_key * test_langsmith_key_based_logging * fix code qa * fix linting	2025-03-01 13:39:45 -08:00
Ishaan Jaff	ee7cd60fdb	Revert "(bug fix) - don't log messages, prompt, input in `model_parameters` in StandardLoggingPayload (#8923 )" This reverts commit `a119cb420b`.	2025-03-01 11:05:33 -08:00
Ishaan Jaff	6fc9aa1612	(bug fix) - dd tracer, only send traces when user opts into sending dd-trace (#8928 ) * fix dd tracing null tracer bug * fix dd tracing * fix base aws llm * test_should_use_dd_tracer	2025-03-01 10:53:36 -08:00
Ishaan Jaff	a119cb420b	(bug fix) - don't log messages, prompt, input in `model_parameters` in StandardLoggingPayload (#8923 ) * fix _get_model_parameters * test litellm logging * test litellm logging	2025-03-01 10:27:24 -08:00
Ishaan Jaff	5508239375	patch - auth checks for model access (#8924 )	2025-03-01 10:11:44 -08:00
Ishaan Jaff	428ed1360c	fix overly verbose non blocking error on dd get_request_response_payload	2025-03-01 10:09:18 -08:00
Krish Dholakia	c1527ebf52	UI - Allow admin to control default model access for internal users (#8912 ) All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 36s Details * fix(create_user_button.tsx): allow admin to set models user has access to, on invite Enables controlling model access on invite * feat(auth_checks.py): enforce 'no-model-access' special model name on backend prevent user from calling models if default key has no model access * fix(chat_ui.tsx): allow user to input custom model * fix(chat_ui.tsx): pull available models based on models key has access to * style(create_user_button.tsx): move default model inside 'personal key creation' accordion * fix(chat_ui.tsx): fix linting error * test(test_auth_checks.py): add unit-test for special model name * docs(internal_user_endpoints.py): update docstring * fix test_moderations_bad_model * Litellm dev 02 27 2025 p6 (#8891) * fix(http_parsing_utils.py): orjson can throw errors on some emoji's in text, default to json.loads * fix(sagemaker/handler.py): support passing model id on async streaming * fix(litellm_pre_call_utils.py): Fixes https://github.com/BerriAI/litellm/issues/7237 * Fix calling claude via invoke route + response_format support for claude on invoke route (#8908) * fix(anthropic_claude3_transformation.py): fix amazon anthropic claude 3 tool calling transformation on invoke route move to using anthropic config as base * fix(utils.py): expose anthropic config via providerconfigmanager * fix(llm_http_handler.py): support json mode on async completion calls * fix(invoke_handler/make_call): support json mode for anthropic called via bedrock invoke * fix(anthropic/): handle 'response_format: {"type": "text"}` + migrate amazon claude 3 invoke config to inherit from anthropic config Prevents error when passing in 'response_format: {"type": "text"} * test: fix test * fix(utils.py): fix base invoke provider check * fix(anthropic_claude3_transformation.py): don't pass 'stream' param * fix: fix linting errors * fix(converse_transformation.py): handle response_format type=text for converse * converse_transformation: pass 'description' if set in response_format (#8907) * test(test_bedrock_completion.py): e2e test ensuring tool description is passed in * fix(converse_transformation.py): pass description, if set * fix(transformation.py): Fixes https://github.com/BerriAI/litellm/issues/8767#issuecomment-2689887663 * Fix bedrock passing `response_format: {"type": "text"}` (#8900) * fix(converse_transformation.py): ignore type: text, value in response_format no-op for bedrock * fix(converse_transformation.py): handle adding response format value to tools * fix(base_invoke_transformation.py): fix 'get_bedrock_invoke_provider' to handle cross-region-inferencing models * test(test_bedrock_completion.py): add unit testing for bedrock invoke provider logic * test: update test * fix(exception_mapping_utils.py): add context window exceeded error handling for databricks provider route * fix(fireworks_ai/): support passing tools + response_format together * fix: cleanup * fix(base_invoke_transformation.py): fix imports * (Feat) - Show Error Logs on LiteLLM UI (#8904) * fix test_moderations_bad_model * use async_post_call_failure_hook * basic logging errors in DB * show status on ui * show status on ui * ui show request / response side by side * stash fixes * working, track raw request * track error info in metadata * fix showing error / request / response logs * show traceback on error viewer * ui with traceback of error * fix async_post_call_failure_hook * fix(http_parsing_utils.py): orjson can throw errors on some emoji's in text, default to json.loads * test_get_error_information * fix code quality * rename proxy track cost callback test * _should_store_errors_in_spend_logs * feature flag error logs * Revert "_should_store_errors_in_spend_logs" This reverts commit `7f345df477`. * Revert "feature flag error logs" This reverts commit `0e90c022bb`. * test_spend_logs_payload * fix OTEL log_db_metrics * fix import json * fix ui linting error * test_async_post_call_failure_hook * test_chat_completion_bad_model_with_spend_logs --------- Co-authored-by: Krrish Dholakia <krrishdholakia@gmail.com> * ui new build * test_chat_completion_bad_model_with_spend_logs * docs(release_cycle.md): document release cycle * bump: version 1.62.0 → 1.62.1 --------- Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>	2025-02-28 23:23:03 -08:00
Ishaan Jaff	04f152ceb3	ui new build	2025-02-28 20:12:06 -08:00
Ishaan Jaff	3a086cee06	(Feat) - Show Error Logs on LiteLLM UI (#8904 ) * fix test_moderations_bad_model * use async_post_call_failure_hook * basic logging errors in DB * show status on ui * show status on ui * ui show request / response side by side * stash fixes * working, track raw request * track error info in metadata * fix showing error / request / response logs * show traceback on error viewer * ui with traceback of error * fix async_post_call_failure_hook * fix(http_parsing_utils.py): orjson can throw errors on some emoji's in text, default to json.loads * test_get_error_information * fix code quality * rename proxy track cost callback test * _should_store_errors_in_spend_logs * feature flag error logs * Revert "_should_store_errors_in_spend_logs" This reverts commit `7f345df477`. * Revert "feature flag error logs" This reverts commit `0e90c022bb`. * test_spend_logs_payload * fix OTEL log_db_metrics * fix import json * fix ui linting error * test_async_post_call_failure_hook * test_chat_completion_bad_model_with_spend_logs --------- Co-authored-by: Krrish Dholakia <krrishdholakia@gmail.com>	2025-02-28 20:10:09 -08:00
Krish Dholakia	c84b489d58	Fix bedrock passing `response_format: {"type": "text"}` (#8900 ) * fix(converse_transformation.py): ignore type: text, value in response_format no-op for bedrock * fix(converse_transformation.py): handle adding response format value to tools * fix(base_invoke_transformation.py): fix 'get_bedrock_invoke_provider' to handle cross-region-inferencing models * test(test_bedrock_completion.py): add unit testing for bedrock invoke provider logic * test: update test * fix(exception_mapping_utils.py): add context window exceeded error handling for databricks provider route * fix(fireworks_ai/): support passing tools + response_format together * fix: cleanup * fix(base_invoke_transformation.py): fix imports	2025-02-28 20:09:59 -08:00
Krish Dholakia	c8dc4f3eec	converse_transformation: pass 'description' if set in response_format (#8907 ) * test(test_bedrock_completion.py): e2e test ensuring tool description is passed in * fix(converse_transformation.py): pass description, if set * fix(transformation.py): Fixes https://github.com/BerriAI/litellm/issues/8767#issuecomment-2689887663	2025-02-28 18:47:07 -08:00
Krish Dholakia	a65bfab697	Fix calling claude via invoke route + response_format support for claude on invoke route (#8908 ) * fix(anthropic_claude3_transformation.py): fix amazon anthropic claude 3 tool calling transformation on invoke route move to using anthropic config as base * fix(utils.py): expose anthropic config via providerconfigmanager * fix(llm_http_handler.py): support json mode on async completion calls * fix(invoke_handler/make_call): support json mode for anthropic called via bedrock invoke * fix(anthropic/): handle 'response_format: {"type": "text"}` + migrate amazon claude 3 invoke config to inherit from anthropic config Prevents error when passing in 'response_format: {"type": "text"} * test: fix test * fix(utils.py): fix base invoke provider check * fix(anthropic_claude3_transformation.py): don't pass 'stream' param * fix: fix linting errors * fix(converse_transformation.py): handle response_format type=text for converse	2025-02-28 17:56:26 -08:00
Krish Dholakia	8f86959c32	Litellm dev 02 27 2025 p6 (#8891 ) All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 13s Details * fix(http_parsing_utils.py): orjson can throw errors on some emoji's in text, default to json.loads * fix(sagemaker/handler.py): support passing model id on async streaming * fix(litellm_pre_call_utils.py): Fixes https://github.com/BerriAI/litellm/issues/7237	2025-02-28 14:34:17 -08:00
Vivek Aditya	ed75dd61c2	Removed prints and added unit tests	2025-02-28 21:48:13 +05:30
Tomáš Dvořák	b2eb2365b9	fix: ollama chat async stream error propagation (#8870 ) Ref: #8868	2025-02-28 08:11:56 -08:00
Krish Dholakia	887c66c6b7	Show 'user_email' on key table on UI (#8887 ) All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 35s Details * refactor(internal_user_endpoints.py): refactor `/user/list` to accept 'user_ids' and use prisma for db calls enables bulk search from UI * fix(internal_user_endpoints.py): fix linting errors * fix(all_keys_table.tsx): show user email on create key table make it easier for admin to know which key is associated to which user * docs(internal_user_endpoints.py): improve docstring * fix: sync schema with main * fix(columns.tsx): display SSO ID on Internal User Table make it easy to identify what the SSO ID for a user is * fix(columns.tsx): add tooltip to header help user understand what SSO ID means * style: add more tooltips in the management flows make it easier to understand what you're seeing * style(all_keys_table.tsx): replace 'Not Set' with '-' reduces words on table * fix(internal_user_endpoints.py): fix user ids check * test: fix test * fix(internal_user_endpoints.py): maintain returning key count in `/user/list`	2025-02-27 21:56:14 -08:00
Krish Dholakia	740bd7e9ce	(security fix) - Enforce model access restrictions on Azure OpenAI route (#8888 ) * fix(user_api_key_auth.py): Fixes https://github.com/BerriAI/litellm/issues/8780 security fix - enforce model access checks on azure routes * test(test_user_api_key_auth.py): add unit testing * test(test_openai_endpoints.py): add e2e test to ensure azure routes also run through model validation checks	2025-02-27 21:24:58 -08:00
Ishaan Jaff	f7bafbf2ab	Install Node.js	2025-02-27 21:09:04 -08:00
Jorge Piedrahita Ortiz	ad1357dd0e	update sambanova docs (#8875 )	2025-02-27 20:23:33 -08:00
Ishaan Jaff	51590742bd	ui new build	2025-02-27 20:09:44 -08:00
Ishaan Jaff	51a6a219cd	(Improvements) use `/openai/` pass through with OpenAI Ruby for Assistants API (#8884 ) * add ruby assistants testing * _join_url_paths * run ruby tests on ci/cd * TestBaseOpenAIPassThroughHandler * _join_url_paths * fix _join_url_paths * Install Ruby and Bundler * Install Ruby and Bundler	2025-02-27 20:01:16 -08:00
Ishaan Jaff	378e3d9e4d	(Proxy improvement) - Raise `BadRequestError` when unknown model passed in request (#8886 ) * fix safe access model in request body * litellm.BadRequestError * don't pass model in request body * test_chat_completion_bad_model	2025-02-27 19:30:57 -08:00
Krish Dholakia	2d2d1b9df5	Add `created_by` and `updated_by` fields to Keys table (#8885 ) * fix(proxy/_types.py): return created_by and updated_by on /key/list enables better trail of who made a key * fix(all_keys_table.tsx): add created by to key table allows easier tracking of who generated the key * fix(key_management_endpoints.py): track 'created_by' and 'updated_by' fields enable easier tracking of who created proxy keys	2025-02-27 18:12:58 -08:00
Krish Dholakia	91cdc01149	Allow team/org filters to be searchable on the Create Key Page (#8881 ) * fix(filtercomponent): always show apply filters button fix hiding behavior * style(create_key_button.tsx): style improvements on create key modal remove the numbering * feat(filter.tsx): allow searching team by team alias * style(filter.tsx): style improvements to ensure dropdown + custom value works as expected * style(filter.tsx): add explicit button allowing reset filters * fix(filter.tsx): fix linting error * feat(all_keys_table.tsx): show team alias on keys table * style(all_keys_table.tsx): enforce length constraints on table make it easier to see all columns	2025-02-27 18:11:03 -08:00
Ishaan Jaff	1e7b9cf767	(fix) Pass through spend tracking - ensure `custom_llm_provider` is tracked for Vertex, Google AI Studio, Anthropic (#8882 ) * fix track custom llm provider on pass through routes * fix use correct provider for google ai studio * fix tracking custom llm provider on pass through route * ui fix get provider logo * update tests to track custom llm provider * test_anthropic_streaming_with_headers * Potential fix for code scanning alert no. 2263: Incomplete URL substring sanitization Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> --------- Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>	2025-02-27 17:09:43 -08:00
Ishaan Jaff	047d1b1208	(Bug Fix) - Accurate token counting for `/anthropic/` API Routes on LiteLLM Proxy (#8880 ) * fix _create_anthropic_response_logging_payload * fix - pass through don't create standard logging payload * fix logged key hash * test_init_kwargs_for_pass_through_endpoint_basic * test_unit_test_anthropic_pass_through * fix anthropic pass through logging handler * test_stream_token_counting_anthropic_with_include_usage * convert_str_chunk_to_generic_chunk * _build_complete_streaming_response * test_anthropic_basic_completion_with_headers * test_anthropic_streaming_with_headers * improve test for pass through token counting	2025-02-27 15:43:03 -08:00

... 19 20 21 22 23 ...

13483 commits