litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-26 11:14:04 +00:00

Author	SHA1	Message	Date
Krrish Dholakia	8ef9129556	fix(types/utils.py): support openai 'file' message type Closes https://github.com/BerriAI/litellm/issues/9365	2025-03-19 23:13:51 -07:00
Ishaan Jaff	57e5c94360	Merge branch 'main' into litellm_arize_dynamic_logging	2025-03-18 22:13:35 -07:00
Krrish Dholakia	39ac9e3eca	fix(lowest_tpm_rpm_v2.py): fix updating limits	2025-03-18 17:10:17 -07:00
Ishaan Jaff	bd122f631e	fix arize config	2025-03-18 16:54:31 -07:00
Ishaan Jaff	57d08531a1	add arize_api_key to StandardCallbackDynamicParams	2025-03-18 14:46:55 -07:00
Krrish Dholakia	057c774c14	fix(http_handler.py): fix typing error	2025-03-17 16:42:32 -07:00
Krrish Dholakia	f089b1e23f	feat(endpoints.py): support adding credentials by model id Allows user to reuse existing model credentials	2025-03-14 12:32:32 -07:00
Krrish Dholakia	605a4d1121	feat(endpoints.py): enable retrieving existing credentials by model name Enables reusing existing credentials	2025-03-14 12:02:50 -07:00
Ishaan Jaff	241a36a74f	Merge pull request #9222 from BerriAI/litellm_snowflake_pr_mar_13 [Feat] Add Snowflake Cortex to LiteLLM	2025-03-13 21:35:39 -07:00
Krrish Dholakia	8a6e4715aa	feat(converse_transformation.py): fix type for bedrock cache usage block	2025-03-13 19:33:22 -07:00
Sunny Wan	f9a5109203	Merge branch 'BerriAI:main' into main	2025-03-13 19:37:22 -04:00
Krrish Dholakia	f99b1937db	feat(converse_transformation.py): translate converse usage block with cache creation values to openai format	2025-03-13 15:49:25 -07:00
Ishaan Jaff	d4ed384984	working spend tracking + logging for response api	2025-03-12 17:29:25 -07:00
Krish Dholakia	2d957a0ed9	Merge branch 'main' into litellm_dev_03_10_2025_p3	2025-03-12 14:56:01 -07:00
Ishaan Jaff	de473bee4b	fix mypy linting errors	2025-03-12 12:13:19 -07:00
Ishaan Jaff	342741ede1	Merge branch 'main' into litellm_responses_api_support	2025-03-12 12:04:12 -07:00
Ishaan Jaff	c6a9e8cafe	typing_extensions Annotated	2025-03-12 11:58:56 -07:00
Ishaan Jaff	e6b696370b	BaseLiteLLMOpenAIResponseObject	2025-03-12 11:39:51 -07:00
Ishaan Jaff	fde75a068a	working streaming logging	2025-03-12 00:02:39 -07:00
Ishaan Jaff	4ff6e41c15	ResponsesAPIStreamEvents	2025-03-11 23:42:35 -07:00
Ishaan Jaff	51dc24a405	_transform_response_api_usage_to_chat_usage	2025-03-11 22:26:44 -07:00
Ishaan Jaff	8ada5a469d	add responses api to call types	2025-03-11 22:02:48 -07:00
Ishaan Jaff	24cb83b0e4	Response API cost tracking	2025-03-11 22:02:14 -07:00
Ishaan Jaff	8da714104b	ResponsesAPIStreamingResponse	2025-03-11 17:48:15 -07:00
Ishaan Jaff	2ac5aa2477	ResponsesAPIOptionalRequestParams	2025-03-11 17:36:06 -07:00
Krrish Dholakia	cbc2e84044	refactor(azure.py): refactor to have client init work across all endpoints	2025-03-11 17:27:24 -07:00
Ishaan Jaff	eafc1be132	add ResponsesAPIResponse	2025-03-11 16:46:28 -07:00
Ishaan Jaff	0f8de3d0a5	add transform_request for OpenAI responses API	2025-03-11 16:33:26 -07:00
Ishaan Jaff	401a52e694	working transform	2025-03-11 15:24:42 -07:00
Ishaan Jaff	4b1b87eb67	openai reasoning initial types	2025-03-11 14:28:47 -07:00
Krrish Dholakia	a962a97fcb	feat(endpoints.py): support writing credentials to db	2025-03-10 18:27:43 -07:00
Krrish Dholakia	f1cdc26967	feat(endpoints.py): initial set of crud endpoints for reusable credentials on proxy	2025-03-10 17:48:02 -07:00
Krrish Dholakia	fdd5ba3084	feat(credential_accessor.py): support loading in credentials from credential_list Resolves https://github.com/BerriAI/litellm/issues/9114	2025-03-10 17:15:58 -07:00
Krrish Dholakia	4bd4bb16fd	feat(proxy_server.py): move credential list to being a top-level param	2025-03-10 17:04:05 -07:00
omrishiv	0674491386	add support for Amazon Nova Canvas model (#7838 ) * add initial support for Amazon Nova Canvas model Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com> * adjust name to AmazonNovaCanvas and map function variables to config Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com> * tighten model name check Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com> * fix quality mapping Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com> * add premium quality in config Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com> * support all Amazon Nova Canvas tasks * remove unused import Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com> * add tests for image generation tasks and fix payload Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com> * add missing util file Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com> * update model prices backup file Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com> * remove image tasks other than text->image Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com> --------- Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com> Co-authored-by: Krish Dholakia <krrishdholakia@gmail.com>	2025-03-10 08:02:00 -07:00
Krrish Dholakia	574f5056c8	fix(utils.py): fix linting error	2025-03-09 20:47:12 -07:00
5aaee9	42b7921ca1	fix: perplexity return both delta and message cause OpenWebUI repect text (#9081 )	2025-03-09 19:46:31 -07:00
Krish Dholakia	4330ef8e81	Fix batches api cost tracking + Log batch models in spend logs / standard logging payload (#9077 ) All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 42s Details * feat(batches/): fix batch cost calculation - ensure it's accurate use the correct cost value - prev. defaulting to non-batch cost * feat(batch_utils.py): log batch models to spend logs + standard logging payload makes it easy to understand how cost was calculated * fix: fix stored payload for test * test: fix test	2025-03-08 11:47:25 -08:00
Krish Dholakia	0e3caf92b9	UI - new API Playground for testing LiteLLM translation (#9073 ) * feat: initial commit - enable dev to see translated request * feat(utils.py): expose new endpoint - `/utils/transform_request` to see the raw request sent by litellm * feat(transform_request.tsx): allow user to see their transformed request * refactor(litellm_logging.py): return raw request in 3 parts - api_base, headers, request body easier to render each individually on UI vs. extracting from combined string * feat: transform_request.tsx working e2e raw request viewing * fix(litellm_logging.py): fix transform viewing for bedrock models * fix(litellm_logging.py): don't return sensitive headers in raw request headers prevent accidental leak * feat(transform_request.tsx): style improvements	2025-03-07 19:39:31 -08:00
Ishaan Jaff	b02af305de	[Feat] - Display `thinking` tokens on OpenWebUI (Bedrock, Anthropic, Deepseek) (#9029 ) All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 14s Details * if merge_reasoning_content_in_choices * _optional_combine_thinking_block_in_choices * stash changes * working merge_reasoning_content_in_choices with bedrock * fix litellm_params accessor * fix streaming handler * merge_reasoning_content_in_choices * _optional_combine_thinking_block_in_choices * test_bedrock_stream_thinking_content_openwebui * merge_reasoning_content_in_choices * fix for _optional_combine_thinking_block_in_choices * linting error fix	2025-03-06 18:32:58 -08:00
Ishaan Jaff	f47987e673	(Refactor) `/v1/messages` to follow simpler logic for Anthropic API spec (#9013 ) * anthropic_messages_handler v0 * fix /messages * working messages with router methods * test_anthropic_messages_handler_litellm_router_non_streaming * test_anthropic_messages_litellm_router_non_streaming_with_logging * AnthropicMessagesConfig * _handle_anthropic_messages_response_logging * working with /v1/messages endpoint * working /v1/messages endpoint * refactor to use router factory function * use aanthropic_messages * use BaseConfig for Anthropic /v1/messages * track api key, team on /v1/messages endpoint * fix get_logging_payload * BaseAnthropicMessagesTest * align test config * test_anthropic_messages_with_thinking * test_anthropic_streaming_with_thinking * fix - display anthropic url for debugging * test_bad_request_error_handling * test_anthropic_messages_router_streaming_with_bad_request * fix ProxyException * test_bad_request_error_handling_streaming * use provider_specific_header * test_anthropic_messages_with_extra_headers * test_anthropic_messages_to_wildcard_model * fix gcs pub sub test * standard_logging_payload * fix unit testing for anthopic /v1/messages support * fix pass through anthropic messages api * delete dead code * fix anthropic pass through response * revert change to spend tracking utils * fix get_litellm_metadata_from_kwargs * fix spend logs payload json * proxy_pass_through_endpoint_tests * TestAnthropicPassthroughBasic * fix pass through tests * test_async_vertex_proxy_route_api_key_auth * _handle_anthropic_messages_response_logging * vertex_credentials * test_set_default_vertex_config * test_anthropic_messages_litellm_router_non_streaming_with_logging * test_ageneric_api_call_with_fallbacks_basic * test__aadapter_completion	2025-03-06 00:43:08 -08:00
Krish Dholakia	f6535ae6ad	Support `format` param for specifying image type (#9019 ) * fix(transformation.py): support a 'format' parameter for image's allow user to specify mime type * fix: pass mimetype via 'format' param * feat(gemini/chat/transformation.py): support 'format' param for gemini * fix(factory.py): support 'format' param on sync bedrock converse calls * feat(bedrock/converse_transformation.py): support 'format' param for bedrock async calls * refactor(factory.py): move to supporting 'format' param in base helper ensures consistency in param support * feat(gpt_transformation.py): filter out 'format' param don't send invalid param to openai * fix(gpt_transformation.py): fix translation * fix: fix translation error	2025-03-05 19:52:53 -08:00
Krish Dholakia	ec4f665e29	Return `signature` on anthropic streaming + migrate to `signature` field instead of `signature_delta` [MINOR bump] (#9021 ) * Fix missing signature_delta in thinking blocks when streaming from Claude 3.7 (#8797) Co-authored-by: Krish Dholakia <krrishdholakia@gmail.com> * test: update test to enforce signature found * feat(refactor-signature-param-to-be-'signature'-instead-of-'signature_delta'): keeps it in sync with anthropic * fix: fix linting error --------- Co-authored-by: Martin Krasser <krasserm@googlemail.com>	2025-03-05 19:33:54 -08:00
Krish Dholakia	5e386c28b2	Litellm dev 03 04 2025 p3 (#8997 ) * fix(core_helpers.py): handle litellm_metadata instead of 'metadata' * feat(batches/): ensure batches logs are written to db makes batches response dict compatible * fix(cost_calculator.py): handle batch response being a dictionary * fix(batches/main.py): modify retrieve endpoints to use @client decorator enables logging to work on retrieve call * fix(batches/main.py): fix retrieve batch response type to be 'dict' compatible * fix(spend_tracking_utils.py): send unique uuid for retrieve batch call type create batch and retrieve batch share the same id * fix(spend_tracking_utils.py): prevent duplicate retrieve batch calls from being double counted * refactor(batches/): refactor cost tracking for batches - do it on retrieve, and within the established litellm_logging pipeline ensures cost is always logged to db * fix: fix linting errors * fix: fix linting error	2025-03-04 21:58:03 -08:00
Ishaan Jaff	42931638df	(bug fix) - Fix Cache Health Check for Redis when redis_version is float (#8979 ) * fix allow flexible types for redis version * test_cache_ping_with_redis_version_float * test_cache_ping_with_redis_version_float	2025-03-04 21:26:18 -08:00
Krish Dholakia	662c59adcf	Support caching on reasoning content + other fixes (#8973 ) * fix(factory.py): pass on anthropic thinking content from assistant call * fix(factory.py): fix anthropic messages to handle thinking blocks Fixes https://github.com/BerriAI/litellm/issues/8961 * fix(factory.py): fix bedrock handling for assistant content in messages Fixes https://github.com/BerriAI/litellm/issues/8961 * feat(convert_dict_to_response.py): handle reasoning content + thinking blocks in chat completion block ensures caching works for anthropic thinking block * fix(convert_dict_to_response.py): pass all message params to delta block ensures streaming delta also contains the reasoning content / thinking block * test(test_prompt_factory.py): remove redundant test anthropic now supports assistant as the first message * fix(factory.py): fix linting errors * fix: fix code qa * test: remove falsy test * fix(litellm_logging.py): fix str conversion	2025-03-04 21:12:16 -08:00
Sunny Wan	fd090c8043	[FEAT] Added snowflake completion provider	2025-03-03 01:20:00 -05:00
Ishaan Jaff	ee7cd60fdb	Revert "(bug fix) - don't log messages, prompt, input in `model_parameters` in StandardLoggingPayload (#8923 )" This reverts commit `a119cb420b`.	2025-03-01 11:05:33 -08:00
Ishaan Jaff	a119cb420b	(bug fix) - don't log messages, prompt, input in `model_parameters` in StandardLoggingPayload (#8923 ) * fix _get_model_parameters * test litellm logging * test litellm logging	2025-03-01 10:27:24 -08:00
Ishaan Jaff	3a086cee06	(Feat) - Show Error Logs on LiteLLM UI (#8904 ) * fix test_moderations_bad_model * use async_post_call_failure_hook * basic logging errors in DB * show status on ui * show status on ui * ui show request / response side by side * stash fixes * working, track raw request * track error info in metadata * fix showing error / request / response logs * show traceback on error viewer * ui with traceback of error * fix async_post_call_failure_hook * fix(http_parsing_utils.py): orjson can throw errors on some emoji's in text, default to json.loads * test_get_error_information * fix code quality * rename proxy track cost callback test * _should_store_errors_in_spend_logs * feature flag error logs * Revert "_should_store_errors_in_spend_logs" This reverts commit `7f345df477`. * Revert "feature flag error logs" This reverts commit `0e90c022bb`. * test_spend_logs_payload * fix OTEL log_db_metrics * fix import json * fix ui linting error * test_async_post_call_failure_hook * test_chat_completion_bad_model_with_spend_logs --------- Co-authored-by: Krrish Dholakia <krrishdholakia@gmail.com>	2025-02-28 20:10:09 -08:00

1 2 3 4 5 ...

528 commits