Commit graph

225 commits

Author SHA1 Message Date
Ishaan Jaff
d5150e000d test openai common utils 2025-03-18 18:08:24 -07:00
Ishaan Jaff
55ea2370ba Union[TranscriptionResponse, Coroutine[Any, Any, TranscriptionResponse]]: 2025-03-18 14:23:14 -07:00
Ishaan Jaff
6987a73e36 initialize_azure_sdk_client 2025-03-18 12:27:17 -07:00
Krrish Dholakia
00b338cb9c test: refactor testing to handle routing correctly 2025-03-18 12:24:12 -07:00
Ishaan Jaff
d42f84dd4e
Merge pull request #9328 from BerriAI/litellm_control_internal_user_settings_ui
[UI] - Allow controlling default internal user settings on ui
2025-03-17 22:07:47 -07:00
Krish Dholakia
cd5024f3b1
Merge pull request #9333 from BerriAI/litellm_dev_03_17_2025_p2
fix(ollama/completions/transformation.py): pass prompt, untemplated o…
2025-03-17 21:48:30 -07:00
Ishaan Jaff
5400615ce8
Merge pull request #9329 from BerriAI/litellm_fix_reset_budget_job
[Bug fix] Reset Budget Job
2025-03-17 21:46:08 -07:00
Krish Dholakia
27768a4866
Merge pull request #9330 from BerriAI/litellm_dev_03_17_2025_p1
Litellm dev 03 17 2025 p1
2025-03-17 19:57:25 -07:00
Ishaan Jaff
fbace8d041 unit test reset budget job 2025-03-17 19:54:51 -07:00
Krrish Dholakia
22faf7d232 fix(ollama/completions/transformation.py): pass prompt, untemplated on /completions request
Fixes https://github.com/BerriAI/litellm/issues/6900
2025-03-17 18:35:44 -07:00
Ishaan Jaff
b4568762d2 update_internal_user_settings 2025-03-17 18:26:42 -07:00
Krrish Dholakia
8e27b2026a fix(http_handler.py): support reading ssl security level from env var
Allows user to specify lower security settings
2025-03-17 15:48:31 -07:00
Krrish Dholakia
078e2d341b feat(cost_calculator.py): support reading litellm response cost header in client sdk
allows consistent cost tracking when sdk is calling proxy
2025-03-17 15:12:01 -07:00
Krrish Dholakia
db92956ae3 fix(redis_cache.py): add 5s default timeout 2025-03-17 14:27:36 -07:00
Krrish Dholakia
c0a76427d2 fix(streaming_handler.py): pass complete streaming response on completion 2025-03-16 20:22:12 -07:00
Krrish Dholakia
612d5a284d refactor(litellm_logging.py): delegate returning a complete response to the streaming_handler
Removes incorrect logic for calculating complete streaming response from litellm logging
2025-03-15 09:55:33 -07:00
Krrish Dholakia
cc82d42d25 test(test_streaming_handler.py): add unit test to ensure model response stream with usage is always used 2025-03-15 09:35:48 -07:00
Krrish Dholakia
82252ecac1 test(test_streaming_handler.py): assert chunk is non-empty when usage block given 2025-03-15 09:26:01 -07:00
Krish Dholakia
d4caaae1be
Merge pull request #9274 from BerriAI/litellm_contributor_rebase_branch
All checks were successful
Read Version from pyproject.toml / read-version (push) Successful in 43s
Helm unit test / unit-test (push) Successful in 50s
Litellm contributor rebase branch
2025-03-14 21:57:49 -07:00
Krrish Dholakia
8a6e4715aa feat(converse_transformation.py): fix type for bedrock cache usage block 2025-03-13 19:33:22 -07:00
Krrish Dholakia
96bba9354e test(tests/litellm): add unit test for transform usage function 2025-03-13 15:56:55 -07:00
Krish Dholakia
2c011d9a93
Merge pull request #9123 from omrishiv/8911-fix-model-encoding
Fixes bedrock modelId encoding for Inference Profiles
2025-03-13 10:42:32 -07:00
sven
2d4e58cf23 add test code 2025-03-13 14:00:12 +09:00
Ishaan Jaff
d623183eee test_ensure_initialize_azure_sdk_client_always_used 2025-03-12 18:51:19 -07:00
Krish Dholakia
2d957a0ed9
Merge branch 'main' into litellm_dev_03_10_2025_p3 2025-03-12 14:56:01 -07:00
Ishaan Jaff
2460f3cbab test_validate_environment 2025-03-12 12:57:40 -07:00
Ishaan Jaff
f88380cfdf TestResponseAPILoggingUtils 2025-03-12 12:45:03 -07:00
Ishaan Jaff
f6f5420f0a TestResponsesAPIRequestUtils 2025-03-12 12:42:20 -07:00
Krrish Dholakia
23bf7b5700 fix(azure/completions): migrate completions endpoint to support base azure llm class
enables consistent auth logic across all azure calls
2025-03-11 18:47:30 -07:00
Krrish Dholakia
42af49cd87 fix: fix merge conflicts 2025-03-11 18:41:41 -07:00
Krrish Dholakia
687b2e6300 test: fix test 2025-03-11 18:13:27 -07:00
Krrish Dholakia
cbc2e84044 refactor(azure.py): refactor to have client init work across all endpoints 2025-03-11 17:27:24 -07:00
Krrish Dholakia
d99d60a182 refactor(batches/main.py): working refactored azure client init on batches 2025-03-11 14:36:38 -07:00
Krrish Dholakia
af71e14d79 refactor(azure/audio_transcriptions.py): support client init with common logic 2025-03-11 14:24:12 -07:00
Krrish Dholakia
152bc67d22 refactor(azure.py): working azure client init on audio speech endpoint 2025-03-11 14:19:45 -07:00
Krrish Dholakia
f7d9cce536 refactor(azure.py): refactor acompletion to use base azure sdk client 2025-03-11 13:59:13 -07:00
Krrish Dholakia
b58edb7fa1 test(test_azure_common_utils.py): add unit testing for common azure client params function 2025-03-11 12:24:08 -07:00
omrishiv
e2adbae9f8 Merge branch 'main' into 8911-fix-model-encoding 2025-03-11 08:28:33 -07:00
omrishiv
d25693b96c update test
Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com>
2025-03-10 20:34:52 -07:00
omrishiv
338722beba add test
Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com>
2025-03-10 20:28:55 -07:00
Utkash Dubey
074c8efdc1 cleanup 2025-03-10 20:13:03 -07:00
Utkash Dubey
14736283bc test original file instead of the backup 2025-03-10 20:09:05 -07:00
Utkash Dubey
7dbc434318 remove incorrect moderations mode 2025-03-10 19:33:40 -07:00
Utkash Dubey
a9d50a615b move test file to correct location 2025-03-10 19:21:51 -07:00
vivek-athina
cd4a53d6f2
Merge pull request #4 from BerriAI/main
Update main
2025-03-10 11:13:21 +05:30
Teja Vishwanadha
8c049dfffc
support bytes.IO for audio transcription (#9071) 2025-03-08 08:47:15 -08:00
Ishaan Jaff
e2d612efd9
Bug fix - String data: stripped from entire content in streamed Gemini responses (#9070)
* _strip_sse_data_from_chunk

* use _strip_sse_data_from_chunk

* use _strip_sse_data_from_chunk

* use _strip_sse_data_from_chunk

* _strip_sse_data_from_chunk

* test_strip_sse_data_from_chunk

* _strip_sse_data_from_chunk

* testing

* _strip_sse_data_from_chunk
2025-03-07 21:06:39 -08:00
Ishaan Jaff
b02af305de
[Feat] - Display thinking tokens on OpenWebUI (Bedrock, Anthropic, Deepseek) (#9029)
All checks were successful
Read Version from pyproject.toml / read-version (push) Successful in 14s
* if merge_reasoning_content_in_choices

* _optional_combine_thinking_block_in_choices

* stash changes

* working merge_reasoning_content_in_choices with bedrock

* fix litellm_params accessor

* fix streaming handler

* merge_reasoning_content_in_choices

* _optional_combine_thinking_block_in_choices

* test_bedrock_stream_thinking_content_openwebui

* merge_reasoning_content_in_choices

* fix for _optional_combine_thinking_block_in_choices

* linting error fix
2025-03-06 18:32:58 -08:00
Krish Dholakia
f6535ae6ad
Support format param for specifying image type (#9019)
* fix(transformation.py): support a 'format' parameter for image's

allow user to specify mime type

* fix: pass mimetype via 'format' param

* feat(gemini/chat/transformation.py): support 'format' param for gemini

* fix(factory.py): support 'format' param on sync bedrock converse calls

* feat(bedrock/converse_transformation.py): support 'format' param for bedrock async calls

* refactor(factory.py): move to supporting 'format' param in base helper

ensures consistency in param support

* feat(gpt_transformation.py): filter out 'format' param

don't send invalid param to openai

* fix(gpt_transformation.py): fix translation

* fix: fix translation error
2025-03-05 19:52:53 -08:00
Ishaan Jaff
8d6815ce98 Revert "(UI) - Security Improvement, move to JWT Auth for Admin UI Sessions (#8995)"
This reverts commit 01a44a4e47.
2025-03-05 08:49:20 -08:00