Commit graph

2246 commits

Author SHA1 Message Date
Ishaan Jaff
34142a1b62 _init_azure_client_for_cloudflare_ai_gateway 2025-03-18 10:11:54 -07:00
Ishaan Jaff
edfbf21c39 fix re-using azure openai client 2025-03-18 10:06:56 -07:00
Ishaan Jaff
f2026ef907 fix - correctly re-use azure openai client 2025-03-18 09:51:28 -07:00
Ishaan Jaff
b74f3cb76c _get_azure_openai_client 2025-03-18 09:38:27 -07:00
Ishaan Jaff
26be805ad3 rename to _get_azure_openai_client 2025-03-18 09:25:26 -07:00
Ishaan Jaff
6e351136d7 handle _get_async_http_client for OpenAI 2025-03-18 08:56:08 -07:00
Krish Dholakia
bcbb88d802
Merge pull request #9326 from andjsmi/main
Modify completion handler for SageMaker to use payload from `prepared_request`
2025-03-17 22:16:43 -07:00
Krish Dholakia
cd5024f3b1
Merge pull request #9333 from BerriAI/litellm_dev_03_17_2025_p2
fix(ollama/completions/transformation.py): pass prompt, untemplated o…
2025-03-17 21:48:30 -07:00
Krrish Dholakia
22faf7d232 fix(ollama/completions/transformation.py): pass prompt, untemplated on /completions request
Fixes https://github.com/BerriAI/litellm/issues/6900
2025-03-17 18:35:44 -07:00
Andrew Smith
81a7cf0f44
Update handler.py to use prepared_request.body for input 2025-03-18 11:07:38 +11:00
Krrish Dholakia
057c774c14 fix(http_handler.py): fix typing error 2025-03-17 16:42:32 -07:00
Andrew Smith
a92e99e946
Update handler.py to use prepared_request.body 2025-03-18 10:23:32 +11:00
Krrish Dholakia
8e27b2026a fix(http_handler.py): support reading ssl security level from env var
Allows user to specify lower security settings
2025-03-17 15:48:31 -07:00
Krish Dholakia
d0d8ec2c40
Merge branch 'main' into litellm_dev_03_16_2025_p1 2025-03-17 10:02:53 -07:00
Krrish Dholakia
b093157369 fix(converse_transformation.py): fix linting error 2025-03-15 19:33:17 -07:00
Krrish Dholakia
5dc46f0cf7 fix(converse_transformation.py): fix encoding model 2025-03-15 14:03:37 -07:00
Krrish Dholakia
dd2c980d5b fix(utils.py): Prevents final chunk w/ usage from being ignored
Fixes https://github.com/BerriAI/litellm/issues/7112
2025-03-15 09:12:14 -07:00
Krish Dholakia
d4caaae1be
Merge pull request #9274 from BerriAI/litellm_contributor_rebase_branch
All checks were successful
Read Version from pyproject.toml / read-version (push) Successful in 43s
Helm unit test / unit-test (push) Successful in 50s
Litellm contributor rebase branch
2025-03-14 21:57:49 -07:00
Lucas Raschek
56d3e75b33 Map total tokens to prompt_tokens too 2025-03-14 18:04:43 +01:00
Ishaan Jaff
241a36a74f
Merge pull request #9222 from BerriAI/litellm_snowflake_pr_mar_13
[Feat] Add Snowflake Cortex to LiteLLM
2025-03-13 21:35:39 -07:00
Ishaan Jaff
69b47cf738 fix code quality check 2025-03-13 20:10:41 -07:00
Krish Dholakia
e8c67f25e3
Merge pull request #9221 from BerriAI/litellm_dev_03_13_2025_p2
Support bedrock converse cache token tracking
2025-03-13 20:08:33 -07:00
Krish Dholakia
fd8a5960ec
Merge pull request #9216 from BerriAI/litellm_dev_03_12_2025_contributor_prs_p2
Litellm dev 03 12 2025 contributor prs p2
2025-03-13 20:03:57 -07:00
Krrish Dholakia
8a6e4715aa feat(converse_transformation.py): fix type for bedrock cache usage block 2025-03-13 19:33:22 -07:00
Ishaan Jaff
d3781dfe36 fix linting errors 2025-03-13 16:58:34 -07:00
Sunny Wan
c942f4cd86 Merge branch 'main' of https://github.com/SunnyWan59/litellm 2025-03-13 19:42:25 -04:00
Sunny Wan
70770b6aa4 Removed unnecessary code and refactored 2025-03-13 19:42:10 -04:00
Sunny Wan
f9a5109203
Merge branch 'BerriAI:main' into main 2025-03-13 19:37:22 -04:00
Krrish Dholakia
0af6cde994 fix(invoke_handler.py): support cache token tracking on converse streaming 2025-03-13 16:10:13 -07:00
Krrish Dholakia
f99b1937db feat(converse_transformation.py): translate converse usage block with cache creation values to openai format 2025-03-13 15:49:25 -07:00
Krrish Dholakia
5ffd3f56f8 fix(azure.py): track azure llm api latency metric 2025-03-13 14:47:35 -07:00
Krish Dholakia
2c011d9a93
Merge pull request #9123 from omrishiv/8911-fix-model-encoding
Fixes bedrock modelId encoding for Inference Profiles
2025-03-13 10:42:32 -07:00
Krish Dholakia
58e5f3e0c9
Merge pull request #9193 from youngchannelforyou/feat/gemini_response_status_code
(gemini)Handle HTTP 201 status code in Vertex AI response
2025-03-12 22:24:01 -07:00
Krish Dholakia
cff1c1f7d8
Merge branch 'main' into litellm_dev_03_12_2025_p1 2025-03-12 22:14:02 -07:00
sven
8d053930e9 (gemini)Handle HTTP 201 status code in Vertex AI response 2025-03-13 13:44:38 +09:00
Krrish Dholakia
88e9edf7db refactor: update method signature 2025-03-12 15:23:38 -07:00
Krrish Dholakia
738c0b873d fix(azure_ai/transformation.py): support passing api version to azure ai services endpoint
Fixes https://github.com/BerriAI/litellm/issues/7275
2025-03-12 15:16:42 -07:00
Krish Dholakia
2d957a0ed9
Merge branch 'main' into litellm_dev_03_10_2025_p3 2025-03-12 14:56:01 -07:00
Ishaan Jaff
2460f3cbab test_validate_environment 2025-03-12 12:57:40 -07:00
Ishaan Jaff
39d391d8e7 Optional[Dict] 2025-03-12 12:29:13 -07:00
Ishaan Jaff
342741ede1 Merge branch 'main' into litellm_responses_api_support 2025-03-12 12:04:12 -07:00
Ishaan Jaff
181072e15d fix code quality checks 2025-03-12 11:47:10 -07:00
Ishaan Jaff
047879c004 add aresponses 2025-03-12 09:22:44 -07:00
Ishaan Jaff
4ff6e41c15 ResponsesAPIStreamEvents 2025-03-11 23:42:35 -07:00
Ishaan Jaff
278b6fb5f6 add debug logging 2025-03-11 23:13:10 -07:00
Krish Dholakia
a7e0e7283e
Merge pull request #9166 from BerriAI/litellm_dev_03_11_2025_p2
Litellm dev 03 11 2025 p2
2025-03-11 22:51:20 -07:00
Krish Dholakia
8c0bf06c87
Merge branch 'main' into litellm_dev_contributor_prs_03_10_2025_p1 2025-03-11 22:50:02 -07:00
Krrish Dholakia
b8d590da0c fix(azure/audio_transcriptions.py): support azure cost tracking
extract content time and log correctly as duration
2025-03-11 22:25:13 -07:00
Ishaan Jaff
ddb819da45 fix order of imports 2025-03-11 22:04:24 -07:00
Krrish Dholakia
92d85555fe fix(invoke_handler.py): fix converse chunk parsing to only return empty dict on tool use
Fixes https://github.com/BerriAI/litellm/issues/9127
2025-03-11 22:04:17 -07:00