Commit graph

13468 commits

Author SHA1 Message Date
Ishaan Jaff
fafbaf6302 Merge branch 'main' into litellm_stability_fixes 2025-03-26 07:16:31 -07:00
Ishaan Jaff
fb31006cd8 select_model_for_request_transformation 2025-03-26 07:12:25 -07:00
Ishaan Jaff
27c085cc56 fix util vertex 2025-03-26 00:08:16 -07:00
Ishaan Jaff
2bef0481af _transform_request_body 2025-03-26 00:05:45 -07:00
Ishaan Jaff
830ecbdb8c fix get_optional_params 2025-03-25 23:54:57 -07:00
Ishaan Jaff
aa8ba9b8f2 fix base_model in param mapping 2025-03-25 23:54:29 -07:00
Krish Dholakia
6fd18651d1
Support litellm.api_base for vertex_ai + gemini/ across completion, embedding, image_generation (#9516)
All checks were successful
Read Version from pyproject.toml / read-version (push) Successful in 19s
Helm unit test / unit-test (push) Successful in 20s
* test(tests): add unit testing for litellm_proxy integration

* fix(cost_calculator.py): fix tracking cost in sdk when calling proxy

* fix(main.py): respect litellm.api_base on `vertex_ai/` and `gemini/` routes

* fix(main.py): consistently support custom api base across gemini + vertexai on embedding + completion

* feat(vertex_ai/): test

* fix: fix linting error

* test: set api base as None before starting loadtest
2025-03-25 23:46:20 -07:00
Ishaan Jaff
8657816477 fix gemini/gemini-2.0-flash-lite on model cost map 2025-03-25 23:20:43 -07:00
Ishaan Jaff
c61214dcf1
Merge pull request #9523 from BerriAI/litellm_add_gemini_flash_lite
[Feat - New Model] Add VertexAI `gemini-2.0-flash-lite` and Google AI Studio `gemini-2.0-flash-lite`
2025-03-25 23:11:44 -07:00
Nicholas Grabar
f68cc26f15 8864 Add support for anyOf union type while handling null fields 2025-03-25 22:37:28 -07:00
Ishaan Jaff
79ef184345 run ci/cd again 2025-03-25 21:57:45 -07:00
Ishaan Jaff
b4e745323a add test config 2025-03-25 19:21:51 -07:00
Ishaan Jaff
ce49e27217 fixes for auth checks 2025-03-25 15:44:13 -07:00
Ishaan Jaff
59040167ac fix ProxyErrorTypes 2025-03-25 14:40:11 -07:00
Ishaan Jaff
4c87084ff7 UserAPIKeyAuthExceptionHandler 2025-03-25 14:07:14 -07:00
Ishaan Jaff
0af9a5e8d0 add gemini/gemini-2.0-flash-lite 2025-03-25 07:51:42 -07:00
Ishaan Jaff
62bb7d6605 add vertex gemini-2.0-flash-lite 2025-03-25 07:48:33 -07:00
Krish Dholakia
92883560f0
fix vertex ai multimodal embedding translation (#9471)
All checks were successful
Read Version from pyproject.toml / read-version (push) Successful in 20s
Helm unit test / unit-test (push) Successful in 24s
* remove data:image/jpeg;base64, prefix from base64 image input

vertex_ai's multimodal embeddings endpoint expects a raw base64 string without `data:image/jpeg;base64,` prefix.

* Add Vertex Multimodal Embedding Test

* fix(test_vertex.py): add e2e tests on multimodal embeddings

* test: unit testing

* test: remove sklearn dep

* test: update test with fixed route

* test: fix test

---------

Co-authored-by: Jonarod <jonrodd@gmail.com>
Co-authored-by: Emerson Gomes <emerson.gomes@thalesgroup.com>
2025-03-24 23:23:28 -07:00
Krrish Dholakia
75994d0bf0 test: improve flaky test 2025-03-24 23:15:04 -07:00
Krish Dholakia
a619580bf8
Add vertexai topLogprobs support (#9518)
* Added support for top_logprobs in vertex gemini models

* Testing for top_logprobs feature in vertexai

* Update litellm/llms/vertex_ai/gemini/vertex_and_google_ai_studio_gemini.py

Co-authored-by: Tom Matthews <tomukmatthews@gmail.com>

* refactor(tests/): refactor testing to be in correct repo

---------

Co-authored-by: Aditya Thaker <adityathaker28@gmail.com>
Co-authored-by: Tom Matthews <tomukmatthews@gmail.com>
2025-03-24 22:42:38 -07:00
Ishaan Jaff
12639b7ccf fix sagemaker streaming error 2025-03-24 21:29:29 -07:00
Krish Dholakia
bd309a28c5
Merge pull request #9512 from BerriAI/litellm_dev_03_24_2025_p3
fix(invoke_handler.py): remove hard coded chunk on streaming usage
2025-03-24 21:21:36 -07:00
Ishaan Jaff
863fe3a4d2 fix import mcp router 2025-03-24 21:08:24 -07:00
Ishaan Jaff
fb3a67ba8d fix code quality 2025-03-24 21:06:17 -07:00
Ishaan Jaff
dcc2edbd4d fix mcp import 2025-03-24 21:02:54 -07:00
Ishaan Jaff
08a4ba1b7e Merge branch 'main' into litellm_exp_mcp_server 2025-03-24 19:03:56 -07:00
Ishaan Jaff
002837f962 fix linting errors 2025-03-24 18:48:43 -07:00
Ishaan Jaff
d17ab7da2a
Merge pull request #9508 from BerriAI/litellm_fix_gcs_pub_sub
[Fix] Use StandardLoggingPayload for GCS Pub Sub Logging Integration
2025-03-24 18:22:43 -07:00
Krrish Dholakia
5089dbfcfb fix(invoke_handler.py): remove hard code 2025-03-24 17:58:26 -07:00
Ishaan Jaff
fca5926600 default to use SLP for GCS PubSub 2025-03-24 15:21:59 -07:00
Krrish Dholakia
6a0cf3db50 fix(litellm_logging.py): always log the api base
Fixes issue where api base missing from spend logs due to refactor
2025-03-24 13:45:39 -07:00
Ishaan Jaff
8b8fca1e20 fix mcp type imports 2025-03-24 11:01:29 -07:00
Ishaan Jaff
5017332690 fix pydantic import error 2025-03-24 07:11:48 -07:00
Tyler Hutcherson
7864cd1f76 update redisvl dependency 2025-03-24 08:42:11 -04:00
Krish Dholakia
122ee634f4
Merge pull request #9473 from BerriAI/litellm_dev_03_22_2025_p2
All checks were successful
Read Version from pyproject.toml / read-version (push) Successful in 18s
Helm unit test / unit-test (push) Successful in 20s
Litellm dev 03 22 2025 p2
2025-03-22 21:57:15 -07:00
Krrish Dholakia
9fea7cec00 build: add new vertex text embedding model 2025-03-22 21:50:21 -07:00
Ishaan Jaff
f21a0c2da7 Merge branch 'main' into litellm_exp_mcp_server 2025-03-22 18:51:25 -07:00
Ishaan Jaff
d3cad97ebb add gpt-4o-2024-08-06 pricing for web tools 2025-03-22 18:48:33 -07:00
Ishaan Jaff
edfcff47b9 fix bug when unable to look up model info 2025-03-22 18:24:47 -07:00
Ishaan Jaff
cf03b2a797 test_langfuse_logging_completion 2025-03-22 18:09:04 -07:00
Ishaan Jaff
1b9f1837cd _get_file_search_tool_call 2025-03-22 18:04:33 -07:00
Ishaan Jaff
ded612563d _get_file_search_tool_call 2025-03-22 18:02:22 -07:00
Krrish Dholakia
790e063044 fix: fix linting error 2025-03-22 17:56:39 -07:00
Ishaan Jaff
4d4e0d8a6d FileSearchTool 2025-03-22 17:56:14 -07:00
Ishaan Jaff
8766b43a4b fix StandardBuiltInToolsParams 2025-03-22 17:53:06 -07:00
Ishaan Jaff
0f36b8afc2 chat_completion_response_includes_annotations 2025-03-22 17:36:22 -07:00
Ishaan Jaff
7f807cd6ba fix code quality check 2025-03-22 17:31:47 -07:00
Ishaan Jaff
f74640cb99 test_openai_responses_api_web_search_cost_tracking 2025-03-22 17:24:29 -07:00
Ishaan Jaff
81033dc8c9 test_openai_responses_api_web_search_cost_tracking 2025-03-22 17:21:41 -07:00
Ishaan Jaff
904795443d get_cost_for_built_in_tools 2025-03-22 17:06:34 -07:00