Ishaan Jaff
59040167ac
fix ProxyErrorTypes
2025-03-25 14:40:11 -07:00
Ishaan Jaff
4c87084ff7
UserAPIKeyAuthExceptionHandler
2025-03-25 14:07:14 -07:00
Ishaan Jaff
0af9a5e8d0
add gemini/gemini-2.0-flash-lite
2025-03-25 07:51:42 -07:00
Ishaan Jaff
62bb7d6605
add vertex gemini-2.0-flash-lite
2025-03-25 07:48:33 -07:00
Krish Dholakia
92883560f0
fix vertex ai multimodal embedding translation ( #9471 )
...
Read Version from pyproject.toml / read-version (push) Successful in 20s
Helm unit test / unit-test (push) Successful in 24s
* remove data:image/jpeg;base64, prefix from base64 image input
vertex_ai's multimodal embeddings endpoint expects a raw base64 string without `data:image/jpeg;base64,` prefix.
* Add Vertex Multimodal Embedding Test
* fix(test_vertex.py): add e2e tests on multimodal embeddings
* test: unit testing
* test: remove sklearn dep
* test: update test with fixed route
* test: fix test
---------
Co-authored-by: Jonarod <jonrodd@gmail.com>
Co-authored-by: Emerson Gomes <emerson.gomes@thalesgroup.com>
2025-03-24 23:23:28 -07:00
Krrish Dholakia
75994d0bf0
test: improve flaky test
2025-03-24 23:15:04 -07:00
Krish Dholakia
a619580bf8
Add vertexai topLogprobs support ( #9518 )
...
* Added support for top_logprobs in vertex gemini models
* Testing for top_logprobs feature in vertexai
* Update litellm/llms/vertex_ai/gemini/vertex_and_google_ai_studio_gemini.py
Co-authored-by: Tom Matthews <tomukmatthews@gmail.com>
* refactor(tests/): refactor testing to be in correct repo
---------
Co-authored-by: Aditya Thaker <adityathaker28@gmail.com>
Co-authored-by: Tom Matthews <tomukmatthews@gmail.com>
2025-03-24 22:42:38 -07:00
Ishaan Jaff
12639b7ccf
fix sagemaker streaming error
2025-03-24 21:29:29 -07:00
Krish Dholakia
bd309a28c5
Merge pull request #9512 from BerriAI/litellm_dev_03_24_2025_p3
...
fix(invoke_handler.py): remove hard coded chunk on streaming usage
2025-03-24 21:21:36 -07:00
Ishaan Jaff
863fe3a4d2
fix import mcp router
2025-03-24 21:08:24 -07:00
Ishaan Jaff
fb3a67ba8d
fix code quality
2025-03-24 21:06:17 -07:00
Ishaan Jaff
dcc2edbd4d
fix mcp import
2025-03-24 21:02:54 -07:00
Ishaan Jaff
08a4ba1b7e
Merge branch 'main' into litellm_exp_mcp_server
2025-03-24 19:03:56 -07:00
Ishaan Jaff
002837f962
fix linting errors
2025-03-24 18:48:43 -07:00
Ishaan Jaff
d17ab7da2a
Merge pull request #9508 from BerriAI/litellm_fix_gcs_pub_sub
...
[Fix] Use StandardLoggingPayload for GCS Pub Sub Logging Integration
2025-03-24 18:22:43 -07:00
Krrish Dholakia
5089dbfcfb
fix(invoke_handler.py): remove hard code
2025-03-24 17:58:26 -07:00
Ishaan Jaff
fca5926600
default to use SLP for GCS PubSub
2025-03-24 15:21:59 -07:00
Krrish Dholakia
6a0cf3db50
fix(litellm_logging.py): always log the api base
...
Fixes issue where api base missing from spend logs due to refactor
2025-03-24 13:45:39 -07:00
Ishaan Jaff
8b8fca1e20
fix mcp type imports
2025-03-24 11:01:29 -07:00
Ishaan Jaff
5017332690
fix pydantic import error
2025-03-24 07:11:48 -07:00
Tyler Hutcherson
7864cd1f76
update redisvl dependency
2025-03-24 08:42:11 -04:00
Krish Dholakia
122ee634f4
Merge pull request #9473 from BerriAI/litellm_dev_03_22_2025_p2
...
Read Version from pyproject.toml / read-version (push) Successful in 18s
Helm unit test / unit-test (push) Successful in 20s
Litellm dev 03 22 2025 p2
2025-03-22 21:57:15 -07:00
Krrish Dholakia
9fea7cec00
build: add new vertex text embedding model
2025-03-22 21:50:21 -07:00
Ishaan Jaff
f21a0c2da7
Merge branch 'main' into litellm_exp_mcp_server
2025-03-22 18:51:25 -07:00
Ishaan Jaff
d3cad97ebb
add gpt-4o-2024-08-06 pricing for web tools
2025-03-22 18:48:33 -07:00
Ishaan Jaff
edfcff47b9
fix bug when unable to look up model info
2025-03-22 18:24:47 -07:00
Ishaan Jaff
cf03b2a797
test_langfuse_logging_completion
2025-03-22 18:09:04 -07:00
Ishaan Jaff
1b9f1837cd
_get_file_search_tool_call
2025-03-22 18:04:33 -07:00
Ishaan Jaff
ded612563d
_get_file_search_tool_call
2025-03-22 18:02:22 -07:00
Krrish Dholakia
790e063044
fix: fix linting error
2025-03-22 17:56:39 -07:00
Ishaan Jaff
4d4e0d8a6d
FileSearchTool
2025-03-22 17:56:14 -07:00
Ishaan Jaff
8766b43a4b
fix StandardBuiltInToolsParams
2025-03-22 17:53:06 -07:00
Ishaan Jaff
0f36b8afc2
chat_completion_response_includes_annotations
2025-03-22 17:36:22 -07:00
Ishaan Jaff
7f807cd6ba
fix code quality check
2025-03-22 17:31:47 -07:00
Ishaan Jaff
f74640cb99
test_openai_responses_api_web_search_cost_tracking
2025-03-22 17:24:29 -07:00
Ishaan Jaff
81033dc8c9
test_openai_responses_api_web_search_cost_tracking
2025-03-22 17:21:41 -07:00
Ishaan Jaff
904795443d
get_cost_for_built_in_tools
2025-03-22 17:06:34 -07:00
Ishaan Jaff
bdefc9fe06
add web search to gpt-4o-2024-08-06
2025-03-22 17:02:28 -07:00
Ishaan Jaff
b9ee316e23
fix model cost map
2025-03-22 16:59:04 -07:00
Ishaan Jaff
b0e61152da
fix model cost map
2025-03-22 16:58:48 -07:00
Ishaan Jaff
475dfaa156
fixes for web search cost tracking
2025-03-22 16:56:32 -07:00
Ishaan Jaff
bfe3132bb6
StandardBuiltInToolCostTracking
2025-03-22 16:06:47 -07:00
Ishaan Jaff
10da287225
initialize_standard_built_in_tools_params
2025-03-22 16:03:53 -07:00
Ishaan Jaff
ded9a2e481
add cost tracking for StandardBuiltInToolCostTracking
2025-03-22 16:01:12 -07:00
Krrish Dholakia
59e14fc45c
fix(router.py): fix get_model_list to return all wildcard models
...
enables viewing all wildcard models on `/model/info`
2025-03-22 15:39:23 -07:00
Ishaan Jaff
1910ed6027
WebSearchOptions
2025-03-22 15:39:04 -07:00
Ishaan Jaff
e19b82f202
add WebSearchOptions as supported chat completion param
2025-03-22 15:37:34 -07:00
Krrish Dholakia
f859c6ac0f
fix(llm_passthrough_endpoints.py): fix raising helpful debug error message
2025-03-22 15:06:16 -07:00
Ishaan Jaff
cf22d31b2b
search_context_cost_per_query
2025-03-22 14:52:58 -07:00
Ishaan Jaff
3a454d00df
Merge branch 'main' into litellm_web_search_2
2025-03-22 14:35:32 -07:00