Ishaan Jaff
a75ff7cc54
fix code quality check
2025-04-21 19:42:57 -07:00
Ishaan Jaff
c17e5351be
test response id
2025-04-21 19:39:34 -07:00
Ishaan Jaff
a582a067f4
simplify ResponsesApiDeploymentCheck
2025-04-21 19:38:55 -07:00
Ishaan Jaff
39610d4888
docs routing affinity
2025-04-21 17:17:33 -07:00
Ishaan Jaff
95337903dd
test_responses_api_deployment_check.py
2025-04-21 16:59:14 -07:00
Ishaan Jaff
a4f0b32bab
ResponsesApiDeploymentCheck
2025-04-21 16:45:21 -07:00
Ishaan Jaff
6967d991f4
test_responses_api_routing_with_previous_response_id
2025-04-21 16:41:08 -07:00
Ishaan Jaff
a7b7cac7ff
fix ResponsesApiDeploymentCheck
2025-04-21 16:40:23 -07:00
Ishaan Jaff
a460304208
ResponsesApiDeploymentCheck
2025-04-21 16:39:42 -07:00
Ishaan Jaff
cddbac5b28
ResponsesApiDeploymentCheck
2025-04-21 16:21:01 -07:00
Ishaan Jaff
b8838004ca
add ResponsesApiDeploymentCheck
2025-04-21 14:50:39 -07:00
Ishaan Jaff
e82764158e
test_responses_api_routing_with_previous_response_id
2025-04-21 14:41:31 -07:00
Ishaan Jaff
6783c9e618
test for test_responses_api_routing_with_previous_response_id
2025-04-21 14:35:47 -07:00
Li Yang
10257426a2
fix(bedrock): wrong system prompt transformation ( #10120 )
...
Read Version from pyproject.toml / read-version (push) Successful in 16s
Helm unit test / unit-test (push) Successful in 25s
* fix(bedrock): wrong system transformation
* chore: add one more test case
---------
Co-authored-by: Krish Dholakia <krrishdholakia@gmail.com>
2025-04-21 08:48:14 -07:00
Marty Sullivan
0b63c7a2eb
Model pricing updates for Azure & VertexAI ( #10178 )
Read Version from pyproject.toml / read-version (push) Successful in 23s
Helm unit test / unit-test (push) Successful in 27s
2025-04-20 11:33:45 -07:00
Krrish Dholakia
1ff7625984
docs: cleanup
2025-04-20 09:26:05 -07:00
Krrish Dholakia
aa55103486
docs: cleanup doc
Read Version from pyproject.toml / read-version (push) Successful in 19s
Helm unit test / unit-test (push) Successful in 26s
2025-04-20 09:20:47 -07:00
Krrish Dholakia
1d9b58688b
docs(sidebars.js): place scim doc in correct place
2025-04-20 09:20:10 -07:00
Krish Dholakia
ce828408da
fix(proxy_server.py): pass llm router to get complete model list ( #10176 )
...
Read Version from pyproject.toml / read-version (push) Successful in 21s
Helm unit test / unit-test (push) Successful in 27s
allows model auth to work
2025-04-19 22:27:49 -07:00
Krish Dholakia
e0a613f88a
fix(common_daily_activity.py): support empty entity id field ( #10175 )
...
* fix(common_daily_activity.py): support empty entity id field
allows returning empty response when user is not admin and does not belong to any team
* test(test_common_daily_activity.py): add unit testing
2025-04-19 22:20:28 -07:00
Ishaan Jaff
72f6bd3972
fix azure foundry phi error
2025-04-19 22:10:18 -07:00
Ishaan Jaff
36bcb3de4e
fix models appearing under test key page
2025-04-19 21:37:08 -07:00
Krrish Dholakia
bb13ac45c8
docs(index.md): cleanup
2025-04-19 19:16:10 -07:00
Ishaan Jaff
1be36be72e
Litellm docs SCIM ( #10174 )
...
* docs scim
* docs SCIM stash
* docs litellm SCIM
* docs fix
* docs scim with LiteLLM
2025-04-19 18:29:09 -07:00
Krish Dholakia
55a17730fb
fix(transformation.py): pass back in gemini thinking content to api ( #10173 )
...
Ensures thinking content always returned
2025-04-19 18:03:05 -07:00
Krish Dholakia
bbfcb1ac7e
Litellm release notes 04 19 2025 ( #10169 )
...
* docs(index.md): initial draft release notes
* docs: note all pending docs
* build(model_prices_and_context_window.json): add o3, gpt-4.1, o4-mini pricing
* docs(vllm.md): update vllm doc to show file message type support
* docs(mistral.md): add mistral passthrough route doc
* docs(gemini.md): add gemini thinking to docs
* docs(vertex.md): add thinking/reasoning content for gemini models to docs
* docs(index.md): more links
* docs(index.md): add more links, images
* docs(index.md): cleanup highlights
2025-04-19 17:26:30 -07:00
Ishaan Jaff
daf024bad1
Supported Responses API Parameters
2025-04-19 17:14:53 -07:00
Ishaan Jaff
f39d917886
[Docs] Responses API ( #10172 )
...
* docs litellm responses api
* doc fix
* docs responses API
* add get_supported_openai_params for LiteLLMCompletionResponsesConfig
* add Supported Responses API Parameters
2025-04-19 17:10:45 -07:00
Ishaan Jaff
7c3df984da
can_user_call_model ( #10170 )
Read Version from pyproject.toml / read-version (push) Successful in 51s
Helm unit test / unit-test (push) Successful in 51s
2025-04-19 16:46:00 -07:00
Ishaan Jaff
431b230f07
[UI] Bug Fix, team model selector ( #10171 )
...
* fix tooltip
* bug fix fix team model selector
2025-04-19 16:31:38 -07:00
Ishaan Jaff
6206649219
bump: version 1.66.3 → 1.67.0
2025-04-19 14:41:16 -07:00
Ishaan Jaff
653570824a
Bug Fix - Responses API, Loosen restrictions on allowed environments for computer use tool ( #10168 )
...
* loosen allowed types on ComputerToolParam
* test_basic_computer_use_preview_tool_call
2025-04-19 14:40:32 -07:00
Ishaan Jaff
c80e984d7e
ui new build
2025-04-19 14:19:33 -07:00
Ishaan Jaff
b0024bb229
[Bug Fix] Spend Tracking Bug Fix, don't modify in memory default litellm params ( #10167 )
...
* _update_kwargs_with_default_litellm_params
* test_update_kwargs_does_not_mutate_defaults_and_merges_metadata
2025-04-19 14:13:59 -07:00
Ishaan Jaff
0717369ae6
[Feat] Expose Responses API on LiteLLM UI Test Key Page ( #10166 )
...
* add /responses API on UI
* add makeOpenAIResponsesRequest
* add makeOpenAIResponsesRequest
* fix add responses API on UI
* fix endpoint selector
* responses API render chunks on litellm chat ui
* fixes to streaming iterator
* fix render responses completed events
* fixes for MockResponsesAPIStreamingIterator
* transform_responses_api_request_to_chat_completion_request
* fix for responses API
* test_basic_openai_responses_api_streaming
* fix base responses api tests
2025-04-19 13:18:54 -07:00
Krish Dholakia
03b5399f86
test(utils.py): handle scenario where text tokens + reasoning tokens … ( #10165 )
...
* test(utils.py): handle scenario where text tokens + reasoning tokens set, but reasoning tokens not charged separately
Addresses https://github.com/BerriAI/litellm/pull/10141#discussion_r2051555332
* fix(vertex_and_google_ai_studio.py): only set content if non-empty str
2025-04-19 12:32:38 -07:00
Ong Khai Wei
99db1b7690
to get API key from environment viarble of WATSONX_APIKEY ( #10131 )
2025-04-19 11:25:14 -07:00
Classic298
173ef01ef6
Update model_prices_and_context_window_backup.json ( #10122 )
...
* Update model_prices_and_context_window_backup.json
* Update model_prices_and_context_window_backup.json
2025-04-19 11:24:52 -07:00
Krish Dholakia
5c929317cd
fix(triton/completion/transformation.py): remove bad_words / stop wor… ( #10163 )
...
* fix(triton/completion/transformation.py): remove bad_words / stop words from triton call
parameter 'bad_words' has invalid type. It should be either 'int', 'bool', or 'string'.
* fix(proxy_track_cost_callback.py): add debug logging for track cost callback error
2025-04-19 11:23:37 -07:00
Krish Dholakia
f08a4e3c06
Support 'file' message type for VLLM video url's + Anthropic redacted message thinking support ( #10129 )
...
* feat(hosted_vllm/chat/transformation.py): support calling vllm video url with openai 'file' message type
allows switching between gemini/vllm easily
* [WIP] redacted thinking tests (#9044 )
* WIP: redacted thinking tests
* test: add test for redacted thinking in assistant message
---------
Co-authored-by: Krish Dholakia <krrishdholakia@gmail.com>
* fix(anthropic/chat/transformation.py): support redacted thinking block on anthropic completion
Fixes https://github.com/BerriAI/litellm/issues/9058
* fix(anthropic/chat/handler.py): transform anthropic redacted messages on streaming
Fixes https://github.com/BerriAI/litellm/issues/9058
* fix(bedrock/): support redacted text on streaming + non-streaming
Fixes https://github.com/BerriAI/litellm/issues/9058
* feat(litellm_proxy/chat/transformation.py): support 'reasoning_effort' param for proxy
allows using reasoning effort with thinking models on proxy
* test: update tests
* fix(utils.py): fix linting error
* fix: fix linting errors
* fix: fix linting errors
* fix: fix linting error
* fix: fix linting errors
* fix(anthropic/chat/transformation.py): fix returning citations in chat completion
---------
Co-authored-by: Johann Miller <22018973+johannkm@users.noreply.github.com>
2025-04-19 11:16:37 -07:00
Ishaan Jaff
3c463f6715
test fix - output_cost_per_reasoning_token was added to model cost map
2025-04-19 10:02:25 -07:00
Krish Dholakia
2508ca71cb
Handle fireworks ai tool calling response ( #10130 )
...
* feat(fireworks_ai/chat): handle tool calling with fireworks ai correctly
Fixes https://github.com/BerriAI/litellm/issues/7209
* fix(utils.py): handle none type in message
* fix: fix model name in test
* fix(utils.py): fix validate check for openai messages
* fix: fix model returned
* fix(main.py): fix text completion routing
* test: update testing
* test: skip test - cohere having RBAC issues
2025-04-19 09:37:45 -07:00
Krrish Dholakia
b4f2b3dad1
test: update test to be more robust to usage updates
2025-04-19 09:26:26 -07:00
Ishaan Jaff
8ae2653280
fix calculated cache key for tests
2025-04-19 09:25:11 -07:00
Ishaan Jaff
97d7a5e78e
fix deployment name
2025-04-19 09:23:22 -07:00
Krish Dholakia
36308a31be
Gemini-2.5-flash - support reasoning cost calc + return reasoning content ( #10141 )
...
* build(model_prices_and_context_window.json): add vertex ai gemini-2.5-flash pricing
* build(model_prices_and_context_window.json): add gemini reasoning token pricing
* fix(vertex_and_google_ai_studio_gemini.py): support counting thinking tokens for gemini
allows accurate cost calc
* fix(utils.py): add reasoning token cost calc to generic cost calc
ensures gemini-2.5-flash cost calculation is accurate
* build(model_prices_and_context_window.json): mark gemini-2.5-flash as 'supports_reasoning'
* feat(gemini/): support 'thinking' + 'reasoning_effort' params + new unit tests
allow controlling thinking effort for gemini-2.5-flash models
* test: update unit testing
* feat(vertex_and_google_ai_studio_gemini.py): return reasoning content if given in gemini response
* test: update model name
* fix: fix ruff check
* test(test_spend_management_endpoints.py): update tests to be less sensitive to new keys / updates to usage object
* fix(vertex_and_google_ai_studio_gemini.py): fix translation
2025-04-19 09:20:52 -07:00
Ishaan Jaff
db4ebe10c8
bump litellm-proxy-extras
2025-04-19 09:14:33 -07:00
Krrish Dholakia
d726e0f34c
test: update testing imports
2025-04-19 09:13:16 -07:00
Krrish Dholakia
ba1b552e8b
fix(common_daily_activity.py): fix python 3_8 error
2025-04-19 08:39:19 -07:00
Ishaan Jaff
49759d5678
fix get_azure_client
2025-04-19 08:33:26 -07:00