litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-24 10:14:26 +00:00

Author	SHA1	Message	Date
Ishaan Jaff	a75ff7cc54	fix code quality check	2025-04-21 19:42:57 -07:00
Ishaan Jaff	c17e5351be	test response id	2025-04-21 19:39:34 -07:00
Ishaan Jaff	a582a067f4	simplify ResponsesApiDeploymentCheck	2025-04-21 19:38:55 -07:00
Ishaan Jaff	39610d4888	docs routing affinity	2025-04-21 17:17:33 -07:00
Ishaan Jaff	95337903dd	test_responses_api_deployment_check.py	2025-04-21 16:59:14 -07:00
Ishaan Jaff	a4f0b32bab	ResponsesApiDeploymentCheck	2025-04-21 16:45:21 -07:00
Ishaan Jaff	6967d991f4	test_responses_api_routing_with_previous_response_id	2025-04-21 16:41:08 -07:00
Ishaan Jaff	a7b7cac7ff	fix ResponsesApiDeploymentCheck	2025-04-21 16:40:23 -07:00
Ishaan Jaff	a460304208	ResponsesApiDeploymentCheck	2025-04-21 16:39:42 -07:00
Ishaan Jaff	cddbac5b28	ResponsesApiDeploymentCheck	2025-04-21 16:21:01 -07:00
Ishaan Jaff	b8838004ca	add ResponsesApiDeploymentCheck	2025-04-21 14:50:39 -07:00
Ishaan Jaff	e82764158e	test_responses_api_routing_with_previous_response_id	2025-04-21 14:41:31 -07:00
Ishaan Jaff	6783c9e618	test for test_responses_api_routing_with_previous_response_id	2025-04-21 14:35:47 -07:00
Li Yang	10257426a2	fix(bedrock): wrong system prompt transformation (#10120 ) * fix(bedrock): wrong system transformation * chore: add one more test case --------- Co-authored-by: Krish Dholakia <krrishdholakia@gmail.com>	2025-04-21 08:48:14 -07:00
Marty Sullivan	0b63c7a2eb	Model pricing updates for Azure & VertexAI (#10178 )	2025-04-20 11:33:45 -07:00
Krrish Dholakia	1ff7625984	docs: cleanup	2025-04-20 09:26:05 -07:00
Krrish Dholakia	aa55103486	docs: cleanup doc	2025-04-20 09:20:47 -07:00
Krrish Dholakia	1d9b58688b	docs(sidebars.js): place scim doc in correct place	2025-04-20 09:20:10 -07:00
Krish Dholakia	ce828408da	fix(proxy_server.py): pass llm router to get complete model list (#10176 ) allows model auth to work	2025-04-19 22:27:49 -07:00
Krish Dholakia	e0a613f88a	fix(common_daily_activity.py): support empty entity id field (#10175 ) * fix(common_daily_activity.py): support empty entity id field allows returning empty response when user is not admin and does not belong to any team * test(test_common_daily_activity.py): add unit testing	2025-04-19 22:20:28 -07:00
Ishaan Jaff	72f6bd3972	fix azure foundry phi error	2025-04-19 22:10:18 -07:00
Ishaan Jaff	36bcb3de4e	fix models appearing under test key page	2025-04-19 21:37:08 -07:00
Krrish Dholakia	bb13ac45c8	docs(index.md): cleanup	2025-04-19 19:16:10 -07:00
Ishaan Jaff	1be36be72e	Litellm docs SCIM (#10174 ) * docs scim * docs SCIM stash * docs litellm SCIM * docs fix * docs scim with LiteLLM	2025-04-19 18:29:09 -07:00
Krish Dholakia	55a17730fb	fix(transformation.py): pass back in gemini thinking content to api (#10173 ) Ensures thinking content always returned	2025-04-19 18:03:05 -07:00
Krish Dholakia	bbfcb1ac7e	Litellm release notes 04 19 2025 (#10169 ) * docs(index.md): initial draft release notes * docs: note all pending docs * build(model_prices_and_context_window.json): add o3, gpt-4.1, o4-mini pricing * docs(vllm.md): update vllm doc to show file message type support * docs(mistral.md): add mistral passthrough route doc * docs(gemini.md): add gemini thinking to docs * docs(vertex.md): add thinking/reasoning content for gemini models to docs * docs(index.md): more links * docs(index.md): add more links, images * docs(index.md): cleanup highlights	2025-04-19 17:26:30 -07:00
Ishaan Jaff	daf024bad1	Supported Responses API Parameters	2025-04-19 17:14:53 -07:00
Ishaan Jaff	f39d917886	[Docs] Responses API (#10172 ) * docs litellm responses api * doc fix * docs responses API * add get_supported_openai_params for LiteLLMCompletionResponsesConfig * add Supported Responses API Parameters	2025-04-19 17:10:45 -07:00
Ishaan Jaff	7c3df984da	can_user_call_model (#10170 )	2025-04-19 16:46:00 -07:00
Ishaan Jaff	431b230f07	[UI] Bug Fix, team model selector (#10171 ) * fix tooltip * bug fix fix team model selector	2025-04-19 16:31:38 -07:00
Ishaan Jaff	6206649219	bump: version 1.66.3 → 1.67.0	2025-04-19 14:41:16 -07:00
Ishaan Jaff	653570824a	Bug Fix - Responses API, Loosen restrictions on allowed environments for computer use tool (#10168 ) * loosen allowed types on ComputerToolParam * test_basic_computer_use_preview_tool_call	2025-04-19 14:40:32 -07:00
Ishaan Jaff	c80e984d7e	ui new build	2025-04-19 14:19:33 -07:00
Ishaan Jaff	b0024bb229	[Bug Fix] Spend Tracking Bug Fix, don't modify in memory default litellm params (#10167 ) * _update_kwargs_with_default_litellm_params * test_update_kwargs_does_not_mutate_defaults_and_merges_metadata	2025-04-19 14:13:59 -07:00
Ishaan Jaff	0717369ae6	[Feat] Expose Responses API on LiteLLM UI Test Key Page (#10166 ) * add /responses API on UI * add makeOpenAIResponsesRequest * add makeOpenAIResponsesRequest * fix add responses API on UI * fix endpoint selector * responses API render chunks on litellm chat ui * fixes to streaming iterator * fix render responses completed events * fixes for MockResponsesAPIStreamingIterator * transform_responses_api_request_to_chat_completion_request * fix for responses API * test_basic_openai_responses_api_streaming * fix base responses api tests	2025-04-19 13:18:54 -07:00
Krish Dholakia	03b5399f86	test(utils.py): handle scenario where text tokens + reasoning tokens … (#10165 ) * test(utils.py): handle scenario where text tokens + reasoning tokens set, but reasoning tokens not charged separately Addresses https://github.com/BerriAI/litellm/pull/10141#discussion_r2051555332 * fix(vertex_and_google_ai_studio.py): only set content if non-empty str	2025-04-19 12:32:38 -07:00
Ong Khai Wei	99db1b7690	to get API key from environment viarble of WATSONX_APIKEY (#10131 )	2025-04-19 11:25:14 -07:00
Classic298	173ef01ef6	Update model_prices_and_context_window_backup.json (#10122 ) * Update model_prices_and_context_window_backup.json * Update model_prices_and_context_window_backup.json	2025-04-19 11:24:52 -07:00
Krish Dholakia	5c929317cd	fix(triton/completion/transformation.py): remove bad_words / stop wor… (#10163 ) * fix(triton/completion/transformation.py): remove bad_words / stop words from triton call parameter 'bad_words' has invalid type. It should be either 'int', 'bool', or 'string'. * fix(proxy_track_cost_callback.py): add debug logging for track cost callback error	2025-04-19 11:23:37 -07:00
Krish Dholakia	f08a4e3c06	Support 'file' message type for VLLM video url's + Anthropic redacted message thinking support (#10129 ) * feat(hosted_vllm/chat/transformation.py): support calling vllm video url with openai 'file' message type allows switching between gemini/vllm easily * [WIP] redacted thinking tests (#9044) * WIP: redacted thinking tests * test: add test for redacted thinking in assistant message --------- Co-authored-by: Krish Dholakia <krrishdholakia@gmail.com> * fix(anthropic/chat/transformation.py): support redacted thinking block on anthropic completion Fixes https://github.com/BerriAI/litellm/issues/9058 * fix(anthropic/chat/handler.py): transform anthropic redacted messages on streaming Fixes https://github.com/BerriAI/litellm/issues/9058 * fix(bedrock/): support redacted text on streaming + non-streaming Fixes https://github.com/BerriAI/litellm/issues/9058 * feat(litellm_proxy/chat/transformation.py): support 'reasoning_effort' param for proxy allows using reasoning effort with thinking models on proxy * test: update tests * fix(utils.py): fix linting error * fix: fix linting errors * fix: fix linting errors * fix: fix linting error * fix: fix linting errors * fix(anthropic/chat/transformation.py): fix returning citations in chat completion --------- Co-authored-by: Johann Miller <22018973+johannkm@users.noreply.github.com>	2025-04-19 11:16:37 -07:00
Ishaan Jaff	3c463f6715	test fix - output_cost_per_reasoning_token was added to model cost map	2025-04-19 10:02:25 -07:00
Krish Dholakia	2508ca71cb	Handle fireworks ai tool calling response (#10130 ) * feat(fireworks_ai/chat): handle tool calling with fireworks ai correctly Fixes https://github.com/BerriAI/litellm/issues/7209 * fix(utils.py): handle none type in message * fix: fix model name in test * fix(utils.py): fix validate check for openai messages * fix: fix model returned * fix(main.py): fix text completion routing * test: update testing * test: skip test - cohere having RBAC issues	2025-04-19 09:37:45 -07:00
Krrish Dholakia	b4f2b3dad1	test: update test to be more robust to usage updates	2025-04-19 09:26:26 -07:00
Ishaan Jaff	8ae2653280	fix calculated cache key for tests	2025-04-19 09:25:11 -07:00
Ishaan Jaff	97d7a5e78e	fix deployment name	2025-04-19 09:23:22 -07:00
Krish Dholakia	36308a31be	Gemini-2.5-flash - support reasoning cost calc + return reasoning content (#10141 ) * build(model_prices_and_context_window.json): add vertex ai gemini-2.5-flash pricing * build(model_prices_and_context_window.json): add gemini reasoning token pricing * fix(vertex_and_google_ai_studio_gemini.py): support counting thinking tokens for gemini allows accurate cost calc * fix(utils.py): add reasoning token cost calc to generic cost calc ensures gemini-2.5-flash cost calculation is accurate * build(model_prices_and_context_window.json): mark gemini-2.5-flash as 'supports_reasoning' * feat(gemini/): support 'thinking' + 'reasoning_effort' params + new unit tests allow controlling thinking effort for gemini-2.5-flash models * test: update unit testing * feat(vertex_and_google_ai_studio_gemini.py): return reasoning content if given in gemini response * test: update model name * fix: fix ruff check * test(test_spend_management_endpoints.py): update tests to be less sensitive to new keys / updates to usage object * fix(vertex_and_google_ai_studio_gemini.py): fix translation	2025-04-19 09:20:52 -07:00
Ishaan Jaff	db4ebe10c8	bump litellm-proxy-extras	2025-04-19 09:14:33 -07:00
Krrish Dholakia	d726e0f34c	test: update testing imports	2025-04-19 09:13:16 -07:00
Krrish Dholakia	ba1b552e8b	fix(common_daily_activity.py): fix python 3_8 error	2025-04-19 08:39:19 -07:00
Ishaan Jaff	49759d5678	fix get_azure_client	2025-04-19 08:33:26 -07:00


21569 commits