litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-25 10:44:24 +00:00

Author	SHA1	Message	Date
Ishaan Jaff	bde88b3ba6	fix type error	2025-04-04 16:34:43 -07:00
Ishaan Jaff	1cdee4b331	Merge branch 'main' into litellm_metrics_pod_lock_manager	2025-04-04 16:33:16 -07:00
Ishaan Jaff	decb6649ec	test_queue_flush_limit	2025-04-04 16:29:06 -07:00
Ishaan Jaff	e77a178a37	test_queue_size_reduction_with_large_volume	2025-04-04 16:21:29 -07:00
Ishaan Jaff	eb48cbdec6	aggregate_queue_updates	2025-04-04 15:54:07 -07:00
Ishaan Jaff	cdd351a03b	Merge pull request #9745 from BerriAI/litellm_sso_fixes_dev [Feat] Allow assigning SSO users to teams on MSFT SSO	2025-04-04 15:40:19 -07:00
Ishaan Jaff	93068cb142	flush_all_updates_from_in_memory_queue	2025-04-04 15:34:56 -07:00
Ishaan Jaff	065477abb4	add _get_aggregated_spend_update_queue_item	2025-04-04 15:32:27 -07:00
Ishaan Jaff	9abaefea62	add logic for max size in memory queue	2025-04-04 15:31:40 -07:00
Ishaan Jaff	363fb0c46f	add MAX_SIZE_IN_MEMORY_QUEUE	2025-04-04 15:31:09 -07:00
Ishaan Jaff	3374c54ba2	add MAX_SIZE_IN_MEMORY_QUEUE constant	2025-04-04 15:30:53 -07:00
Ishaan Jaff	cba1dacc7d	ui new build	2025-04-04 14:39:55 -07:00
Krrish Dholakia	ad90871ad6	fix(factory.py): don't pass cache control if not set bedrock invoke does not support this	2025-04-04 12:37:34 -07:00
Adrian Lyjak	d640bc0a00	fix #8425 , passthrough kwargs during acompletion, and unwrap extra_body for openrouter (#9747 )	2025-04-03 22:19:40 -07:00
Ishaan Jaff	984114adf0	fix sso callback	2025-04-03 22:13:46 -07:00
Ishaan Jaff	f1bc99a137	MSFT make it easier for using group ids with MSFT	2025-04-03 20:43:22 -07:00
Albert Örwall	bd5a8d582b	Fix prompt caching for Anthropic tool calls (#9706 ) * Add prompt cache support to Anhtropic tool calls * Fix linting issue and add test	2025-04-03 20:19:21 -07:00
Ishaan Jaff	add24d5999	debug show SSO callback result	2025-04-03 20:06:21 -07:00
sajda	4a4328b5bb	fix:Gemini Flash 2.0 implementation is not returning the logprobs (#9713 ) * fix:Gemini Flash 2.0 implementation is not returning the logprobs * fix: linting error by adding a helper method called _process_candidates	2025-04-03 11:53:41 -07:00
Krish Dholakia	6dda1ba6dd	LiteLLM Minor Fixes & Improvements (04/02/2025) (#9725 ) * Add date picker to usage tab + Add reasoning_content token tracking across all providers on streaming (#9722) * feat(new_usage.tsx): add date picker for new usage tab allow user to look back on their usage data * feat(anthropic/chat/transformation.py): report reasoning tokens in completion token details allows usage tracking on how many reasoning tokens are actually being used * feat(streaming_chunk_builder.py): return reasoning_tokens in anthropic/openai streaming response allows tracking reasoning_token usage across providers * Fix update team metadata + fix bulk adding models on Ui (#9721) * fix(handle_add_model_submit.tsx): fix bulk adding models * fix(team_info.tsx): fix team metadata update Fixes https://github.com/BerriAI/litellm/issues/9689 * (v0) Unified file id - allow calling multiple providers with same file id (#9718) * feat(files_endpoints.py): initial commit adding 'target_model_names' support allow developer to specify all the models they want to call with the file * feat(files_endpoints.py): return unified files endpoint * test(test_files_endpoints.py): add validation test - if invalid purpose submitted * feat: more updates * feat: initial working commit of unified file id translation * fix: additional fixes * fix(router.py): remove model replace logic in jsonl on acreate_file enables file upload to work for chat completion requests as well * fix(files_endpoints.py): remove whitespace around model name * fix(azure/handler.py): return acreate_file with correct response type * fix: fix linting errors * test: fix mock test to run on github actions * fix: fix ruff errors * fix: fix file too large error * fix(utils.py): remove redundant var * test: modify test to work on github actions * test: update tests * test: more debug logs to understand ci/cd issue * test: fix test for respx * test: skip mock respx test fails on ci/cd - not clear why * fix: fix ruff check * fix: fix test * fix(model_connection_test.tsx): fix linting error * test: update unit tests	2025-04-03 11:48:52 -07:00
fengjiajie	5a18eebdb6	Fix: Use request body in curl log for Gemini streaming mode (#9736 )	2025-04-03 09:45:27 -07:00
Tobias Hermann	5785600c4e	[Feat] Add VertexAI gemini-2.0-flash (#9723 )	2025-04-02 22:33:23 -07:00
Ishaan Jaff	e3b788ea29	fix test	2025-04-02 21:58:35 -07:00
Ishaan Jaff	dd2d1dc2f4	Merge branch 'main' into litellm_metrics_pod_lock_manager	2025-04-02 21:35:55 -07:00
Krish Dholakia	8ee32291e0	Squashed commit of the following: (#9709 ) commit `b12a9892b7` Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Wed Apr 2 08:09:56 2025 -0700 fix(utils.py): don't modify openai_token_counter commit `294de31803` Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Mon Mar 24 21:22:40 2025 -0700 fix: fix linting error commit `cb6e9fbe40` Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Mon Mar 24 19:52:45 2025 -0700 refactor: complete migration commit `bfc159172d` Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Mon Mar 24 19:09:59 2025 -0700 refactor: refactor more constants commit `43ffb6a558` Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Mon Mar 24 18:45:24 2025 -0700 fix: test commit `04dbe4310c` Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Mon Mar 24 18:28:58 2025 -0700 refactor: refactor: move more constants into constants.py commit `3c26284aff` Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Mon Mar 24 18:14:46 2025 -0700 refactor: migrate hardcoded constants out of __init__.py commit `c11e0de69d` Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Mon Mar 24 18:11:21 2025 -0700 build: migrate all constants into constants.py commit `7882bdc787` Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Mon Mar 24 18:07:37 2025 -0700 build: initial test banning hardcoded numbers in repo	2025-04-02 21:24:54 -07:00
Ishaan Jaff	bcf42fd82d	linting fix prometheus services	2025-04-02 21:19:05 -07:00
Ishaan Jaff	0155b9f212	Merge branch 'main' into litellm_refactor_pod_lock_manager	2025-04-02 21:05:18 -07:00
Ishaan Jaff	5222cce510	Merge branch 'main' into litellm_metrics_pod_lock_manager	2025-04-02 21:04:44 -07:00
Ishaan Jaff	acf920a41a	Merge branch 'main' into litellm_fix_azure_o_series	2025-04-02 20:58:52 -07:00
Ishaan Jaff	c4e8b9607d	fix async_set_cache	2025-04-02 18:54:51 -07:00
Ishaan Jaff	07215e3f7a	fix async_set_cache	2025-04-02 18:51:41 -07:00
Ishaan Jaff	80fb4ece97	prom emit size of DB TX queues for observability	2025-04-02 18:39:29 -07:00
Ishaan Jaff	3256b6af6c	track service types on prom services	2025-04-02 18:03:09 -07:00
Ishaan Jaff	05b30e28db	clean up service metrics	2025-04-02 17:50:41 -07:00
Ishaan Jaff	73bbd0a446	emit lock acquired and released events	2025-04-02 17:40:25 -07:00
Ishaan Jaff	e09ef4afc7	use service logger for tracking pod lock status	2025-04-02 17:39:48 -07:00
Ishaan Jaff	8b12a2e5dc	fix pod lock manager	2025-04-02 14:52:55 -07:00
Ishaan Jaff	a64631edfb	test pod lock manager	2025-04-02 14:39:40 -07:00
Ishaan Jaff	2e939a21b3	refactor pod lock manager to use redis	2025-04-02 14:37:39 -07:00
Ishaan Jaff	b48b8366c2	docs new deadlock fixing architecture	2025-04-02 13:24:53 -07:00
Krish Dholakia	053b0e741f	Add Google AI Studio `/v1/files` upload API support (#9645 ) All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 16s Details Helm unit test / unit-test (push) Successful in 23s Details * test: fix import for test * fix: fix bad error string * docs: cleanup files docs * fix(files/main.py): cleanup error string * style: initial commit with a provider/config pattern for files api google ai studio files api onboarding * fix: test * feat(gemini/files/transformation.py): support gemini files api response transformation * fix(gemini/files/transformation.py): return file id as gemini uri allows id to be passed in to chat completion request, just like openai * feat(llm_http_handler.py): support async route for files api on llm_http_handler * fix: fix linting errors * fix: fix model info check * fix: fix ruff errors * fix: fix linting errors * Revert "fix: fix linting errors" This reverts commit `926a5a527f`. * fix: fix linting errors * test: fix test * test: fix tests	2025-04-02 08:56:58 -07:00
Krish Dholakia	453003c378	fix(gemini/): add gemini/ route optional param mapping support (#9677 ) Fixes https://github.com/BerriAI/litellm/issues/9654	2025-04-02 08:56:32 -07:00
Pranav Simha	2e35f07e94	Add support for max_completion_tokens to the Cohere chat transformation config (#9701 )	2025-04-02 07:50:44 -07:00
Ishaan Jaff	58b4e4b206	add AzureOpenAIO1Config for tools	2025-04-02 06:55:03 -07:00
Ishaan Jaff	9e7c67805b	get_supported_openai_params	2025-04-02 06:52:07 -07:00
Krish Dholakia	6c69ad4c89	fix(model_management_endpoints.py): fix allowing team admins to update team models (#9697 ) All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 17s Details Helm unit test / unit-test (push) Successful in 22s Details * fix(model_management_endpoints.py): fix allowing team admins to update their models * test(test_models.py): add e2e test to for team model flow ensure team admin can always add / edit / delete team models	2025-04-01 22:28:15 -07:00
Krish Dholakia	3d0313b15b	Litellm user daily activity allow non admin usage (#9695 ) * feat(internal_user_endpoints.py): allow non-admin to view their own usage via `/user/daily/activity` route * fix(leftnav.tsx): allow users to view their own usage via new_usage.tsx allows internal users to see their usage via new api Handles 1m+ spend logs scenario * fix(leftnav.tsx): allow all users to see new usage tab	2025-04-01 22:27:26 -07:00
Krish Dholakia	23051d89dd	fix(streaming_handler.py): fix completion start time tracking (#9688 ) * fix(streaming_handler.py): fix completion start time tracking Fixes https://github.com/BerriAI/litellm/issues/9210 * feat(anthropic/chat/transformation.py): map openai 'reasoning_effort' to anthropic 'thinking' param Fixes https://github.com/BerriAI/litellm/issues/9022 * feat: map 'reasoning_effort' to 'thinking' param across bedrock + vertex Closes https://github.com/BerriAI/litellm/issues/9022#issuecomment-2705260808	2025-04-01 22:00:56 -07:00
Tomer Bin	0690f7a3cb	Virtual key based policies in Aim Guardrails (#9499 ) * report key alias to aim * send litellm version to aim * Update docs * blacken * add docs * Add info part about virtual keys specific guards * sort guardrails alphabetically * fix ruff	2025-04-01 21:57:23 -07:00
Ishaan Jaff	4080fe54d5	clean up o series	2025-04-01 21:21:41 -07:00

1 2 3 4 5 ...

13392 commits