litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-25 18:54:30 +00:00

Author	SHA1	Message	Date
Krish Dholakia	792ee079c2	Litellm 04 05 2025 release notes (#9785 ) * docs: update docs * docs: additional cleanup * docs(index.md): add initial links * docs: more doc updates * docs(index.md): add more links * docs(files.md): add gemini files API to docs * docs(index.md): add more docs * docs: more docs * docs: update docs	2025-04-06 09:03:51 -07:00
Ishaan Jaff	52b35cd809	[UI Polish] - Polish login screen (#9778 ) All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 21s Details Helm unit test / unit-test (push) Successful in 24s Details * fix admin ui utils login screen * ui - add layer of polish on login screen * ui fix design of login page * ui fix color scheme on login page	2025-04-05 14:56:03 -07:00
Ishaan Jaff	7f6de81196	ui new build	2025-04-05 12:30:37 -07:00
Ishaan Jaff	3a7061a05c	bug fix de depluciate model list (#9775 )	2025-04-05 12:29:11 -07:00
Krish Dholakia	0d503ad8ad	Move daily user transaction logging outside of 'disable_spend_logs' flag - different tables (#9772 ) All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 16s Details Helm unit test / unit-test (push) Successful in 18s Details * refactor(db_spend_update_writer.py): aggregate table is entirely different * test(test_db_spend_update_writer.py): add unit test to ensure if disable_spend_logs is true daily user transactions is still logged * test: fix test	2025-04-05 09:58:16 -07:00
Krish Dholakia	5099aac1a5	Add DBRX Anthropic w/ thinking + response_format support (#9744 ) * feat(databricks/chat/): add anthropic w/ reasoning content support via databricks Allows user to call claude-3-7-sonnet with thinking via databricks * refactor: refactor choices transformation + add unit testing * fix(databricks/chat/transformation.py): support thinking blocks on databricks response streaming * feat(databricks/chat/transformation.py): support response_format for claude models * fix(databricks/chat/transformation.py): correctly handle response_format={"type": "text"} * feat(databricks/chat/transformation.py): support 'reasoning_effort' param mapping for anthropic * fix: fix ruff errors * fix: fix linting error * test: update test * fix(databricks/chat/transformation.py): handle json mode output parsing * fix(databricks/chat/transformation.py): handle json mode on streaming * test: update test * test: update dbrx testing * test: update testing * fix(base_model_iterator.py): handle non-json chunk * test: update tests * fix: fix ruff check * fix: fix databricks config import * fix: handle _tool = none * test: skip invalid test	2025-04-04 22:13:32 -07:00
Krish Dholakia	e3b231bc11	fix(litellm-proxy-extras/utils.py): check migrations from correct directory + place prisma schema inside litellm-proxy-extras dir (#9767 ) Allows prisma migrate deploy to work as expected on new db's	2025-04-04 22:11:07 -07:00
Ishaan Jaff	b89ed69257	Merge branch 'main' into litellm_add_auth_metrics_endpoint	2025-04-04 21:28:06 -07:00
Ishaan Jaff	8c3670e192	Merge pull request #9719 from BerriAI/litellm_metrics_pod_lock_manager [Reliability] Emit operational metrics for new DB Transaction architecture	2025-04-04 21:12:06 -07:00
Ishaan Jaff	df51d8bcfa	Merge branch 'main' into litellm_metrics_pod_lock_manager	2025-04-04 21:11:39 -07:00
Chaos Yu	001043ba05	make sure metadata available and have a value (#9764 )	2025-04-04 20:39:12 -07:00
Ishaan Jaff	eaad3b2402	PrometheusAuthMiddleware	2025-04-04 20:37:53 -07:00
Ishaan Jaff	86b473d267	allow adding auth on /metrics endpoint	2025-04-04 20:37:17 -07:00
Krish Dholakia	d66db2207b	Allow team members to see team models (#9742 ) * fix(proxy_server.py): allow team member to see team models * fix(model_dashboard.tsx): show edit + delete icons to be disabled if user is not admin and did not create models * fix(proxy_server.py): fix ruff function size error * fix(proxy_server.py): fix user model filter check	2025-04-04 20:36:48 -07:00
Ishaan Jaff	96ce5dbf7d	_should_run_auth_on_metrics_endpoint	2025-04-04 20:32:04 -07:00
Ishaan Jaff	c7523818b4	PrometheusAuthMiddleware	2025-04-04 20:27:17 -07:00
Ishaan Jaff	253060cb09	allow requiring auth for /metrics endpoint	2025-04-04 17:35:02 -07:00
Ishaan Jaff	150e77cd7d	Merge branch 'main' into litellm_reliability_fix_db_txs	2025-04-04 16:46:46 -07:00
Ishaan Jaff	901d6fe7b7	add operational metrics for pod lock manager v2 arch	2025-04-04 16:41:07 -07:00
Ishaan Jaff	1cdee4b331	Merge branch 'main' into litellm_metrics_pod_lock_manager	2025-04-04 16:33:16 -07:00
Ishaan Jaff	decb6649ec	test_queue_flush_limit	2025-04-04 16:29:06 -07:00
Ishaan Jaff	e77a178a37	test_queue_size_reduction_with_large_volume	2025-04-04 16:21:29 -07:00
Ishaan Jaff	eb48cbdec6	aggregate_queue_updates	2025-04-04 15:54:07 -07:00
Ishaan Jaff	cdd351a03b	Merge pull request #9745 from BerriAI/litellm_sso_fixes_dev [Feat] Allow assigning SSO users to teams on MSFT SSO	2025-04-04 15:40:19 -07:00
Ishaan Jaff	93068cb142	flush_all_updates_from_in_memory_queue	2025-04-04 15:34:56 -07:00
Ishaan Jaff	065477abb4	add _get_aggregated_spend_update_queue_item	2025-04-04 15:32:27 -07:00
Ishaan Jaff	9abaefea62	add logic for max size in memory queue	2025-04-04 15:31:40 -07:00
Ishaan Jaff	363fb0c46f	add MAX_SIZE_IN_MEMORY_QUEUE	2025-04-04 15:31:09 -07:00
Ishaan Jaff	cba1dacc7d	ui new build	2025-04-04 14:39:55 -07:00
Ishaan Jaff	984114adf0	fix sso callback	2025-04-03 22:13:46 -07:00
Ishaan Jaff	f1bc99a137	MSFT make it easier for using group ids with MSFT	2025-04-03 20:43:22 -07:00
Ishaan Jaff	add24d5999	debug show SSO callback result	2025-04-03 20:06:21 -07:00
Krish Dholakia	6dda1ba6dd	LiteLLM Minor Fixes & Improvements (04/02/2025) (#9725 ) * Add date picker to usage tab + Add reasoning_content token tracking across all providers on streaming (#9722) * feat(new_usage.tsx): add date picker for new usage tab allow user to look back on their usage data * feat(anthropic/chat/transformation.py): report reasoning tokens in completion token details allows usage tracking on how many reasoning tokens are actually being used * feat(streaming_chunk_builder.py): return reasoning_tokens in anthropic/openai streaming response allows tracking reasoning_token usage across providers * Fix update team metadata + fix bulk adding models on Ui (#9721) * fix(handle_add_model_submit.tsx): fix bulk adding models * fix(team_info.tsx): fix team metadata update Fixes https://github.com/BerriAI/litellm/issues/9689 * (v0) Unified file id - allow calling multiple providers with same file id (#9718) * feat(files_endpoints.py): initial commit adding 'target_model_names' support allow developer to specify all the models they want to call with the file * feat(files_endpoints.py): return unified files endpoint * test(test_files_endpoints.py): add validation test - if invalid purpose submitted * feat: more updates * feat: initial working commit of unified file id translation * fix: additional fixes * fix(router.py): remove model replace logic in jsonl on acreate_file enables file upload to work for chat completion requests as well * fix(files_endpoints.py): remove whitespace around model name * fix(azure/handler.py): return acreate_file with correct response type * fix: fix linting errors * test: fix mock test to run on github actions * fix: fix ruff errors * fix: fix file too large error * fix(utils.py): remove redundant var * test: modify test to work on github actions * test: update tests * test: more debug logs to understand ci/cd issue * test: fix test for respx * test: skip mock respx test fails on ci/cd - not clear why * fix: fix ruff check * fix: fix test * fix(model_connection_test.tsx): fix linting error * test: update unit tests	2025-04-03 11:48:52 -07:00
Ishaan Jaff	dd2d1dc2f4	Merge branch 'main' into litellm_metrics_pod_lock_manager	2025-04-02 21:35:55 -07:00
Krish Dholakia	8ee32291e0	Squashed commit of the following: (#9709 ) commit `b12a9892b7` Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Wed Apr 2 08:09:56 2025 -0700 fix(utils.py): don't modify openai_token_counter commit `294de31803` Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Mon Mar 24 21:22:40 2025 -0700 fix: fix linting error commit `cb6e9fbe40` Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Mon Mar 24 19:52:45 2025 -0700 refactor: complete migration commit `bfc159172d` Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Mon Mar 24 19:09:59 2025 -0700 refactor: refactor more constants commit `43ffb6a558` Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Mon Mar 24 18:45:24 2025 -0700 fix: test commit `04dbe4310c` Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Mon Mar 24 18:28:58 2025 -0700 refactor: refactor: move more constants into constants.py commit `3c26284aff` Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Mon Mar 24 18:14:46 2025 -0700 refactor: migrate hardcoded constants out of __init__.py commit `c11e0de69d` Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Mon Mar 24 18:11:21 2025 -0700 build: migrate all constants into constants.py commit `7882bdc787` Author: Krrish Dholakia <krrishdholakia@gmail.com> Date: Mon Mar 24 18:07:37 2025 -0700 build: initial test banning hardcoded numbers in repo	2025-04-02 21:24:54 -07:00
Ishaan Jaff	5222cce510	Merge branch 'main' into litellm_metrics_pod_lock_manager	2025-04-02 21:04:44 -07:00
Ishaan Jaff	80fb4ece97	prom emit size of DB TX queues for observability	2025-04-02 18:39:29 -07:00
Ishaan Jaff	05b30e28db	clean up service metrics	2025-04-02 17:50:41 -07:00
Ishaan Jaff	73bbd0a446	emit lock acquired and released events	2025-04-02 17:40:25 -07:00
Ishaan Jaff	8b12a2e5dc	fix pod lock manager	2025-04-02 14:52:55 -07:00
Ishaan Jaff	a64631edfb	test pod lock manager	2025-04-02 14:39:40 -07:00
Ishaan Jaff	b48b8366c2	docs new deadlock fixing architecture	2025-04-02 13:24:53 -07:00
Krish Dholakia	053b0e741f	Add Google AI Studio `/v1/files` upload API support (#9645 ) All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 16s Details Helm unit test / unit-test (push) Successful in 23s Details * test: fix import for test * fix: fix bad error string * docs: cleanup files docs * fix(files/main.py): cleanup error string * style: initial commit with a provider/config pattern for files api google ai studio files api onboarding * fix: test * feat(gemini/files/transformation.py): support gemini files api response transformation * fix(gemini/files/transformation.py): return file id as gemini uri allows id to be passed in to chat completion request, just like openai * feat(llm_http_handler.py): support async route for files api on llm_http_handler * fix: fix linting errors * fix: fix model info check * fix: fix ruff errors * fix: fix linting errors * Revert "fix: fix linting errors" This reverts commit `926a5a527f`. * fix: fix linting errors * test: fix test * test: fix tests	2025-04-02 08:56:58 -07:00
Krish Dholakia	453003c378	fix(gemini/): add gemini/ route optional param mapping support (#9677 ) Fixes https://github.com/BerriAI/litellm/issues/9654	2025-04-02 08:56:32 -07:00
Krish Dholakia	6c69ad4c89	fix(model_management_endpoints.py): fix allowing team admins to update team models (#9697 ) All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 17s Details Helm unit test / unit-test (push) Successful in 22s Details * fix(model_management_endpoints.py): fix allowing team admins to update their models * test(test_models.py): add e2e test to for team model flow ensure team admin can always add / edit / delete team models	2025-04-01 22:28:15 -07:00
Krish Dholakia	3d0313b15b	Litellm user daily activity allow non admin usage (#9695 ) * feat(internal_user_endpoints.py): allow non-admin to view their own usage via `/user/daily/activity` route * fix(leftnav.tsx): allow users to view their own usage via new_usage.tsx allows internal users to see their usage via new api Handles 1m+ spend logs scenario * fix(leftnav.tsx): allow all users to see new usage tab	2025-04-01 22:27:26 -07:00
Tomer Bin	0690f7a3cb	Virtual key based policies in Aim Guardrails (#9499 ) * report key alias to aim * send litellm version to aim * Update docs * blacken * add docs * Add info part about virtual keys specific guards * sort guardrails alphabetically * fix ruff	2025-04-01 21:57:23 -07:00
Ishaan Jaff	ac3399238e	Merge pull request #9690 from BerriAI/litellm_fixes_for_deadlock_round_2 [Reliability Fixes] - Ensure no deadlocks occur when updating `DailyUserSpendTransaction`	2025-04-01 20:59:41 -07:00
Krrish Dholakia	0cfc3e822c	build: new ui build	2025-04-01 19:44:23 -07:00
Krish Dholakia	60863bb134	UI - New Usage Tab fixes (#9696 ) * fix(new_usage.tsx): enable smooth scrolling - remove double scroll bars enables easier viewing of overflowing content * fix(new_usage.tsx): fix ordering of daily spend ensure always from earliest to latest date * feat(internal_user_endpoints.py): return key alias on `/user/daily/activity` Enables easier consumption on UI * fix(new_usage.tsx): show key alias on usage tab * feat(activity_metric.tsx): new activity panel - showing spend per model per day allows debugging if models are not being tracked * fix(top_key_view.tsx): use consistent param for selecting key	2025-04-01 19:36:00 -07:00

1 2 3 4 5 ...

4673 commits