litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-25 02:34:29 +00:00

Author	SHA1	Message	Date
Ishaan Jaff	01d85d5fb7	Merge branch 'main' into litellm_anthropic_messages_improvements	2025-03-31 14:22:56 -07:00
Ishaan Jaff	ce5f55d04e	test fix update spend	2025-03-31 14:20:47 -07:00
Sam	a8673246dc	fix: Anthropic prompt caching on GCP Vertex AI (#9605 ) * fix: Anthropic prompt caching on GCP Vertex AI * test(vertex): anthropic prompt caching	2025-03-29 23:40:34 -07:00
Ishaan Jaff	5df985f964	Merge pull request #9642 from BerriAI/litellm_mcp_improvements_expose_sse_urls [Feat] - MCP improvements, add support for using SSE MCP servers	2025-03-29 19:37:57 -07:00
Krish Dholakia	5c107c64dd	Add gemini audio input support + handle special tokens in sagemaker response (#9640 ) * fix(internal_user_endpoints.py): cleanup unused variables on beta endpoint no team/org split on daily user endpoint * build(model_prices_and_context_window.json): gemini-2.0-flash supports audio input * feat(gemini/transformation.py): support passing audio input to gemini * test: fix test * fix(gemini/transformation.py): support audio input as a url enables passing google cloud bucket urls * fix(gemini/transformation.py): support explicitly passing format of file * fix(gemini/transformation.py): expand support for inferred file types from url * fix(sagemaker/completion/transformation.py): fix special token error when counting sagemaker tokens * test: fix import	2025-03-29 19:23:09 -07:00
Ishaan Jaff	3919e24256	test fix	2025-03-29 18:36:13 -07:00
Ishaan Jaff	194327bb7c	test fixes	2025-03-29 18:34:58 -07:00
Ishaan Jaff	a3df0269bb	fix tests	2025-03-29 17:38:24 -07:00
Ishaan Jaff	4e106ce217	fix test	2025-03-29 17:11:46 -07:00
Ishaan Jaff	3e378f2bec	async def test_spend_logs_payload_e2e(self):	2025-03-29 17:07:36 -07:00
Ishaan Jaff	047d767947	fix tests for gcs pub sub	2025-03-29 17:06:36 -07:00
Ishaan Jaff	79e8bbbfd4	fix types on tools.py	2025-03-29 16:48:15 -07:00
Ishaan Jaff	815263f7bc	rename transform_openai_tool_call_request_to_mcp_tool_call_request	2025-03-29 16:28:23 -07:00
Krish Dholakia	1604f87663	install prisma migration files - connects litellm proxy to litellm's prisma migration files (#9637 ) * build(README.md): initial commit adding a separate folder for additional proxy files. Meant to reduce size of core package * build(litellm-proxy-extras/): new pip package for storing migration files allows litellm proxy to use migration files, without adding them to core repo * build(litellm-proxy-extras/): cleanup pyproject.toml * build: move prisma migration files inside new proxy extras package * build(run_migration.py): update script to write to correct folder * build(proxy_cli.py): load in migration files from litellm-proxy-extras Closes https://github.com/BerriAI/litellm/issues/9558 * build: add MIT license to litellm-proxy-extras * test: update test * fix: fix schema * bump: version 0.1.0 → 0.1.1 * build(publish-proxy-extras.sh): add script for publishing new proxy-extras version * build(liccheck.ini): add litellm-proxy-extras to authorized packages * fix(litellm-proxy-extras/utils.py): move prisma migrate logic inside extra proxy pkg easier since migrations folder already there * build(pre-commit-config.yaml): add litellm_proxy_extras to ci tests * docs(config_settings.md): document new env var * build(pyproject.toml): bump relevant files when litellm-proxy-extras version changed * build(pre-commit-config.yaml): run poetry check on litellm-proxy-extras as well	2025-03-29 15:27:09 -07:00
Ishaan Jaff	a1ec0dd0e2	add testing mcp server	2025-03-29 12:52:46 -07:00
Krish Dholakia	9b7ebb6a7d	build(pyproject.toml): add new dev dependencies - for type checking (#9631 ) * build(pyproject.toml): add new dev dependencies - for type checking * build: reformat files to fit black * ci: reformat to fit black * ci(test-litellm.yml): make tests run clear * build(pyproject.toml): add ruff * fix: fix ruff checks * build(mypy/): fix mypy linting errors * fix(hashicorp_secret_manager.py): fix passing cert for tls auth * build(mypy/): resolve all mypy errors * test: update test * fix: fix black formatting * build(pre-commit-config.yaml): use poetry run black * fix(proxy_server.py): fix linting error * fix: fix ruff safe representation error	2025-03-29 11:02:13 -07:00
Krrish Dholakia	217e8d7d44	test: make script to run clearer	2025-03-29 08:23:18 -07:00
Krish Dholakia	5ac61a7572	Add bedrock latency optimized inference support (#9623 ) * fix(converse_transformation.py): add performanceConfig param support on bedrock Closes https://github.com/BerriAI/litellm/issues/7606 * fix(converse_transformation.py): refactor to use more flexible single getter for params which are separate config blocks * test(test_main.py): add e2e mock test for bedrock performance config * build(model_prices_and_context_window.json): add versioned multimodal embedding * refactor(multimodal_embeddings/): migrate to config pattern * feat(vertex_ai/multimodalembeddings): calculate usage for multimodal embedding calls Enables cost calculation for multimodal embeddings * feat(vertex_ai/multimodalembeddings): get usage object for embedding calls ensures accurate cost tracking for vertexai multimodal embedding calls * fix(embedding_handler.py): remove unused imports * fix: fix linting errors * fix: handle response api usage calculation * test(test_vertex_ai_multimodal_embedding_transformation.py): update tests * test: mark flaky test * feat(vertex_ai/multimodal_embeddings/transformation.py): support text+image+video input * docs(vertex.md): document sending text + image to vertex multimodal embeddings * test: remove incorrect file * fix(multimodal_embeddings/transformation.py): fix linting error * style: remove unused import	2025-03-29 00:23:09 -07:00
Ishaan Jaff	7e8a02099c	Merge branch 'main' into litellm_use_redis_for_updates	2025-03-28 20:12:29 -07:00
Ishaan Jaff	ba550e2147	test local spend accuracy	2025-03-28 19:52:39 -07:00
NickGrab	220d4c07f4	Merge pull request #9625 from BerriAI/litellm_mar_28_vertex_fix Add support to Vertex AI transformation for anyOf union type with null fields	2025-03-28 16:09:29 -07:00
Krish Dholakia	222898d727	Fix anthropic thinking + response_format (#9594 ) * fix(anthropic/chat/transformation.py): Don't set tool choice on response_format conversion when thinking is enabled Not allowed by Anthropic Fixes https://github.com/BerriAI/litellm/issues/8901 * refactor: move test to base anthropic chat tests ensures consistent behaviour across vertex/anthropic/bedrock * fix(anthropic/chat/transformation.py): if thinking token is specified and max tokens is not - ensure max token to anthropic is higher than thinking tokens * feat(converse_transformation.py): correctly handle thinking + response format on Bedrock Converse Fixes https://github.com/BerriAI/litellm/issues/8901 * fix(converse_transformation.py): correctly handle adding max tokens * test: handle service unavailable error	2025-03-28 15:57:40 -07:00
Nicholas Grabar	09daeac188	Rebasing 2	2025-03-28 15:18:09 -07:00
Nicholas Grabar	06a45706b2	Rebase 3	2025-03-28 15:18:05 -07:00
Krrish Dholakia	8c9ff23e19	test(test_caching_handler.py): move to in-memory cache - prevent redis flakiness from impacting ci/cd	2025-03-28 15:16:15 -07:00
Krish Dholakia	205db622bf	fix(proxy_server.py): get master key from environment, if not set in … (#9617 ) * fix(proxy_server.py): get master key from environment, if not set in general settings or general settings not set at all * test: mark flaky test * test(test_proxy_server.py): mock prisma client * ci: add new github workflow for testing just the mock tests * fix: fix linting error * ci(conftest.py): add conftest.py to isolate proxy tests * build(pyproject.toml): add respx to dev dependencies * build(pyproject.toml): add prisma to dev dependencies * test: fix mock prompt management tests to use a mock anthropic key * ci(test-litellm.yml): parallelize mock testing make it run faster * build(pyproject.toml): add hypercorn as dev dep * build(pyproject.toml): separate proxy vs. core dev dependencies make it easier for non-proxy contributors to run tests locally - e.g. no need to install hypercorn * ci(test-litellm.yml): pin python version * test(test_rerank.py): move test - cannot be mocked, requires aws credentials for e2e testing * ci: add thank you message to ci * test: add mock env var to test * test: add autouse to tests * test: test mock env vars for e2e tests	2025-03-28 15:16:15 -07:00
Ishaan Jaff	193052ed70	test pod lock manager	2025-03-28 15:05:17 -07:00
Krrish Dholakia	28a9edb547	test(test_caching_handler.py): move to in-memory cache - prevent redis flakiness from impacting ci/cd All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 15s Details Helm unit test / unit-test (push) Successful in 19s Details	2025-03-28 13:32:04 -07:00
Ishaan Jaff	1eaf847f8a	test pod lock manager	2025-03-28 13:31:45 -07:00
Nicholas Grabar	1f2bbda11d	Add recursion depth to convert_anyof_null_to_nullable, constants.py. Fix recursive_detector.py raise error state	2025-03-28 13:11:19 -07:00
Ishaan Jaff	021eedaf69	test pod lock manager	2025-03-28 12:59:16 -07:00
Ishaan Jaff	c53d172b06	rename pod lock manager	2025-03-28 12:57:00 -07:00
Krish Dholakia	0865e52db3	fix(proxy_server.py): get master key from environment, if not set in … (#9617 ) * fix(proxy_server.py): get master key from environment, if not set in general settings or general settings not set at all * test: mark flaky test * test(test_proxy_server.py): mock prisma client * ci: add new github workflow for testing just the mock tests * fix: fix linting error * ci(conftest.py): add conftest.py to isolate proxy tests * build(pyproject.toml): add respx to dev dependencies * build(pyproject.toml): add prisma to dev dependencies * test: fix mock prompt management tests to use a mock anthropic key * ci(test-litellm.yml): parallelize mock testing make it run faster * build(pyproject.toml): add hypercorn as dev dep * build(pyproject.toml): separate proxy vs. core dev dependencies make it easier for non-proxy contributors to run tests locally - e.g. no need to install hypercorn * ci(test-litellm.yml): pin python version * test(test_rerank.py): move test - cannot be mocked, requires aws credentials for e2e testing * ci: add thank you message to ci * test: add mock env var to test * test: add autouse to tests * test: test mock env vars for e2e tests	2025-03-28 12:32:04 -07:00
NickGrab	b72fbdde74	Merge branch 'main' into litellm_8864-feature-vertex-anyOf-support	2025-03-28 10:25:04 -07:00
Nicholas Grabar	9437ee5e1f	Revert "Unit test fixing and poetry update" This reverts commit `8c79e1902e`.	2025-03-28 10:22:32 -07:00
Nicholas Grabar	8c79e1902e	Unit test fixing and poetry update	2025-03-28 09:57:53 -07:00
Krrish Dholakia	5203382702	test: run test earlier to catch error	2025-03-27 23:08:52 -07:00
Krish Dholakia	ccbac691e5	Support discovering gemini, anthropic, xai models by calling their `/v1/model` endpoint (#9530 ) * fix: initial commit for adding provider model discovery to gemini * feat(gemini/): add model discovery for gemini/ route * docs(set_keys.md): update docs to show you can check available gemini models as well * feat(anthropic/): add model discovery for anthropic api key * feat(xai/): add model discovery for XAI enables checking what models an xai key can call * ci: bump ci config yml * fix(topaz/common_utils.py): fix linting error * fix: fix linting error for python38	2025-03-27 22:50:48 -07:00
Ishaan Jaff	758182fc7f	fix typo on codebase	2025-03-27 22:36:00 -07:00
Krrish Dholakia	79175ddb53	test: fix test	2025-03-27 22:04:59 -07:00
Krrish Dholakia	b3c7785240	test: skip flaky test - failing due to db timeouts - unrelated to test	2025-03-27 20:34:26 -07:00
Krrish Dholakia	e2d4597588	test: mark flaky test	2025-03-27 20:10:57 -07:00
Krish Dholakia	b9d0f460e8	Revert "Support max_completion_tokens on Mistral (#9589 )" (#9604 ) This reverts commit `fef5d23dd5`.	2025-03-27 19:14:26 -07:00
Chris Mancuso	fef5d23dd5	Support max_completion_tokens on Mistral (#9589 ) * Support max_completion_tokens on Mistral * test fix	2025-03-27 17:27:19 -07:00
Krish Dholakia	fb83567a03	Litellm new UI build (#9601 ) * build: new ui build * build: new ui build * fix(proxy_server.py): only show user models their key can access on `/models` * fix(model_management_endpoints.py): ensure team admin can add models * test: update unit testing to reflect changes * fix(model_dashboard.tsx): fix sizing on models page * build: fix ui	2025-03-27 17:15:25 -07:00
Ishaan Jaff	a0fd508de4	DBSpendUpdateWriter	2025-03-27 16:43:18 -07:00
Ishaan Jaff	21e3b764f5	use DBSpendUpdateWriter class for managing DB spend updates	2025-03-27 16:31:23 -07:00
Krish Dholakia	11838e1c3b	Litellm fix db testing (#9593 ) * ci: fix test * test: safely change db url * fix: print db url * test: remove delenv	2025-03-27 14:50:41 -07:00
Krish Dholakia	63c9f59373	Allow team admins to add/update/delete models on UI + show api base and model id on request logs (#9572 ) * feat(view_logs.tsx): show model id + api base in request logs easier debugging * fix(index.tsx): fix length of api base easier viewing * refactor(leftnav.tsx): show models tab to team admin * feat(model_dashboard.tsx): add explainer for what the 'models' page is for team admin helps them understand how they can use it * feat(model_management_endpoints.py): restrict model add by team to just team admin allow team admin to add models via non-team keys (e.g. ui token) * test(test_add_update_models.py): update unit testing for new behaviour * fix(model_dashboard.tsx): show user the models * feat(proxy_server.py): add new query param 'user_models_only' to `/v2/model/info` Allows user to retrieve just the models they've added Used in UI to show internal users just the models they've added * feat(model_dashboard.tsx): allow team admins to view their own models * fix: allow ui user to fetch model cost map * feat(add_model_tab.tsx): require team admins to specify team when onboarding models * fix(_types.py): add `/v1/model/info` to info route `/model/info` was already there * fix(model_info_view.tsx): allow user to edit a model they created * fix(model_management_endpoints.py): allow team admin to update team model * feat(model_managament_endpoints.py): allow team admin to delete team models * fix(model_management_endpoints.py): don't require team id to be set when adding a model * fix(proxy_server.py): fix linting error * fix: fix ui linting error * fix(model_management_endpoints.py): ensure consistent auth checks on all model calls * test: remove old test - function no longer exists in same form * test: add updated mock testing	2025-03-27 12:06:31 -07:00
Krish Dholakia	c0845fec1f	Add OpenAI gpt-4o-transcribe support (#9517 ) * refactor: introduce new transformation config for gpt-4o-transcribe models * refactor: expose new transformation configs for audio transcription * ci: fix config yml * feat(openai/transcriptions): support provider config transformation on openai audio transcriptions allows gpt-4o and whisper audio transformation to work as expected * refactor: migrate fireworks ai + deepgram to new transform request pattern * feat(openai/): working support for gpt-4o-audio-transcribe * build(model_prices_and_context_window.json): add gpt-4o-transcribe to model cost map * build(model_prices_and_context_window.json): specify what endpoints are supported for `/audio/transcriptions` * fix(get_supported_openai_params.py): fix return * refactor(deepgram/): migrate unit test to deepgram handler * refactor: cleanup unused imports * fix(get_supported_openai_params.py): fix linting error * test: update test	2025-03-26 23:10:25 -07:00

... 2 3 4 5 6 ...

1617 commits