* Simple fix for #9339 - upgrade the underlying library and cache the azure storage client (#9965)
* fix: use constants for caching the azure storage client
---------
Co-authored-by: Adrian Lyjak <adrian@chatmeter.com>
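A minimal sketch of the client-caching pattern this fix introduces, assuming a module-level cache keyed by a named constant (the constant, cache dict, and helper below are illustrative, not the actual names in the codebase):

```python
from typing import Dict

# illustrative constant, not the actual name used in litellm
_AZURE_STORAGE_CLIENT_CACHE_KEY = "azure_storage_client"

# module-level cache so the client is constructed once per process
_client_cache: Dict[str, object] = {}

def get_azure_storage_client(account_url: str, credential: object):
    """Return a cached Azure Data Lake client, creating it on first use."""
    client = _client_cache.get(_AZURE_STORAGE_CLIENT_CACHE_KEY)
    if client is None:
        # deferred import keeps the azure dependency optional
        from azure.storage.filedatalake.aio import DataLakeServiceClient

        client = DataLakeServiceClient(account_url=account_url, credential=credential)
        _client_cache[_AZURE_STORAGE_CLIENT_CACHE_KEY] = client
    return client
```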
* feat(schema.prisma): initial commit adding aggregate table for team spend
allows team spend to remain visible at 1M+ logs
* feat(db_spend_update_writer.py): support logging aggregate team spend
allows the usage dashboard to work at 1M+ logs
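A rough sketch of how writing to an aggregate table avoids scanning individual logs, assuming a daily team-spend table keyed by (team_id, date); the table and column names here are hypothetical:

```python
import datetime
from collections import defaultdict
from typing import Dict, Tuple

# accumulate spend updates in memory, then flush one upsert per (team, day)
_pending_team_spend: Dict[Tuple[str, str], float] = defaultdict(float)

def queue_team_spend(team_id: str, spend: float) -> None:
    today = datetime.date.today().isoformat()
    _pending_team_spend[(team_id, today)] += spend

async def flush_team_spend(db) -> None:
    """Upsert aggregated rows; the usage dashboard reads this table
    instead of scanning individual log entries."""
    for (team_id, day), spend in _pending_team_spend.items():
        # hypothetical prisma-style upsert on a DailyTeamSpend table
        await db.litellm_dailyteamspend.upsert(
            where={"team_id_date": {"team_id": team_id, "date": day}},
            data={
                "create": {"team_id": team_id, "date": day, "spend": spend},
                "update": {"spend": {"increment": spend}},
            },
        )
    _pending_team_spend.clear()
```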
* feat(litellm-proxy-extras/): add new migration file
* fix(db_spend_update_writer.py): fix return type
* build: bump requirements
* fix: fix ruff error
* fix(openai.py): ensure openai file object shows up on logs
* fix(managed_files.py): return unified file id as b64 str
allows file retrieval by id to work as expected
* fix(managed_files.py): apply decoded file id transformation
* fix: add unit test for file id + decode logic
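A small sketch of the encode/decode round trip for a unified file id, assuming the id is a plain string wrapped in URL-safe base64 (the helper names are illustrative):

```python
import base64

def encode_unified_file_id(unified_file_id: str) -> str:
    """Return the unified file id as a URL-safe base64 string,
    so it survives being handed back as an OpenAI-style file id."""
    return base64.urlsafe_b64encode(unified_file_id.encode("utf-8")).decode("utf-8")

def decode_unified_file_id(b64_file_id: str) -> str:
    """Reverse the transformation before looking the file up internally."""
    return base64.urlsafe_b64decode(b64_file_id.encode("utf-8")).decode("utf-8")

# round trip
original = "litellm_proxy:file-abc123"
assert decode_unified_file_id(encode_unified_file_id(original)) == original
```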
* fix: initial commit for litellm_proxy support with CRUD Endpoints
* fix(managed_files.py): support retrieve file operation
* fix(managed_files.py): support for DELETE endpoint for files
* fix(managed_files.py): retrieve file content support
supports OpenAI's retrieve file content API
* fix: fix linting error
* test: update tests
* fix: fix linting error
* feat(managed_files.py): support reading / writing files in DB
* feat(managed_files.py): support deleting file from DB on delete
* test: update testing
* fix(spend_tracking_utils.py): ensure each file create request is logged correctly
* fix(managed_files.py): fix storing / returning managed file object from cache
* fix(files/main.py): pass litellm params to azure route
* test: fix test
* build: add new prisma migration
* build: bump requirements
* test: add more testing
* refactor: cleanup post merge w/ main
* fix: fix code qa errors
* feat(internal_user_endpoints.py): return 'total_tokens' in `/user/daily/analytics`
* test(test_internal_user_endpoints.py): add unit test to assert spend metrics and dailyspend metadata always report the same fields
* build(schema.prisma): record success + failure calls to daily user table
allows understanding why model requests might exceed provider requests (e.g., the user hit a rate limit error)
* fix(internal_user_endpoints.py): report success / failure requests in API
* fix(proxy/utils.py): default to success
status can be missing or None at times for successful requests
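The defaulting logic amounts to roughly this (a sketch; the payload and counter field names are illustrative, not the exact ones in proxy/utils.py):

```python
def _get_request_status(payload: dict) -> str:
    """Successful requests sometimes arrive with status missing or None,
    so treat anything that isn't an explicit failure as a success."""
    return "failure" if payload.get("status") == "failure" else "success"

# counted into the daily user spend table
status = _get_request_status(standard_logging_payload)
if status == "success":
    daily_row["successful_requests"] = daily_row.get("successful_requests", 0) + 1
else:
    daily_row["failed_requests"] = daily_row.get("failed_requests", 0) + 1
```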
* feat(new_usage.tsx): show success/failure calls on UI
* style(new_usage.tsx): ui cleanup
* fix: fix linting error
* fix: fix linting error
* feat(litellm-proxy-extras/): add new migration files
* build(README.md): initial commit adding a separate folder for additional proxy files. Meant to reduce the size of the core package
* build(litellm-proxy-extras/): new pip package for storing migration files
allows the litellm proxy to use migration files without adding them to the core repo
* build(litellm-proxy-extras/): cleanup pyproject.toml
* build: move prisma migration files inside new proxy extras package
* build(run_migration.py): update script to write to correct folder
* build(proxy_cli.py): load in migration files from litellm-proxy-extras
Closes https://github.com/BerriAI/litellm/issues/9558
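A sketch of resolving the migrations folder from the installed extras package rather than the core repo; that the migrations sit in a `migrations/` directory inside `litellm_proxy_extras` is an assumption here:

```python
import importlib.util
from pathlib import Path

def get_migrations_dir() -> Path:
    """Locate the prisma migrations shipped with litellm-proxy-extras."""
    spec = importlib.util.find_spec("litellm_proxy_extras")
    if spec is None or spec.origin is None:
        raise RuntimeError(
            "litellm-proxy-extras is not installed; "
            "run `pip install litellm-proxy-extras` to use DB migrations"
        )
    return Path(spec.origin).parent / "migrations"
```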
* build: add MIT license to litellm-proxy-extras
* test: update test
* fix: fix schema
* bump: version 0.1.0 → 0.1.1
* build(publish-proxy-extras.sh): add script for publishing new proxy-extras version
* build(liccheck.ini): add litellm-proxy-extras to authorized packages
* fix(litellm-proxy-extras/utils.py): move prisma migrate logic inside extra proxy pkg
easier, since the migrations folder is already there
* build(pre-commit-config.yaml): add litellm_proxy_extras to ci tests
* docs(config_settings.md): document new env var
* build(pyproject.toml): bump relevant files when litellm-proxy-extras version changed
* build(pre-commit-config.yaml): run poetry check on litellm-proxy-extras as well
* Adding VertexAI Claude 3.7 Sonnet (#8774)
Co-authored-by: Emerson Gomes <emerson.gomes@thalesgroup.com>
* build(model_prices_and_context_window.json): add anthropic 3-7 models on vertex ai and bedrock
* Support video_url (#8743)
* Support video_url
Support VLMs that work with video.
Example implementation in vLLM: https://github.com/vllm-project/vllm/pull/10020
* llms openai.py: Add ChatCompletionVideoObject
Add data structures to support `video_url` in chat completion
* test test_completion.py: add test for video_url
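For reference, a request using the new content part might look like this (a sketch, assuming a `video_url` part that mirrors the `image_url` shape; the model name and URL are placeholders):

```python
import litellm

# hypothetical model name; any vLLM-served VLM that accepts video would do
response = litellm.completion(
    model="hosted_vllm/llava-video",
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "Describe what happens in this clip."},
                {"type": "video_url", "video_url": {"url": "https://example.com/clip.mp4"}},
            ],
        }
    ],
)
print(response.choices[0].message.content)
```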
* Arize Phoenix - ensure correct endpoint/protocol are used; and default to phoenix cloud (#8750)
* minor fixes to default to http and to ensure that the correct endpoint is used
* Update test_arize_phoenix.py
* prioritize http over grpc
* update sentry_sdk to latest version (#8588)
* Add anthropic thinking + reasoning content support (#8778)
* feat(anthropic/chat/transformation.py): add anthropic thinking param support
* feat(anthropic/chat/transformation.py): support returning thinking content for anthropic on streaming responses
* feat(anthropic/chat/transformation.py): return list of thinking blocks (include block signature)
allows usage in tool call responses
* fix(types/utils.py): extract and map reasoning_content from anthropic as content str
* test: add testing to ensure thinking_blocks are returned at the root
* fix(anthropic/chat/handler.py): return thinking blocks on streaming - include signature
* feat(factory.py): handle anthropic thinking blocks translation if in assistant response
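Putting the pieces above together, usage looks roughly like this (a sketch; the `thinking` parameter shape follows Anthropic's extended-thinking API, and `reasoning_content` / `thinking_blocks` are the fields these commits add):

```python
import litellm

response = litellm.completion(
    model="anthropic/claude-3-7-sonnet-20250219",
    messages=[{"role": "user", "content": "What is 27 * 43?"}],
    thinking={"type": "enabled", "budget_tokens": 1024},
)

message = response.choices[0].message
print(message.reasoning_content)  # thinking extracted and mapped to a content str
print(message.thinking_blocks)    # raw blocks incl. signature, reusable in tool-call turns
print(message.content)            # the final answer
```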
* test: handle openai internal instability
* test: handle openai audio instability
* ci: pin anthropic dep
* test: handle openai audio instability
* fix: fix linting error
* refactor(anthropic/chat/transformation.py): refactor function to remain <50 LOC
* fix: fix linting error
* fix: fix linting error
* fix: fix linting error
* fix: fix linting error
* test: handle index error
* bump: version 1.61.15 → 1.61.16
---------
Co-authored-by: Emerson Gomes <emerson.gomes@gmail.com>
Co-authored-by: Emerson Gomes <emerson.gomes@thalesgroup.com>
Co-authored-by: Pang Wu <104795337+pang-wu@users.noreply.github.com>
Co-authored-by: Nate Mar <67926244+nate-mar@users.noreply.github.com>
Co-authored-by: stephaneminisini <stephane.minisini@gmail.com>
* fix(factory.py): ensure tool call converts image url
Fixes https://github.com/BerriAI/litellm/issues/6953
* fix(transformation.py): support mp4 + pdf url's for vertex ai
Fixes https://github.com/BerriAI/litellm/issues/6936
* fix(http_handler.py): mask gemini api key in error logs
Fixes https://github.com/BerriAI/litellm/issues/6963
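The masking fix amounts to scrubbing the `key` query parameter before the URL can surface in an error message; a minimal sketch (the helper name is illustrative):

```python
import re

def _mask_gemini_key(url: str) -> str:
    """Gemini passes the API key as a `key=` query param, which would
    otherwise leak into raised HTTP error messages."""
    return re.sub(r"key=[^&\s]+", "key=********", url)

masked = _mask_gemini_key(
    "https://generativelanguage.googleapis.com/v1beta/models/"
    "gemini-pro:generateContent?key=AIzaSecret123"
)
# -> ...generateContent?key=********
```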
* docs(prometheus.md): update prometheus FAQs
* feat(auth_checks.py): ensure specific model access > wildcard model access
if a wildcard model is in the access group but the specific model is not, deny access
* fix(auth_checks.py): handle auth checks for team based model access groups
handles the scenario where a model access group is used for wildcard models
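One way to encode the precedence rule, as a sketch with illustrative names (the real check also accounts for access groups and team routing):

```python
import fnmatch
from typing import List

def can_access_model(
    requested_model: str,
    allowed_models: List[str],
    proxy_model_names: List[str],
) -> bool:
    """Specific model access takes precedence over wildcard access."""
    # an explicit grant wins outright
    if requested_model in allowed_models:
        return True
    # the model is explicitly registered on the proxy but not granted:
    # a wildcard entry (e.g. "openai/*") must not override that omission
    if requested_model in proxy_model_names:
        return False
    return any(
        fnmatch.fnmatch(requested_model, m) for m in allowed_models if "*" in m
    )
```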
* fix(internal_user_endpoints.py): support adding guardrails on `/user/update`
Fixes https://github.com/BerriAI/litellm/issues/6942
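A sketch of the call this enables; the admin key, user id, guardrail name, and the top-level placement of the `guardrails` field are all assumptions for illustration:

```python
import requests

resp = requests.post(
    "http://localhost:4000/user/update",
    headers={"Authorization": "Bearer sk-1234"},  # proxy admin key
    json={
        "user_id": "my-user",
        # guardrails to apply to this user's requests
        "guardrails": ["aporia-pre-guard"],
    },
)
resp.raise_for_status()
print(resp.json())
```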
* fix(key_management_endpoints.py): fix prepare_metadata_fields helper
* fix: fix tests
* build(requirements.txt): bump openai dep version
fixes the `proxies` argument
* test: fix tests
* fix(http_handler.py): fix error message masking
* fix(bedrock_guardrails.py): pass in prepped data
* test: fix test
* test: fix nvidia nim test
* fix(http_handler.py): return original response headers
* fix: revert maskedhttpstatuserror
* test: update tests
* test: cleanup test
* fix(key_management_endpoints.py): fix metadata field update logic
* fix(key_management_endpoints.py): maintain initial order of guardrails in key update
* fix(key_management_endpoints.py): handle prepare metadata
* fix: fix linting errors
* fix: fix linting errors
* fix: fix linting errors
* fix: fix key management errors
* fix(key_management_endpoints.py): update metadata
* test: update test
* refactor: add more debug statements
* test: skip flaky test
* test: fix test
* fix: fix test
* fix: fix update metadata logic
* fix: fix test
* ci(config.yml): change db url for e2e ui testing
* feat(azure/realtime): initial working commit for proxy azure openai realtime endpoint support
Adds support for passing `/v1/realtime` calls via the litellm proxy
* feat(realtime_api/main.py): abstraction for handling openai realtime api calls
* feat(router.py): add `arealtime()` endpoint in router for realtime api calls
Allows using `model_list` in the proxy for realtime as well
* fix: make realtime api a private function
Structure might change based on feedback. Make that clear to users.
* build(requirements.txt): add websockets to the requirements.txt
* feat(openai/realtime): add openai /v1/realtime api support
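End to end, a client can now open a realtime session against the proxy much as it would against OpenAI. A sketch, assuming the proxy mirrors OpenAI's `/v1/realtime` websocket query shape; the port, key, and model alias are placeholders:

```python
import asyncio
import json
import websockets  # now in requirements.txt

async def main():
    uri = "ws://localhost:4000/v1/realtime?model=gpt-4o-realtime-preview"
    headers = {"Authorization": "Bearer sk-1234"}  # proxy virtual key
    # note: newer websockets versions name this kwarg `additional_headers`
    async with websockets.connect(uri, extra_headers=headers) as ws:
        await ws.send(json.dumps({
            "type": "response.create",
            "response": {"modalities": ["text"], "instructions": "Say hello."},
        }))
        print(json.loads(await ws.recv()))

asyncio.run(main())
```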
* Upgrade dependencies in dockerfile
* Change apt-get to apk for alpine image
* Set requirements file to same as dockerfile
---------
Co-authored-by: Jacob Hagstedt <wcgs@novonordisk.com>