litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-25 02:34:29 +00:00

Author	SHA1	Message	Date
Krrish Dholakia	4a226814ff	fix(stream_chunk_builder_utils.py): don't set index on modelresponse	2025-04-16 09:28:00 -07:00
Krrish Dholakia	a743b6fc1f	fix(bedrock/common_utils.py): add us-west-1 to us regions	2025-04-16 08:00:39 -07:00
ChaoFu Yang	c07eea864e	/utils/token_counter: get model_info from deployment directly (#10047 )	2025-04-16 07:53:18 -07:00
Michael Leshchinsky	e19d05980c	Add litellm call id passing to Aim guardrails on pre and post-hooks calls (#10021 ) All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 16s Details Helm unit test / unit-test (push) Successful in 19s Details * Add litellm_call_id passing to aim guardrails on pre and post-hooks * Add test that ensures that pre_call_hook receives litellm call id when common_request_processing called	2025-04-16 07:41:28 -07:00
Ishaan Jaff	1d4fea509d	ui new build	2025-04-15 22:36:44 -07:00
Ishaan Jaff	dcc43e797a	[Docs] Auto prompt caching (#10044 ) * docs prompt cache controls * doc fix auto prompt caching	2025-04-15 22:29:47 -07:00
Krish Dholakia	fdfa1108a6	Add property ordering for vertex ai schema (#9828 ) + Fix combining multiple tool calls (#10040 ) * fix #9783: Retain schema field ordering for google gemini and vertex (#9828) * test: update test * refactor(groq.py): initial commit migrating groq to base_llm_http_handler * fix(streaming_chunk_builder_utils.py): fix how tool content is combined Fixes https://github.com/BerriAI/litellm/issues/10034 * fix(vertex_ai/common_utils.py): prevent infinite loop in helper function * fix(groq/chat/transformation.py): handle groq streaming errors correctly * fix(groq/chat/transformation.py): handle max_retries --------- Co-authored-by: Adrian Lyjak <adrian@chatmeter.com>	2025-04-15 22:29:25 -07:00
Krish Dholakia	1b9b745cae	Fix gcs pub sub logging with env var GCS_PROJECT_ID (#10042 ) * fix(pub_sub.py): fix passing project id in pub sub call Fixes issue where GCS_PUBSUB_PROJECT_ID was not being used * test(test_pub_sub.py): add unit test to prevent future regressions * test: fix test	2025-04-15 21:50:48 -07:00
Ishaan Jaff	bd88263b29	[Feat - Cost Tracking improvement] Track prompt caching metrics in DailyUserSpendTransactions (#10029 ) * stash changes * emit cache read/write tokens to daily spend update * emit cache read/write tokens on daily activity * update types.ts * docs prompt caching * undo ui change * fix activity metrics * fix prompt caching metrics * fix typed dict fields * fix get_aggregated_daily_spend_update_transactions * fix aggregating cache tokens * test_cache_token_fields_aggregation * daily_transaction * add cache_creation_input_tokens and cache_read_input_tokens to LiteLLM_DailyUserSpend * test_daily_spend_update_queue.py	2025-04-15 21:40:57 -07:00
Ishaan Jaff	d32d6fe03e	[UI] Bug Fix - Show created_at and updated_at for Users Page (#10033 ) * add created_at and updated_at as fields for internal user table * test_get_users_includes_timestamps	2025-04-15 21:15:44 -07:00
Krish Dholakia	9b77559ccf	Add aggregate team based usage logging (#10039 ) * feat(schema.prisma): initial commit adding aggregate table for team spend allows team spend to be visible at 1m+ logs * feat(db_spend_update_writer.py): support logging aggregate team spend allows usage dashboard to work at 1m+ logs * feat(litellm-proxy-extras/): add new migration file * fix(db_spend_update_writer.py): fix return type * build: bump requirements * fix: fix ruff error	2025-04-15 20:58:48 -07:00
Krish Dholakia	d3e7a137ad	Revert "fix #9783 : Retain schema field ordering for google gemini and vertex …" (#10038 ) This reverts commit `e3729f9855`.	2025-04-15 19:21:33 -07:00
Adrian Lyjak	e3729f9855	fix #9783 : Retain schema field ordering for google gemini and vertex (#9828 )	2025-04-15 19:12:02 -07:00
Marc Abramowitz	837a6948d8	Fix typo: Entrata -> Entra in code (#9922 ) * Fix typo: Entrata -> Entra * Fix a few more	2025-04-15 17:31:18 -07:00
Krish Dholakia	6b5f093087	Revert "Fix case where only system messages are passed to Gemini (#9992 )" (#10027 ) This reverts commit `2afd922f8c`.	2025-04-15 13:34:03 -07:00
Nolan Tremelling	2afd922f8c	Fix case where only system messages are passed to Gemini (#9992 )	2025-04-15 13:30:49 -07:00
Michael Schmid	14bcc9a6c9	feat: update region configuration in AmazonBedrockGlobalConfig (#9430 )	2025-04-15 09:59:32 -07:00
Krish Dholakia	33ead69c0a	Support checking provider `/models` endpoints on proxy `/v1/models` endpoint (#9958 ) * feat(utils.py): support global flag for 'check_provider_endpoints' enables setting this for `/models` on proxy * feat(utils.py): add caching to 'get_valid_models' Prevents checking endpoint repeatedly * fix(utils.py): ensure mutations don't impact cached results * test(test_utils.py): add unit test to confirm cache invalidation logic * feat(utils.py): get_valid_models - support passing litellm params dynamically Allows for checking endpoints based on received credentials * test: update test * feat(model_checks.py): pass router credentials to get_valid_models - ensures it checks correct credentials * refactor(utils.py): refactor for simpler functions * fix: fix linting errors * fix(utils.py): fix test * fix(utils.py): set valid providers to custom_llm_provider, if given * test: update test * fix: fix ruff check error	2025-04-14 23:23:20 -07:00
Eoous	e94eb4ec70	env for `litellm.modify_params` (#9964 ) All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 17s Details Helm unit test / unit-test (push) Successful in 23s Details	2025-04-14 22:33:56 -07:00
Krish Dholakia	9b0f871129	Add `/vllm/` and `/mistral/` passthrough endpoints (adds support for Mistral OCR via passthrough) * feat(llm_passthrough_endpoints.py): support mistral passthrough Closes https://github.com/BerriAI/litellm/issues/9051 * feat(llm_passthrough_endpoints.py): initial commit for adding vllm passthrough route * feat(vllm/common_utils.py): add new vllm model info route make it possible to use vllm passthrough route via factory function * fix(llm_passthrough_endpoints.py): add all methods to vllm passthrough route * fix: fix linting error * fix: fix linting error * fix: fix ruff check * fix(proxy/_types.py): add new passthrough routes * docs(config_settings.md): add mistral env vars to docs	2025-04-14 22:06:33 -07:00
Krish Dholakia	8faf56922c	Fix azure tenant id check from env var + response_format check on api_version 2025+ (#9993 ) * fix(azure/common_utils.py): check for azure tenant id, client id, client secret in env var Fixes https://github.com/BerriAI/litellm/issues/9598#issuecomment-2801966027 * fix(azure/gpt_transformation.py): fix passing response_format to azure when api year = 2025 Fixes https://github.com/BerriAI/litellm/issues/9703 * test: monkeypatch azure api version in test * test: update testing * test: fix test * test: update test * docs(config_settings.md): document env vars	2025-04-14 22:02:35 -07:00
Ishaan Jaff	ce2595f56a	bump: version 1.66.0 → 1.66.1	2025-04-14 21:30:07 -07:00
Ishaan Jaff	b210639dce	ui new build	2025-04-14 21:19:21 -07:00
Ishaan Jaff	c1a642ce20	[UI] Allow setting prompt `cache_control_injection_points` (#10000 ) * test_anthropic_cache_control_hook_system_message * test_anthropic_cache_control_hook.py * should_run_prompt_management_hooks * fix should_run_prompt_management_hooks * test_anthropic_cache_control_hook_specific_index * fix test * fix linting errors * ChatCompletionCachedContent * initial commit for cache control * fixes ui design * fix inserting cache_control_injection_points * fix entering cache control points * fixes for using cache control on ui + backend * update cache control settings on edit model page * fix init custom logger compatible class * fix linting errors * fix linting errors * fix get_chat_completion_prompt	2025-04-14 21:17:42 -07:00
Ishaan Jaff	6cfa50d278	[Feat] Add support for `cache_control_injection_points` for Anthropic API, Bedrock API (#9996 ) * test_anthropic_cache_control_hook_system_message * test_anthropic_cache_control_hook.py * should_run_prompt_management_hooks * fix should_run_prompt_management_hooks * test_anthropic_cache_control_hook_specific_index * fix test * fix linting errors * ChatCompletionCachedContent	2025-04-14 20:50:13 -07:00
Krish Dholakia	2ed593e052	Updated cohere v2 passthrough (#9997 ) * Add cohere `/v2/chat` pass-through cost tracking support (#8235) * feat(cohere_passthrough_handler.py): initial working commit with cohere passthrough cost tracking * fix(v2_transformation.py): support cohere /v2/chat endpoint * fix: fix linting errors * fix: fix import * fix(v2_transformation.py): fix linting error * test: handle openai exception change	2025-04-14 19:51:01 -07:00
Ishaan Jaff	24447eb0cd	fix gpt 4.1 costs (#9991 )	2025-04-14 12:50:14 -07:00
Krish Dholakia	bbb7541c22	build(model_prices_and_context_window.json): add gpt-4.1 pricing (#9990 ) * build(model_prices_and_context_window.json): add gpt-4.1 pricing * build(model_prices_and_context_window.json): add gpt-4.1-mini and gpt-4.1-nano model support	2025-04-14 12:14:46 -07:00
Ishaan Jaff	72c1f7e09a	ui new build	2025-04-12 20:42:43 -07:00
Ishaan Jaff	89dfb42697	[UI QA checklist] (#9957 ) * fix typo on UI * fix for edit user tab * fix for user spend * add /team/permissions_list to management routes * fix auth check for team member permissions * fix team endpoints test	2025-04-12 20:41:50 -07:00
Krish Dholakia	00e49380df	Litellm UI qa 04 12 2025 p1 (#9955 ) * fix(model_info_view.tsx): cleanup text * fix(key_management_endpoints.py): fix filtering litellm-dashboard keys for internal users * fix(proxy_track_cost_callback.py): prevent flooding spend logs with admin endpoint errors * test: add unit testing for logic * test(test_auth_exception_handler.py): add more unit testing * fix(router.py): correctly handle retrieving model info on get_model_group_info fixes issue where model hub was showing None prices * fix: fix linting errors	2025-04-12 19:30:48 -07:00
Ishaan Jaff	c86e678809	[Docs] v1.66.0-stable fixes (#9953 ) * add categories for spend tracking improvements * xai reasoning usage * docs tag management * docs tag based routing * [Beta] Routing based * docs tag based routing * docs tag routing * docs enterprise web search	2025-04-12 16:57:25 -07:00
Ishaan Jaff	4e81b2cab4	[Team Member permissions] - Fixes (#9945 ) * only load member permissions for non-admins * run member permission checks on update + regenerate endpoints * run check for /key/generate * working test_default_member_permissions * passing test with permissions on update delete endpoints * test_create_permissions * _team_key_generation_check * fix TeamBase * fix team endpoints * fix api docs check	2025-04-12 11:17:51 -07:00
Krish Dholakia	d004fb542f	fix(litellm_proxy_extras): add baselining db script (#9942 ) * fix(litellm_proxy_extras): add baselining db script Fixes https://github.com/BerriAI/litellm/issues/9885 * fix(prisma_client.py): fix ruff errors * ci(config.yml): add publish_proxy_extras step * fix(config.yml): compare contents between versions to check for changes * fix(config.yml): fix check * fix: install toml * fix: update check * fix: ensure versions in sync * fix: fix version compare * fix: correct the cost for 'gemini/gemini-2.5-pro-preview-03-25' (#9896) * fix: Typo in the cost 'gemini/gemini-2.5-pro-preview-03-25', closes #9854 * chore: update in backup file as well * Litellm add managed files db (#9930) * fix(openai.py): ensure openai file object shows up on logs * fix(managed_files.py): return unified file id as b64 str allows retrieve file id to work as expected * fix(managed_files.py): apply decoded file id transformation * fix: add unit test for file id + decode logic * fix: initial commit for litellm_proxy support with CRUD Endpoints * fix(managed_files.py): support retrieve file operation * fix(managed_files.py): support for DELETE endpoint for files * fix(managed_files.py): retrieve file content support supports retrieve file content api from openai * fix: fix linting error * test: update tests * fix: fix linting error * feat(managed_files.py): support reading / writing files in DB * feat(managed_files.py): support deleting file from DB on delete * test: update testing * fix(spend_tracking_utils.py): ensure each file create request is logged correctly * fix(managed_files.py): fix storing / returning managed file object from cache * fix(files/main.py): pass litellm params to azure route * test: fix test * build: add new prisma migration * build: bump requirements * test: add more testing * refactor: cleanup post merge w/ main * fix: fix code qa errors * [DB / Infra] Add new column team_member_permissions (#9941) * add team_member_permissions to team table * add migration.sql file * fix poetry lock * fix prisma migrations * fix poetry lock * fix migration * ui new build * fix(factory.py): correct indentation for message index increment in ollama, This fixes bug #9822 (#9943) * fix(factory.py): correct indentation for message index increment in ollama_pt function * test: add unit tests for ollama_pt function handling various message types * ci: update test * fix: fix check * ci: see what dir looks like * ci: more checks * ci: fix filepath * ci: cleanup * ci: fix ci --------- Co-authored-by: Nilanjan De <nilanjan.de@gmail.com> Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com> Co-authored-by: Dan Shaw <dan@danieljshaw.com>	2025-04-12 10:29:34 -07:00
Dan Shaw	433075a8d9	fix(factory.py): correct indentation for message index increment in ollama, This fixes bug #9822 (#9943 ) * fix(factory.py): correct indentation for message index increment in ollama_pt function * test: add unit tests for ollama_pt function handling various message types	2025-04-12 09:50:40 -07:00
Ishaan Jaff	69a3aab4c8	ui new build	2025-04-12 09:13:00 -07:00
Ishaan Jaff	fb0c3d9e18	[DB / Infra] Add new column team_member_permissions (#9941 ) * add team_member_permissions to team table * add migration.sql file * fix poetry lock * fix prisma migrations * fix poetry lock * fix migration	2025-04-12 09:06:04 -07:00
Krish Dholakia	421e0a3004	Litellm add managed files db (#9930 ) * fix(openai.py): ensure openai file object shows up on logs * fix(managed_files.py): return unified file id as b64 str allows retrieve file id to work as expected * fix(managed_files.py): apply decoded file id transformation * fix: add unit test for file id + decode logic * fix: initial commit for litellm_proxy support with CRUD Endpoints * fix(managed_files.py): support retrieve file operation * fix(managed_files.py): support for DELETE endpoint for files * fix(managed_files.py): retrieve file content support supports retrieve file content api from openai * fix: fix linting error * test: update tests * fix: fix linting error * feat(managed_files.py): support reading / writing files in DB * feat(managed_files.py): support deleting file from DB on delete * test: update testing * fix(spend_tracking_utils.py): ensure each file create request is logged correctly * fix(managed_files.py): fix storing / returning managed file object from cache * fix(files/main.py): pass litellm params to azure route * test: fix test * build: add new prisma migration * build: bump requirements * test: add more testing * refactor: cleanup post merge w/ main * fix: fix code qa errors	2025-04-12 08:24:46 -07:00
Nilanjan De	93037ea4d3	fix: correct the cost for 'gemini/gemini-2.5-pro-preview-03-25' (#9896 ) * fix: Typo in the cost 'gemini/gemini-2.5-pro-preview-03-25', closes #9854 * chore: update in backup file as well	2025-04-12 08:20:04 -07:00
Krish Dholakia	069aee9f70	fix(transformation.py): correctly translate 'thinking' param for lite… (#9904 ) All checks were successful Helm unit test / unit-test (push) Successful in 21s Details Read Version from pyproject.toml / read-version (push) Successful in 40s Details * fix(transformation.py): correctly translate 'thinking' param for litellm_proxy/ route Fixes https://github.com/BerriAI/litellm/issues/9892 * test: update test	2025-04-11 23:25:13 -07:00
Krish Dholakia	b9f01c9f5b	fix(databricks/common_utils.py): fix custom endpoint check (#9925 ) * fix(databricks/common_utils.py): fix custom endpoint check Fixes https://github.com/BerriAI/litellm/issues/9915 * fix(common_utils.py): add unit test to ensure custom_endpoint=False is handled correctly Fixes https://github.com/BerriAI/litellm/issues/9915	2025-04-11 23:20:49 -07:00
Krish Dholakia	3ca82c22b6	Support CRUD endpoints for Managed Files (#9924 ) * fix(openai.py): ensure openai file object shows up on logs * fix(managed_files.py): return unified file id as b64 str allows retrieve file id to work as expected * fix(managed_files.py): apply decoded file id transformation * fix: add unit test for file id + decode logic * fix: initial commit for litellm_proxy support with CRUD Endpoints * fix(managed_files.py): support retrieve file operation * fix(managed_files.py): support for DELETE endpoint for files * fix(managed_files.py): retrieve file content support supports retrieve file content api from openai * fix: fix linting error * test: update tests * fix: fix linting error * fix(files/main.py): pass litellm params to azure route * test: fix test	2025-04-11 21:48:27 -07:00
Ishaan Jaff	57bc03b30b	[Feat] Add reasoning_effort support for `xai/grok-3-mini-beta` model family (#9932 ) * add BaseReasoningEffortTests * BaseReasoningLLMTests * fix test rename * docs update thinking / reasoning content docs	2025-04-11 19:17:09 -07:00
Ishaan Jaff	f9ce754817	[Feat] Add litellm.supports_reasoning() util to track if an llm supports reasoning (#9923 ) * add supports_reasoning for xai models * add "supports_reasoning": true for o1 series models * add supports_reasoning util * add litellm.supports_reasoning * add supports reasoning for claude 3-7 models * add deepseek as supports reasoning * test_supports_reasoning * add supports reasoning to model group info * add supports_reasoning * docs supports reasoning * fix supports_reasoning test * "supports_reasoning": false, * fix test * supports_reasoning	2025-04-11 17:56:04 -07:00
Ishaan Jaff	91c0a794b9	[Feat - Team Member Permissions] - CRUD Endpoints for managing team member permissions (#9919 ) * add team_member_permissions * add GetTeamMemberPermissionsRequest types * crud endpoint for team member permissions * test team member permissions CRUD * fix GetTeamMemberPermissionsRequest	2025-04-11 17:15:16 -07:00
Ishaan Jaff	2d6ad534bc	[Feat - PR1] Add xAI grok-3 models to LiteLLM (#9920 ) * add xai/grok-3-mini-beta, xai/grok-3-beta * add grok-3-fast-latest models * supports_response_schema * fix pricing * docs xai	2025-04-11 15:12:12 -07:00
Ishaan Jaff	8b1d2d6956	[Feat - UI] - Allow setting Default Team setting when LiteLLM SSO auto creates teams (#9918 ) * endpoint for updating default team settings on ui * add GET default team settings endpoint * ui expose default team settings on UI * update to use DefaultTeamSSOParams * DefaultTeamSSOParams * fix DefaultTeamSSOParams * docs team management * test_update_default_team_settings	2025-04-11 14:07:10 -07:00
Krish Dholakia	0415f1205e	Litellm dev 04 10 2025 p3 (#9903 ) * feat(managed_files.py): encode file type in unified file id simplify calling gemini models * fix(common_utils.py): fix extracting file type from unified file id * fix(litellm_logging.py): create standard logging payload for create file call * fix: fix linting error	2025-04-11 09:29:42 -07:00
Krish Dholakia	9f27e8363f	Realtime API: Support 'base_model' cost tracking + show response in spend logs (if enabled) (#9897 ) * refactor(litellm_logging.py): refactor realtime cost tracking to use common code as rest Ensures basic features like base model just work * feat(realtime/): support 'base_model' cost tracking on realtime api Fixes issue where base model was not working on realtime * fix: fix ruff linting error * test: fix test	2025-04-10 21:24:45 -07:00
Krish Dholakia	78879c68a9	Revert avglogprobs change + Add azure/gpt-4o-realtime-audio cost tracking (#9893 ) * test: initial commit fixing gemini logprobs Fixes https://github.com/BerriAI/litellm/issues/9888 * fix(vertex_and_google_ai_studio.py): Revert avglogprobs change Fixes https://github.com/BerriAI/litellm/issues/8890 * build(model_prices_and_context_window.json): add gpt-4o-realtime-preview cost to model cost map Fixes https://github.com/BerriAI/litellm/issues/9814 * test: add cost calculation unit testing * test: fix test * test: update test	2025-04-10 21:23:55 -07:00

1 2 3 4 5 ...

13405 commits