litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-27 03:34:10 +00:00

Author	SHA1	Message	Date
Ishaan Jaff	b492551d3d	(fix) IBM Watsonx using ZenApiKey (#7821 ) * ibm watsonx fix * test ZenAPIKey * fix zenapikey	2025-01-16 22:02:36 -08:00
Ishaan Jaff	5b36985c00	run ci/cd again	2025-01-16 22:02:03 -08:00
Ishaan Jaff	ef9177f0a8	test_completion_mistral_api_mistral_large_function_call	2025-01-16 21:50:56 -08:00
Ishaan Jaff	117256d264	test_async_vertexai_streaming_response	2025-01-16 21:45:12 -08:00
Ishaan Jaff	5458a2ff33	fireworks ai use llama-v3p1-8b-instruct	2025-01-16 21:28:44 -08:00
Krrish Dholakia	8ab1335ae0	test: fix unit test	2025-01-16 21:11:17 -08:00
Ishaan Jaff	2f38e72026	test commit on main	2025-01-16 20:52:55 -08:00
Krish Dholakia	c57266c9dc	test: initial commit enforcing testing on all anthropic pass through … (#7794 ) * test: initial commit enforcing testing on all anthropic pass through functions prevents future regressions * test(test_unit_test_anthropic_pass_through.py): add unit test for '_get_user_from_metadata' function * test(test_unit_test_anthropic_passthrough.py): add unit test for handle_logging_anthropic_collected_chunks * test(test_unit_test_anthropic_pass_through): add coverage for all anthropic pass through functions	2025-01-15 22:02:35 -08:00
Krish Dholakia	843cd3b7c6	test: initial test to enforce all functions in user_api_key_auth.py h… (#7797 ) * test: initial test to enforce all functions in user_api_key_auth.py have direct testing * test(test_user_api_key_auth.py): add is_allowed_route unit test * test(test_user_api_key_auth.py): add more tests * test(test_user_api_key_auth.py): add complete testing coverage for all functions in `user_api_key_auth.py` * test(test_db_schema_changes.py): add a unit test to ensure all db schema changes are backwards compatible gives user an easy rollback path * test: fix schema compatibility test filepath * test: fix test	2025-01-15 21:52:45 -08:00
Krish Dholakia	80d6bbec29	Litellm dev 01 14 2025 p2 (#7772 ) * feat(pass_through_endpoints.py): fix anthropic end user cost tracking * fix(anthropic/chat/transformation.py): use returned provider model for anthropic handles anthropic `-latest` tag in request body throwing cost calculation errors ensures we can be accurate in our model cost tracking * feat(model_prices_and_context_window.json): add gemini-2.0-flash-thinking-exp pricing * test: update test to use assumption that user_api_key_dict can get anthropic user id * test: fix test * fix: fix test * fix(anthropic_pass_through.py): uncomment previous anthropic end-user cost tracking code block can't guarantee user api key dict always has end user id - too many code paths * fix(user_api_key_auth.py): this allows end user id from request body to always be read and set in auth object * fix(auth_check.py): fix linting error * test: fix auth check * fix(auth_utils.py): fix get end user id to handle metadata = None	2025-01-15 21:34:50 -08:00
Krish Dholakia	fe60a38c8e	Litellm dev 01 2025 p4 (#7776 ) All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 13s Details * fix(gemini/): support gemini 'frequency_penalty' and 'presence_penalty' Closes https://github.com/BerriAI/litellm/issues/7748 * feat(proxy_server.py): new env var to disable prisma health check on startup * test: fix test	2025-01-14 21:49:25 -08:00
Krish Dholakia	8353caa485	build(pyproject.toml): bump uvicorn depedency requirement (#7773 ) * build(pyproject.toml): bump uvicorn depedency requirement Fixes https://github.com/BerriAI/litellm/issues/7768 * fix(anthropic/chat/transformation.py): fix is_vertex_request check to actually use optional param passed in Fixes https://github.com/BerriAI/litellm/issues/6898#issuecomment-2590860695 * fix(o1_transformation.py): fix azure o1 'is_o1_model' check to just check for o1 in model string https://github.com/BerriAI/litellm/issues/7743 * test: load vertex creds	2025-01-14 21:47:11 -08:00
Ishaan Jaff	30bb4c4cdd	(fix) `BaseAWSLLM` - cache IAM role credentials when used (#7775 ) * fix base aws llm * fix auth with aws role * test aws base llm * fix base aws llm init * run ci/cd again * fix get_credentials * ci/cd run again * _auth_with_aws_role	2025-01-14 20:16:22 -08:00
Ishaan Jaff	5fbbf47581	(Feat) prometheus - emit remaining team budget metric on proxy startup (#7777 ) * fix get_paginated_teams * use _initialize_remaining_budget_metrics * fix prom metric * run ci/cd again * fix run async func * fix _initialize_prometheus_startup_metrics * fix _initialize_prometheus_startup_metrics * prom unit tests * test_get_paginated_teams	2025-01-14 20:08:23 -08:00
Krish Dholakia	35919d9fec	Litellm dev 01 13 2025 p2 (#7758 ) All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 12s Details * fix(factory.py): fix bedrock document url check Make check more generic - if starts with 'text' or 'application' assume it's a document and let it go through Fixes https://github.com/BerriAI/litellm/issues/7746 * feat(key_management_endpoints.py): support writing new key alias to aws secret manager - on key rotation adds rotation endpoint to aws key management hook - allows for rotated litellm virtual keys with new key alias to be written to it * feat(key_management_event_hooks.py): support rotating keys and updating secret manager * refactor(base_secret_manager.py): support rotate secret at the base level since it's just an abstraction function, it's easy to implement at the base manager level * style: cleanup unused imports	2025-01-14 17:04:01 -08:00
Krish Dholakia	7b27cfb0ae	Support temporary budget increases on keys (#7754 ) * fix(gpt_transformation.py): fix response_format translation check for 4o models Fixes https://github.com/BerriAI/litellm/issues/7616 * feat(key_management_endpoints.py): support 'temp_budget_increase' and 'temp_budget_expiry' fields Allow proxy admin to grant temporary budget increases to keys * fix(proxy/_types.py): enforce temp_budget_increase and temp_budget_expiry are always passed together * feat(user_api_key_auth.py): initial working temp budget increase logic ensures key budget exceeded error checks for temp budget in key metadata * feat(proxy_server.py): return the key max budget and key spend in the response headers Allows clientside user to know their remaining limits * test: add unit testing for new proxy utils Ensures new key budget is correctly handled * docs(temporary_budget_increase.md): add doc on temporary budget increase * fix(utils.py): remove 3.5 from response_format check for now not all azure 3.5 models support response_format * fix(user_api_key_auth.py): return valid user api key auth object on all paths	2025-01-14 17:03:11 -08:00
Krish Dholakia	29663c2db5	Litellm dev 01 14 2025 p1 (#7771 ) * First-class Aim Guardrails support (#7738) * initial aim support * add tests * docs(langsmith_integration.md): cleanup * style: cleanup unused imports --------- Co-authored-by: Tomer Bin <117278227+hxtomer@users.noreply.github.com>	2025-01-14 16:18:21 -08:00
Ishaan Jaff	d510f1d517	(fix) health check - allow setting `health_check_model` (#7752 ) * use _update_litellm_params_for_health_check * fix Wildcard Routes * test_update_litellm_params_for_health_check * test_perform_health_check_with_health_check_model * fix doc string * huggingface/mistralai/Mistral-7B-Instruct-v0.3	2025-01-13 20:16:44 -08:00
Ishaan Jaff	9daa6fb0b4	(prometheus - minor bug fix) - `litellm_llm_api_time_to_first_token_metric` not populating for bedrock models (#7740 ) * fix prometheus ttft * fix test_set_latency_metrics * fix _set_latency_metrics * fix _set_latency_metrics * fix test_set_latency_metrics * test_async_log_success_event * huggingface/mistralai/Mistral-7B-Instruct-v0.3	2025-01-13 20:16:34 -08:00
Ishaan Jaff	f1335362cf	(core sdk fix) - fix fallbacks stuck in infinite loop (#7751 ) * test_acompletion_fallbacks_basic * use common run_async_function * fix completion_with_fallbacks * fix completion with fallbacks * fix fallback utils * test_acompletion_fallbacks_basic * test_completion_fallbacks_sync * huggingface/mistralai/Mistral-7B-Instruct-v0.3	2025-01-13 19:34:34 -08:00
Ishaan Jaff	970e9c7507	huggingface/mistralai/Mistral-7B-Instruct-v0.3	2025-01-13 18:42:36 -08:00
Ishaan Jaff	3fe1f3b3b2	test_team_access_groups All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 12s Details	2025-01-12 22:26:13 -08:00
Krish Dholakia	ec5a354eac	add azure o1 pricing (#7715 ) * build(model_prices_and_context_window.json): add azure o1 pricing Closes https://github.com/BerriAI/litellm/issues/7712 * refactor: replace regex with string method for whitespace check in stop-sequences handling (#7713) * Allows overriding keep_alive time in ollama (#7079) * Allows overriding keep_alive time in ollama * Also adds to ollama_chat * Adds some info on the docs about this parameter * fix: together ai warning (#7688) Co-authored-by: Carl Senze <carl.senze@aleph-alpha.com> * fix(proxy_server.py): handle config containing thread locked objects when using get_config_state * fix(proxy_server.py): add exception to debug * build(model_prices_and_context_window.json): update 'supports_vision' for azure o1 --------- Co-authored-by: Wolfram Ravenwolf <52386626+WolframRavenwolf@users.noreply.github.com> Co-authored-by: Regis David Souza Mesquita <github@rdsm.dev> Co-authored-by: Carl <45709281+capsenz@users.noreply.github.com> Co-authored-by: Carl Senze <carl.senze@aleph-alpha.com>	2025-01-12 18:15:35 -08:00
Ishaan Jaff	15b52039d2	(litellm sdk speedup router) - adds a helper `_cached_get_model_group_info` to use when trying to get deployment tpm/rpm limits (#7719 ) * fix _cached_get_model_group_info * fixes get_remaining_model_group_usage * test_cached_get_model_group_info	2025-01-12 15:14:54 -08:00
Krish Dholakia	ad2f66b3e3	[BETA] Add OpenAI `/images/variations` + Topaz API support (#7700 ) * feat(main.py): initial commit for `/image/variations` endpoint support * refactor(base_llm/): introduce new base llm base config for image variation endpoints * refactor(openai/image_variations/transformation.py): implement openai image variation transformation handler * fix: test * feat(openai/): working openai `/image/variation` endpoint calls via sdk * feat(topaz/): topaz sync image variation call support Addresses https://github.com/BerriAI/litellm/issues/7593 ' * fix(topaz/transformation.py): fix linting errors * fix(openai/image_variations/handler.py): fix passing json data * fix(main.py): image_variation/ support async image variation route - `aimage_variation` * fix(test_get_model_info.py): fix test * fix: cleanup unused imports * feat(openai/): add async `/image/variations` endpoint support * feat(topaz/): support async `/image/variations` calls * fix: test * fix(utils.py): fix get_model_info_helper for no model info w/ provider config handles situation where model info is not known but provider config exists * test(test_router_fallbacks.py): mark flaky test * fix: fix unused imports * test: bump otel load test perf threshold - accounts for current load tests hitting same server	2025-01-11 23:27:46 -08:00
Krish Dholakia	becd4bc748	Litellm dev 01 11 2025 p3 (#7702 ) * fix(__init__.py): fix init to exclude pricing-only model cost values from real model names prevents bad health checks on wildcard routes * fix(get_llm_provider.py): fix to handle calling bedrock_converse models	2025-01-11 20:06:54 -08:00
Krish Dholakia	27892acdfc	Litellm dev 01 10 2025 p3 (#7682 ) * feat(langfuse.py): log the used prompt when prompt management used * test: fix test * docs(self_serve.md): add doc on restricting personal key creation on ui * feat(s3.py): support s3 logging with team alias prefixes (if available) New preview feature * fix(main.py): remove old if block - simplify to just await if coroutine returned fixes lm_studio async embedding error * fix(langfuse.py): handle get prompt check	2025-01-10 21:56:42 -08:00
Krish Dholakia	c4780479a9	Litellm dev 01 10 2025 p2 (#7679 ) * test(test_basic_python_version.py): assert all optional dependencies are marked as extras on poetry Fixes https://github.com/BerriAI/litellm/issues/7677 * docs(secret.md): clarify 'read_and_write' secret manager usage on aws * docs(secret.md): fix doc * build(ui/teams.tsx): add edit/delete button for updating user / team membership on ui allows updating user role to admin on ui * build(ui/teams.tsx): display edit member component on ui, when edit button on member clicked * feat(team_endpoints.py): support updating team member role to admin via api endpoints allows team member to become admin post-add * build(ui/user_dashboard.tsx): if team admin - show all team keys Fixes https://github.com/BerriAI/litellm/issues/7650 * test(config.yml): add tomli to ci/cd * test: don't call python_basic_testing in local testing (covered by python 3.13 testing)	2025-01-10 21:50:53 -08:00
Ishaan Jaff	02f5c44a35	[Bug fix]: Proxy Auth Layer - Allow Azure Realtime routes as llm_api_routes (#7684 ) * fix route check azure realtime endpoints * test_is_llm_api_route * fix /realtime * test_routes_on_litellm_proxy	2025-01-10 20:38:06 -08:00
Ishaan Jaff	5c870c0c51	(performance improvement - litellm sdk + proxy) - ensure litellm does not create unnecessary threads when running async functions (#7680 ) * fix handle_sync_success_callbacks_for_async_calls * fix handle_sync_success_callbacks_for_async_calls * fix linting / testing errors * use handle_sync_success_callbacks_for_async_calls * add unit testing for logging fixes	2025-01-10 17:57:22 -08:00
Krish Dholakia	a3e65c9bcb	LiteLLM Minor Fixes & Improvements (01/10/2025) - p1 (#7670 ) * test(test_get_model_info.py): add unit test confirming router deployment updates global 'get_model_info' * fix(get_supported_openai_params.py): fix custom llm provider 'get_supported_openai_params' Fixes https://github.com/BerriAI/litellm/issues/7668 * docs(azure.md): clarify how azure ad token refresh on proxy works Closes https://github.com/BerriAI/litellm/issues/7665	2025-01-10 17:49:05 -08:00
Krish Dholakia	c10ae8879e	fix(vertex_ai/gemini/transformation.py): handle 'http://' in gemini p… (#7660 ) * fix(vertex_ai/gemini/transformation.py): handle 'http://' in gemini process url * refactor(router.py): refactor '_prompt_management_factory' to use logging obj get_chat_completion logic deduplicates code * fix(litellm_logging.py): update 'get_chat_completion_prompt' to update logging object messages * docs(prompt_management.md): update prompt management to be in beta given feedback - this still needs to be revised (e.g. passing in user message, not ignoring) * refactor(prompt_management_base.py): introduce base class for prompt management allows consistent behaviour across prompt management integrations * feat(prompt_management_base.py): support adding client message to template message + refactor langfuse prompt management to use prompt management base * fix(litellm_logging.py): log prompt id + prompt variables to langfuse if set allows tracking what prompt was used for what purpose * feat(litellm_logging.py): log prompt management metadata in standard logging payload + use in langfuse allows logging prompt id / prompt variables to langfuse * test: fix test * fix(router.py): cleanup unused imports * fix: fix linting error * fix: fix trace param typing * fix: fix linting errors * fix: fix code qa check	2025-01-10 07:31:59 -08:00
Krish Dholakia	865e6d5bda	fix(main.py): fix lm_studio/ embedding routing (#7658 ) All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 36s Details * fix(main.py): fix lm_studio/ embedding routing adds the mapping + updates docs with example * docs(self_serve.md): update doc to show how to auto-add sso users to teams * fix(streaming_handler.py): simplify async iterator check, to just check if streaming response is an async iterable	2025-01-09 23:03:24 -08:00
Ishaan Jaff	13f364682d	(Feat - Batches API) add support for retrieving vertex api batch jobs (#7661 ) * add _async_retrieve_batch * fix aretrieve_batch * fix _get_batch_id_from_vertex_ai_batch_response * fix batches docs	2025-01-09 18:35:03 -08:00
Ishaan Jaff	2507c275f6	(proxy perf improvement) - use `uvloop` for higher RPS (10%-20% higher RPS) (#7662 ) * uvicorn use uvloop * fix uvloop==0.21.0 * add uvloop to pyproject * test_completion_response_ratelimit_headers	2025-01-09 18:11:20 -08:00
Ishaan Jaff	a85de46ef7	(proxy - RPS) - Get 2K RPS at 4 instances, minor fix `aiohttp_openai/` (#7659 ) * speed up transform_response * use 2 workers * undo changes to uvicorn * ci/cd run again	2025-01-09 17:24:18 -08:00
Krish Dholakia	907bcd3a62	Litellm dev 01 08 2025 p1 (#7640 ) All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 36s Details * feat(ui_sso.py): support reading team ids from sso token * feat(ui_sso.py): working upsert sso user teams membership in litellm - if team exists Adds user to relevant teams, if user is part of teams and team exists on litellm * fix(ui_sso.py): safely handle add team member task * build(ui/): support setting team id when creating team on UI * build(ui/): teams.tsx allow setting team id on ui * build(circle_ci/requirements.txt): add fastapi-sso to ci/cd testing * fix: fix linting errors	2025-01-08 22:08:20 -08:00
Krish Dholakia	1e3370f3cb	LiteLLM Minor Fixes & Improvements (01/08/2025) - p2 (#7643 ) * fix(streaming_chunk_builder_utils.py): add test for groq tool calling + streaming + combine chunks Addresses https://github.com/BerriAI/litellm/issues/7621 * fix(streaming_utils.py): fix modelresponseiterator for openai like chunk parser ensures chunk parser uses the correct tool call id when translating the chunk Fixes https://github.com/BerriAI/litellm/issues/7621 * build(model_hub.tsx): display cost pricing on model hub * build(model_hub.tsx): show cost per token pricing + complete model information * fix(types/utils.py): fix usage object handling	2025-01-08 19:45:19 -08:00
Ishaan Jaff	48d4f79206	fix is llm api route check (#7631 )	2025-01-08 18:45:59 -08:00
Krish Dholakia	4af23353d6	Allow assigning teams to org on UI + OpenAI `omni-moderation` cost model tracking (#7566 ) * feat(cost_calculator.py): add cost tracking ($0) for openai moderations endpoint removes sentry cost tracking errors caused by this * build(teams.tsx): allow assigning teams to orgs	2025-01-08 16:58:21 -08:00
Krish Dholakia	0ffc5379ea	Litellm dev 01 07 2025 p2 (#7622 ) * build(ui/): update ui * fix: drop unsupported non-whitespace characters for real when calling… (#7484) * fix: drop unsupported non-whitespace characters for real when calling anthropic with stop sequences * test: add parameterized test for _map_stop_sequences method in AnthropicConfig --------- Co-authored-by: Wolfram Ravenwolf <52386626+WolframRavenwolf@users.noreply.github.com>	2025-01-08 16:56:39 -08:00
Krish Dholakia	a187cee538	Litellm dev 01 07 2025 p3 (#7635 ) * fix(__init__.py): fix mistral large tool calling map bedrock mistral large to converse endpoint Fixes https://github.com/BerriAI/litellm/issues/7521 * braintrust logging: respect project_id, add more metrics + more (#7613) * braintrust logging: respect project_id, add more metrics * braintrust logger: improve json formatting * braintrust logger: add test for passing specific project_id * rm unneeded import * braintrust logging: rm unneeded var in tets * add project_name * update docs --------- Co-authored-by: H <no@email.com> --------- Co-authored-by: hi019 <65871571+hi019@users.noreply.github.com> Co-authored-by: H <no@email.com>	2025-01-08 11:46:24 -08:00
Krish Dholakia	e8ed40a27b	Litellm dev 01 01 2025 p2 (#7615 ) * fix(utils.py): prevent double logging when passing 'fallbacks=' to .completion() Fixes https://github.com/BerriAI/litellm/issues/7477 * fix(utils.py): fix vertex anthropic check * fix(utils.py): ensure supported params is always set Fixes https://github.com/BerriAI/litellm/issues/7470 * test(test_optional_params.py): add unit testing to prevent mistranslation Fixes https://github.com/BerriAI/litellm/issues/7470 * fix: fix linting error * test: cleanup	2025-01-07 21:40:33 -08:00
Ishaan Jaff	081826a5d6	(Feat) soft budget alerts on keys (#7623 ) * class WebhookEvent(CallInfo): Add * handle soft budget alerts * handle soft budget * fix budget alerts * fix CallInfo * fix _get_user_info_str * test_soft_budget_alerts * test_soft_budget_alert	2025-01-07 21:36:34 -08:00
Krish Dholakia	4e69711411	Litellm dev 01 07 2025 p1 (#7618 ) * fix(main.py): pass custom llm provider on litellm logging provider update * fix(cost_calculator.py): don't append provider name to return model if existing llm provider Fixes https://github.com/BerriAI/litellm/issues/7607 * fix(prometheus_services.py): fix prometheus system health error logging Fixes https://github.com/BerriAI/litellm/issues/7611	2025-01-07 21:22:31 -08:00
Ishaan Jaff	55139b8fd6	update tests All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 34s Details	2025-01-06 22:36:00 -08:00
Ishaan Jaff	2ca0977921	`aiohttp_openai/` fixes - allow using `aiohttp_openai/gpt-4o` (#7598 ) * fixes for get_complete_url * update aiohttp tests * fix event loop for aiohtto * ci/cd run again * test_aiohttp_openai	2025-01-06 21:39:11 -08:00
Ishaan Jaff	744beac754	ci/cd run again	2025-01-06 21:35:34 -08:00
Krish Dholakia	fef7839e8a	Litellm dev 01 06 2025 p1 (#7594 ) * fix(custom_logger.py): expose new 'async_get_chat_completion_prompt' event hook * fix(custom_logger.py): langfuse_prompt_management.py remove 'headers' from custom logger 'async_get_chat_completion_prompt' and 'get_chat_completion_prompt' event hooks * feat(router.py): expose new function for prompt management based routing * feat(router.py): partial working router prompt factory logic allows load balanced model to be used for model name w/ langfuse prompt management call * feat(router.py): fix prompt management with load balanced model group * feat(langfuse_prompt_management.py): support reading in openai params from langfuse enables user to define optional params on langfuse vs. client code * test(test_Router.py): add unit test for router based langfuse prompt management * fix: fix linting errors	2025-01-06 21:26:21 -08:00
Krish Dholakia	0c3fef24cd	Litellm dev 01 06 2025 p2 (#7597 ) * test(test_amazing_vertex_completion.py): fix test * test: initial working code gecko test * fix(vertex_ai_non_gemini.py): support vertex ai code gecko fake streaming Fixes https://github.com/BerriAI/litellm/issues/7360 * test(test_get_model_info.py): add test for getting custom provider model info Covers https://github.com/BerriAI/litellm/issues/7575 * fix(utils.py): fix get_provider_model_info check Handle custom llm provider scenario Fixes https://github.com/ BerriAI/litellm/issues/7575	2025-01-06 21:04:49 -08:00

... 17 18 19 20 21 ...

1687 commits