litellm-mirror/litellm/litellm_core_utils
Latest commit by Krish Dholakia (8903bd1c7f):
fix(utils.py): fix vertex ai optional param handling (#8477)
* fix(utils.py): fix vertex ai optional param handling

don't pass max retries to unsupported route

Fixes https://github.com/BerriAI/litellm/issues/8254

* fix(get_supported_openai_params.py): fix linting error

* fix(get_supported_openai_params.py): default to openai-like spec

* test: fix test

* fix: fix linting error

* Improved wildcard route handling on `/models` and `/model_group/info`  (#8473)

* fix(model_checks.py): update returning known model from wildcard to filter based on given model prefix

ensures wildcard route - `vertex_ai/gemini-*` just returns known vertex_ai/gemini- models

* test(test_proxy_utils.py): add unit testing for new 'get_known_models_from_wildcard' helper

* test(test_models.py): add e2e testing for `/model_group/info` endpoint

* feat(prometheus.py): support tracking total requests by user_email on prometheus

adds initial support for tracking total requests by user_email

* test(test_prometheus.py): add testing to ensure user email is always tracked

* test: update testing for new prometheus metric

* test(test_prometheus_unit_tests.py): add user email to total proxy metric

* test: update tests

* test: fix spend tests

* test: fix test

* fix(pagerduty.py): fix linting error

* (Bug fix) - Using `include_usage` for /completions requests + unit testing (#8484)

* pass stream options (#8419)

* test_completion_streaming_usage_metrics

* test_text_completion_include_usage

---------

Co-authored-by: Kaushik Deka <55996465+Kaushikdkrikhanu@users.noreply.github.com>

* fix naming docker stable release

* build(model_prices_and_context_window.json): handle azure model update

* docs(token_auth.md): clarify scopes can be a list or comma separated string

* docs: fix docs

* add sonar pricings (#8476)

* add sonar pricings

* Update model_prices_and_context_window.json

* Update model_prices_and_context_window.json

* Update model_prices_and_context_window_backup.json

* update load testing script

* fix test_async_router_context_window_fallback

* pplx - fix supports tool choice openai param (#8496)

* fix prom check startup (#8492)

* test_async_router_context_window_fallback

* ci(config.yml): mark daily docker builds with `-nightly` (#8499)

Resolves https://github.com/BerriAI/litellm/discussions/8495

* (Redis Cluster) - Fixes for using redis cluster + pipeline (#8442)

* update RedisCluster creation

* update RedisClusterCache

* add redis ClusterCache

* update async_set_cache_pipeline

* cleanup redis cluster usage

* fix redis pipeline

* test_init_async_client_returns_same_instance

* fix redis cluster

* update mypy_path

* fix init_redis_cluster

* remove stub

* test redis commit

* ClusterPipeline

* fix import

* RedisCluster import

* fix redis cluster

* Potential fix for code scanning alert no. 2129: Clear-text logging of sensitive information

Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>

* fix naming of redis cluster integration

* test_redis_caching_ttl_pipeline

* fix async_set_cache_pipeline

---------

Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>

* Litellm UI stable version 02 12 2025 (#8497)

* fix(key_management_endpoints.py): fix `/key/list` to include `return_full_object` as a top-level query param

Allows user to specify they want the keys as a list of objects

* refactor(key_list.tsx): initial refactor of key table in user dashboard

offloads key filtering logic to backend api

prevents common error of user not being able to see their keys

* fix(key_management_endpoints.py): allow internal user to query `/key/list` to see their keys

* fix(key_management_endpoints.py): add validation checks and filtering to `/key/list` endpoint

allows internal users to see their own keys, not anybody else's

* fix(view_key_table.tsx): fix issue where internal user could not see default team keys

* fix: fix linting error

* fix: fix linting error

* fix: fix linting error

* fix: fix linting error

* fix: fix linting error

* fix: fix linting error

* fix: fix linting error

* test_supports_tool_choice

* test_async_router_context_window_fallback

* fix: fix test (#8501)

* Litellm dev 02 12 2025 p1 (#8494)

* Resolves https://github.com/BerriAI/litellm/issues/6625 (#8459)

- enables no auth for SMTP

Signed-off-by: Regli Daniel <daniel.regli1@sanitas.com>

* add sonar pricings (#8476)

* add sonar pricings

* Update model_prices_and_context_window.json

* Update model_prices_and_context_window.json

* Update model_prices_and_context_window_backup.json

* test: fix test

---------

Signed-off-by: Regli Daniel <daniel.regli1@sanitas.com>
Co-authored-by: Dani Regli <1daniregli@gmail.com>
Co-authored-by: Lucca Zenóbio <luccazen@gmail.com>

* test: fix test

* UI Fixes p2  (#8502)

* refactor(admin.tsx): cleanup add new admin flow

removes buggy flow. Ensures just 1 simple way to add users / update roles.

* fix(user_search_modal.tsx): ensure 'add member' button is always visible

* fix(edit_membership.tsx): ensure 'save changes' button always visible

* fix(internal_user_endpoints.py): ensure user in org can be deleted

Fixes issue where user couldn't be deleted if they were a member of an org

* fix: fix linting error

* add phoenix docs for observability integration (#8522)

* Add files via upload

* Update arize_integration.md

* Update arize_integration.md

* add Phoenix docs

* Added custom_attributes to additional_keys which can be sent to athina (#8518)

* (UI) fix log details page  (#8524)

* rollback changes to view logs page

* ui new build

* add interface for prefetch

* fix spread operation

* fix max size for request view page

* clean up table

* ui fix column on request logs page

* ui new build

* Add UI Support for Admins to Call /cache/ping and View Cache Analytics (#8475) (#8519)

* [Bug] UI: Newly created key does not display on the View Key Page (#8039)

- Fixed issue where all keys appeared blank for admin users.
- Implemented filtering of data via team settings to ensure all keys are displayed correctly.

* Fix:
- Updated the validator to allow model editing when `keyTeam.team_alias === "Default Team"`.
- Ensured other teams still follow the original validation rules.

* - added some classes in global.css
- added text wrap in output of request, response and metadata in index.tsx
- fixed styles of table in table.tsx

* - added full payload when we open single log entry
- added Combined Info Card in index.tsx

* fix: keys not showing on refresh for internal user

* merge

* main merge

* cache page

* ca remove

* terms change

* fix: places caching inside exp

---------

Signed-off-by: Regli Daniel <daniel.regli1@sanitas.com>
Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>
Co-authored-by: Kaushik Deka <55996465+Kaushikdkrikhanu@users.noreply.github.com>
Co-authored-by: Lucca Zenóbio <luccazen@gmail.com>
Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>
Co-authored-by: Dani Regli <1daniregli@gmail.com>
Co-authored-by: exiao <exiao@users.noreply.github.com>
Co-authored-by: vivek-athina <153479827+vivek-athina@users.noreply.github.com>
Co-authored-by: Taha Ali <123803932+tahaali-dev@users.noreply.github.com>
Committed: 2025-02-13 19:58:50 -08:00
| Name | Last commit | Date |
| --- | --- | --- |
| audio_utils | (Refactor) - Re use litellm.completion/litellm.embedding etc for health checks (#7455) | 2024-12-28 18:38:54 -08:00 |
| llm_cost_calc | LiteLLM Minor Fixes & Improvements (12/23/2024) - p3 (#7394) | 2024-12-23 22:02:52 -08:00 |
| llm_response_utils | fix(router.py): add more deployment timeout debug information for tim… (#8523) | 2025-02-13 17:10:22 -08:00 |
| prompt_templates | Anthropic Citations API Support (#8382) | 2025-02-07 22:27:01 -08:00 |
| specialty_caches | Fix team-based logging to langfuse + allow custom tokenizer on /token_counter endpoint (#7493) | 2024-12-31 23:18:41 -08:00 |
| tokenizers | Code Quality Improvement - remove tokenizers/ from /llms (#7163) | 2024-12-10 23:50:15 -08:00 |
| asyncify.py | (core sdk fix) - fix fallbacks stuck in infinite loop (#7751) | 2025-01-13 19:34:34 -08:00 |
| core_helpers.py | fix unused imports | 2025-01-02 22:28:22 -08:00 |
| default_encoding.py | Code Quality Improvement - remove tokenizers/ from /llms (#7163) | 2024-12-10 23:50:15 -08:00 |
| dot_notation_indexing.py | feat(handle_jwt.py): initial commit adding custom RBAC support on jwt… (#8037) | 2025-01-28 16:27:06 -08:00 |
| duration_parser.py | (QOL improvement) Provider budget routing - allow using 1s, 1d, 1mo, 2mo etc (#6885) | 2024-11-23 16:59:46 -08:00 |
| exception_mapping_utils.py | Easier user onboarding via SSO (#8187) | 2025-02-02 23:02:33 -08:00 |
| fallback_utils.py | LiteLLM Minor Fixes & Improvements (2024/16/01) (#7826) | 2025-01-17 20:59:21 -08:00 |
| get_litellm_params.py | Ensure base_model cost tracking works across all endpoints (#7989) | 2025-01-24 21:05:26 -08:00 |
| get_llm_provider_logic.py | Fix deepseek calling - refactor to use base_llm_http_handler (#8266) | 2025-02-04 22:30:00 -08:00 |
| get_model_cost_map.py | Doc updates + management endpoint fixes (#8138) | 2025-01-30 22:56:41 -08:00 |
| get_supported_openai_params.py | fix(utils.py): fix vertex ai optional param handling (#8477) | 2025-02-13 19:58:50 -08:00 |
| health_check_utils.py | (Refactor) - Re use litellm.completion/litellm.embedding etc for health checks (#7455) | 2024-12-28 18:38:54 -08:00 |
| initialize_dynamic_callback_params.py | Fix team-based logging to langfuse + allow custom tokenizer on /token_counter endpoint (#7493) | 2024-12-31 23:18:41 -08:00 |
| json_validation_rule.py | feat(vertex_ai_anthropic.py): support response_schema for vertex ai anthropic calls | 2024-07-18 16:57:38 -07:00 |
| litellm_logging.py | Improved wildcard route handling on /models and /model_group/info (#8473) | 2025-02-11 19:37:43 -08:00 |
| llm_request_utils.py | Revert "test_completion_mistral_api_mistral_large_function_call" | 2025-01-17 07:20:46 -08:00 |
| logging_callback_manager.py | (Feat) - Allow viewing Request/Response Logs stored in GCS Bucket (#8449) | 2025-02-10 20:38:55 -08:00 |
| logging_utils.py | Add datadog health check support + fix bedrock converse cost tracking w/ region name specified (#7958) | 2025-01-23 22:17:09 -08:00 |
| mock_functions.py | Ensure base_model cost tracking works across all endpoints (#7989) | 2025-01-24 21:05:26 -08:00 |
| README.md | (QOL improvement) Provider budget routing - allow using 1s, 1d, 1mo, 2mo etc (#6885) | 2024-11-23 16:59:46 -08:00 |
| realtime_streaming.py | (code quality) run ruff rule to ban unused imports (#7313) | 2024-12-19 12:33:42 -08:00 |
| redact_messages.py | Litellm staging (#8270) | 2025-02-04 22:35:48 -08:00 |
| response_header_helpers.py | fix(utils.py): guarantee openai-compatible headers always exist in response | 2024-09-28 21:08:15 -07:00 |
| rules.py | Litellm dev 11 07 2024 (#6649) | 2024-11-08 19:34:22 +05:30 |
| sensitive_data_masker.py | Litellm dev 02 07 2025 p2 (#8377) | 2025-02-07 17:30:38 -08:00 |
| streaming_chunk_builder_utils.py | LiteLLM Minor Fixes & Improvements (01/08/2025) - p2 (#7643) | 2025-01-08 19:45:19 -08:00 |
| streaming_handler.py | Anthropic Citations API Support (#8382) | 2025-02-07 22:27:01 -08:00 |
| thread_pool_executor.py | (Fixes) OpenAI Streaming Token Counting + Fixes usage track when litellm.turn_off_message_logging=True (#8156) | 2025-01-31 15:06:37 -08:00 |
| token_counter.py | fix: Support WebP image format and avoid token calculation error (#7182) | 2024-12-12 14:32:39 -08:00 |

Folder Contents

This folder contains general-purpose utilities that are used in multiple places in the codebase.

Core files:

  • streaming_handler.py: the core streaming logic, plus streaming-related helper utils.
  • core_helpers.py: code used in types/ - e.g. map_finish_reason (sketched below).
  • exception_mapping_utils.py: utils for mapping provider exceptions to openai-compatible error types (sketched below).
  • default_encoding.py: code for loading the default encoding (tiktoken) (sketched below).
  • get_llm_provider_logic.py: code for inferring the LLM provider from a given model name (sketched below).
  • duration_parser.py: code for parsing durations - e.g. "1d", "1mo", "10s" (sketched below).
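
To make the map_finish_reason idea in core_helpers.py concrete, here is a minimal sketch. The mapping table below is illustrative, not litellm's actual table; the real helper covers many more providers and falls back to returning the input unchanged.

```python
# Minimal sketch of a map_finish_reason-style helper. The mapping table is
# illustrative, not the actual table used by litellm.
PROVIDER_FINISH_REASON_MAP = {
    "stop_sequence": "stop",   # Anthropic-style
    "end_turn": "stop",        # Anthropic-style
    "max_tokens": "length",    # Anthropic/Bedrock-style
    "STOP": "stop",            # Vertex AI / Gemini-style
    "MAX_TOKENS": "length",    # Vertex AI / Gemini-style
}


def map_finish_reason(finish_reason: str) -> str:
    """Normalize a provider-specific finish reason to an OpenAI-compatible one."""
    return PROVIDER_FINISH_REASON_MAP.get(finish_reason, finish_reason)


assert map_finish_reason("end_turn") == "stop"
assert map_finish_reason("stop") == "stop"  # already OpenAI-compatible
```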
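
Similarly, exception_mapping_utils.py revolves around translating provider errors into OpenAI-compatible exception classes. A hedged sketch of that shape, with hypothetical exception classes standing in for litellm's real ones:

```python
# Hypothetical sketch: these classes stand in for litellm's real exception
# types, and the actual mapper also inspects error messages, not just
# HTTP status codes.
class AuthenticationError(Exception):
    status_code = 401


class RateLimitError(Exception):
    status_code = 429


class APIError(Exception):
    status_code = 500


def map_provider_exception(status_code: int, message: str) -> Exception:
    """Map a provider HTTP error to an OpenAI-compatible exception type."""
    if status_code in (401, 403):
        return AuthenticationError(message)
    if status_code == 429:
        return RateLimitError(message)
    return APIError(message)
```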
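
default_encoding.py's job can be pictured as loading a tiktoken encoding once at import time so every token count reuses it. A sketch, assuming the cl100k_base encoding (the encoding name and loading path litellm actually uses may differ):

```python
import tiktoken

# Load the encoding once at import time; assumes cl100k_base, which may not
# be the exact encoding litellm ships with.
default_encoding = tiktoken.get_encoding("cl100k_base")


def count_tokens(text: str) -> int:
    """Count tokens in `text` using the default encoding."""
    return len(default_encoding.encode(text))
```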
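
get_llm_provider_logic.py infers a provider from a model name. A simplified sketch of that routing: an explicit "provider/model" prefix wins, otherwise known model-name prefixes are consulted. The prefix table here is an illustrative subset, not the real registry.

```python
from typing import Tuple

# Illustrative subset; the real logic also considers api_base hints,
# custom providers, and a much larger model registry.
MODEL_PREFIX_TO_PROVIDER = {
    "gpt-": "openai",
    "claude-": "anthropic",
    "gemini-": "vertex_ai",
    "command-": "cohere",
}


def get_llm_provider(model: str) -> Tuple[str, str]:
    """Return (model_name, provider) for a given model string."""
    if "/" in model:  # explicit prefix, e.g. "vertex_ai/gemini-1.5-pro"
        provider, _, model_name = model.partition("/")
        return model_name, provider
    for prefix, provider in MODEL_PREFIX_TO_PROVIDER.items():
        if model.startswith(prefix):
            return model, provider
    raise ValueError(f"Could not infer provider for model: {model!r}")


assert get_llm_provider("vertex_ai/gemini-1.5-pro") == ("gemini-1.5-pro", "vertex_ai")
assert get_llm_provider("claude-3-opus")[1] == "anthropic"
```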
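
Finally, duration_parser.py turns strings like "10s", "1d", "1mo" into concrete time spans (used by provider budget routing, per the commit log above). A minimal sketch that approximates a month as 30 days; the real parser may use calendar-aware month arithmetic instead.

```python
import re
from datetime import timedelta

# "mo" must be tried before the single-letter units so "1mo" is not read
# as 1 minute followed by a stray "o".
_DURATION_RE = re.compile(r"^(\d+)(mo|[smhdw])$")

_UNIT_SECONDS = {
    "s": 1,
    "m": 60,
    "h": 3_600,
    "d": 86_400,
    "w": 604_800,
    "mo": 2_592_000,  # 30 days; a simplification of calendar months
}


def parse_duration(duration: str) -> timedelta:
    """Parse '10s', '1d', '1mo', '2mo', ... into a timedelta."""
    match = _DURATION_RE.match(duration.strip())
    if match is None:
        raise ValueError(f"Unsupported duration format: {duration!r}")
    value, unit = int(match.group(1)), match.group(2)
    return timedelta(seconds=value * _UNIT_SECONDS[unit])


assert parse_duration("1d") == timedelta(days=1)
assert parse_duration("2mo") == timedelta(days=60)
```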