litellm-mirror/litellm/router_utils
Krish Dholakia 2f08341a08
Litellm dev readd prompt caching (#7299)
* fix(router.py): re-add saving model id on prompt caching valid successful deployment

* fix(router.py): introduce optional pre_call_checks

isolate prompt caching logic in a separate file

* fix(prompt_caching_deployment_check.py): fix import

* fix(router.py): new 'async_filter_deployments' event hook

allows custom logger to filter deployments returned to routing strategy

* feat(prompt_caching_deployment_check.py): initial working commit of prompt caching based routing

* fix(cooldown_callbacks.py): fix linting error

* fix(budget_limiter.py): move budget logger to async_filter_deployment hook

* test: add unit test

* test(test_router_helper_utils.py): add unit testing

* fix(budget_limiter.py): fix linting errors

* docs(config_settings.md): add 'optional_pre_call_checks' to router_settings param docs
2024-12-18 15:13:49 -08:00
..
pre_call_checks Litellm dev readd prompt caching (#7299) 2024-12-18 15:13:49 -08:00
router_callbacks (testing) Router add testing coverage (#6253) 2024-10-16 07:32:27 -07:00
batch_utils.py Litellm ruff linting enforcement (#5992) 2024-10-01 19:44:20 -04:00
client_initalization_utils.py (feat) Add Azure Blob Storage Logging Integration (#7265) 2024-12-16 22:18:22 -08:00
cooldown_cache.py LiteLLM Minor Fixes & Improvements (10/28/2024) (#6475) 2024-10-29 17:20:24 -07:00
cooldown_callbacks.py Litellm dev readd prompt caching (#7299) 2024-12-18 15:13:49 -08:00
cooldown_handlers.py redis otel tracing + async support for latency routing (#6452) 2024-10-28 21:52:12 -07:00
fallback_event_handlers.py Litellm dev 12 11 2024 v2 (#7215) 2024-12-13 12:49:57 -08:00
get_retry_from_policy.py Litellm dev 12 06 2024 (#7067) 2024-12-06 22:44:18 -08:00
handle_error.py (UI) Fix viewing members, keys in a team + added testing (#6514) 2024-10-30 23:51:13 +05:30
pattern_match_deployments.py Litellm dev 12 12 2024 (#7203) 2024-12-13 08:54:03 -08:00
prompt_caching_cache.py Litellm dev readd prompt caching (#7299) 2024-12-18 15:13:49 -08:00
response_headers.py LiteLLM Minor Fixes & Improvements (11/26/2024) (#6913) 2024-11-28 00:01:38 +05:30