litellm-mirror/litellm/proxy/hooks
Ishaan Jaff 6220e17ebf (feat proxy) v2 - model max budgets (#7302)
* clean up unused code

* add _PROXY_VirtualKeyModelMaxBudgetLimiter

* adjust type imports

* working _PROXY_VirtualKeyModelMaxBudgetLimiter

* fix user_api_key_model_max_budget

* fix user_api_key_model_max_budget

* update naming

* update naming

* fix changes to RouterBudgetLimiting

* test_call_with_key_over_model_budget

* test_call_with_key_over_model_budget

* handle _get_request_model_budget_config

* e2e test for test_call_with_key_over_model_budget

* clean up test

* run ci/cd again

* add validate_model_max_budget

* docs fix

* update doc

* add e2e testing for _PROXY_VirtualKeyModelMaxBudgetLimiter

* test_unit_test_max_model_budget_limiter.py
2024-12-18 19:42:46 -08:00
..
__init__.py fix(proxy_server.py): enable pre+post-call hooks and max parallel request limits 2023-12-08 17:11:30 -08:00
azure_content_safety.py (refactor) caching use LLMCachingHandler for async_get_cache and set_cache (#6208) 2024-10-14 16:34:01 +05:30
batch_redis_get.py (refactor) caching use LLMCachingHandler for async_get_cache and set_cache (#6208) 2024-10-14 16:34:01 +05:30
cache_control_check.py (refactor) caching use LLMCachingHandler for async_get_cache and set_cache (#6208) 2024-10-14 16:34:01 +05:30
dynamic_rate_limiter.py (refactor) caching use LLMCachingHandler for async_get_cache and set_cache (#6208) 2024-10-14 16:34:01 +05:30
example_presidio_ad_hoc_recognizer.json fix(presidio_pii_masking.py): enable user to pass ad hoc recognizer for pii masking 2024-02-20 16:01:15 -08:00
key_management_event_hooks.py Litellm lm studio embedding params (#6746) 2024-11-19 09:54:50 +05:30
max_budget_limiter.py redis otel tracing + async support for latency routing (#6452) 2024-10-28 21:52:12 -07:00
model_max_budget_limiter.py (feat proxy) v2 - model max budgets (#7302) 2024-12-18 19:42:46 -08:00
parallel_request_limiter.py (minor fix proxy) Clarify Proxy Rate limit errors are showing hash of litellm virtual key (#7210) 2024-12-12 20:13:14 -08:00
prompt_injection_detection.py Litellm remove circular imports (#7232) 2024-12-14 16:28:34 -08:00
proxy_failure_handler.py (feat) Allow disabling ErrorLogs written to the DB (#6940) 2024-11-27 19:34:51 -08:00