litellm-mirror/litellm/router_utils
Krish Dholakia a42f008cd0 Litellm dev 12 12 2024 (#7203)
* fix(azure/): support passing headers to azure openai endpoints

Fixes https://github.com/BerriAI/litellm/issues/6217

* fix(utils.py): move default tokenizer to just openai

hf tokenizer makes network calls when trying to get the tokenizer - this slows down execution time calls

* fix(router.py): fix pattern matching router - add generic "*" to it as well

Fixes issue where generic "*" model access group wouldn't show up

* fix(pattern_match_deployments.py): match to more specific pattern

match to more specific pattern

allows setting generic wildcard model access group and excluding specific models more easily

* fix(proxy_server.py): fix _delete_deployment to handle base case where db_model list is empty

don't delete all router models  b/c of empty list

Fixes https://github.com/BerriAI/litellm/issues/7196

* fix(anthropic/): fix handling response_format for anthropic messages with anthropic api

* fix(fireworks_ai/): support passing response_format + tool call in same message

Addresses https://github.com/BerriAI/litellm/issues/7135

* Revert "fix(fireworks_ai/): support passing response_format + tool call in same message"

This reverts commit 6a30dc6929.

* test: fix test

* fix(replicate/): fix replicate default retry/polling logic

* test: add unit testing for router pattern matching

* test: update test to use default oai tokenizer

* test: mark flaky test

* test: skip flaky test
2024-12-13 08:54:03 -08:00
..
router_callbacks (testing) Router add testing coverage (#6253) 2024-10-16 07:32:27 -07:00
batch_utils.py Litellm ruff linting enforcement (#5992) 2024-10-01 19:44:20 -04:00
client_initalization_utils.py fix use consistent naming (#7092) 2024-12-07 22:01:00 -08:00
cooldown_cache.py LiteLLM Minor Fixes & Improvements (10/28/2024) (#6475) 2024-10-29 17:20:24 -07:00
cooldown_callbacks.py (feat) provider budget routing improvements (#6827) 2024-11-19 21:25:08 -08:00
cooldown_handlers.py redis otel tracing + async support for latency routing (#6452) 2024-10-28 21:52:12 -07:00
fallback_event_handlers.py Litellm dev 12 07 2024 (#7086) 2024-12-08 00:30:33 -08:00
get_retry_from_policy.py Litellm dev 12 06 2024 (#7067) 2024-12-06 22:44:18 -08:00
handle_error.py (UI) Fix viewing members, keys in a team + added testing (#6514) 2024-10-30 23:51:13 +05:30
pattern_match_deployments.py Litellm dev 12 12 2024 (#7203) 2024-12-13 08:54:03 -08:00
prompt_caching_cache.py Litellm dev 12 07 2024 (#7086) 2024-12-08 00:30:33 -08:00
response_headers.py LiteLLM Minor Fixes & Improvements (11/26/2024) (#6913) 2024-11-28 00:01:38 +05:30