From 9cf3dcbbf39404d752bd07a06d4122fd13cd1052 Mon Sep 17 00:00:00 2001 From: Emmanuel Ferdman Date: Sat, 16 Nov 2024 18:28:44 +0200 Subject: [PATCH] Update routing references (#6758) * Update routing references Signed-off-by: Emmanuel Ferdman * Update routing references Signed-off-by: Emmanuel Ferdman --------- Signed-off-by: Emmanuel Ferdman --- docs/my-website/docs/routing.md | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/docs/my-website/docs/routing.md b/docs/my-website/docs/routing.md index ec692147b..702cafa7f 100644 --- a/docs/my-website/docs/routing.md +++ b/docs/my-website/docs/routing.md @@ -281,7 +281,7 @@ Picks the deployment with the lowest response time. It caches, and updates the response times for deployments based on when a request was sent and received from a deployment. -[**How to test**](https://github.com/BerriAI/litellm/blob/main/litellm/tests/test_lowest_latency_routing.py) +[**How to test**](https://github.com/BerriAI/litellm/blob/main/tests/local_testing/test_lowest_latency_routing.py) ```python from litellm import Router @@ -567,7 +567,7 @@ print(response) Picks a deployment with the least number of ongoing calls, it's handling. -[**How to test**](https://github.com/BerriAI/litellm/blob/main/litellm/tests/test_least_busy_routing.py) +[**How to test**](https://github.com/BerriAI/litellm/blob/main/tests/local_testing/test_least_busy_routing.py) ```python from litellm import Router @@ -1035,7 +1035,7 @@ print(f"response: {response}") ### [Advanced]: Custom Retries, Cooldowns based on Error Type -- Use `RetryPolicy` if you want to set a `num_retries` based on the Exception receieved +- Use `RetryPolicy` if you want to set a `num_retries` based on the Exception received - Use `AllowedFailsPolicy` to set a custom number of `allowed_fails`/minute before cooling down a deployment [**See All Exception Types**](https://github.com/BerriAI/litellm/blob/ccda616f2f881375d4e8586c76fe4662909a7d22/litellm/types/router.py#L436)