docs setup alerting on router

2024-05-07 18:26:45 -07:00 · 2024-05-07 18:26:45 -07:00 · d46544d2bc
commit d46544d2bc
parent e8053c3d0b
1 changed files with 40 additions and 0 deletions
--- a/docs/my-website/docs/routing.md
+++ b/docs/my-website/docs/routing.md
@ -1086,6 +1086,46 @@ async def test_acompletion_caching_on_router_caching_groups():
 asyncio.run(test_acompletion_caching_on_router_caching_groups())
 ```
 ## Alerting 🚨
 Send alerts to slack / your webhook url for the following events
 - LLM API Exceptions
 - Slow LLM Responses
 Get a slack webhook url from https://api.slack.com/messaging/webhooks
 #### Usage
 Initialize an `AlertingConfig` and pass it to `litellm.Router`. The following code will trigger an alert because `api_key=bad-key` which is invalid
 ```python
 from litellm.router import AlertingConfig
 import litellm
 import os
 router = litellm.Router(
 	model_list=[
 		{
 			"model_name": "gpt-3.5-turbo",
 			"litellm_params": {
 				"model": "gpt-3.5-turbo",
 				"api_key": "bad_key",
 			},
 		}
 	],
 	alerting_config= AlertingConfig(
 		alerting_threshold=10,                        # threshold for slow / hanging llm responses (in seconds). Defaults to 300 seconds
 		webhook_url= os.getenv("SLACK_WEBHOOK_URL")   # webhook you want to send alerts to
 	),
 )
 try:
 	await router.acompletion(
 		model="gpt-3.5-turbo",
 		messages=[{"role": "user", "content": "Hey, how's it going?"}],
 	)
 except:
 	pass
 ```
 ## Track cost for Azure Deployments
 **Problem**: Azure returns `gpt-4` in the response when `azure/gpt-4-1106-preview` is used. This leads to inaccurate cost tracking