(docs) simple proxy

ishaan-jaff 2023-11-29 16:44:39 -08:00
parent c2f642dbec
commit 69eca78000


@@ -460,12 +460,10 @@ curl --location 'http://0.0.0.0:8000/chat/completions' \
```
### Load Balancing - Multiple Instances of 1 model
**LiteLLM Proxy can handle 1k+ requests/second**. Use this config to load balance between multiple instances of the same model. The proxy will handle routing requests using LiteLLM's Router. **Set `rpm` in the config if you want to maximize throughput.**

#### Example config

Requests with `model=gpt-3.5-turbo` will be routed across multiple instances of `azure/gpt-3.5-turbo`.
```yaml
model_list:
  - model_name: gpt-3.5-turbo
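    litellm_params:
      # Placeholder values below are illustrative, not from the original doc —
      # substitute your own Azure deployment, endpoint, and key, and add one
      # entry per instance you want to balance across.
      model: azure/<your-deployment-name>
      api_base: https://<your-azure-endpoint>.openai.azure.com/
      api_key: <your-azure-api-key>
      rpm: 6          # requests per minute this instance can serve
  - model_name: gpt-3.5-turbo
    litellm_params:
      model: azure/<your-second-deployment-name>
      api_base: https://<your-second-azure-endpoint>.openai.azure.com/
      api_key: <your-azure-api-key>
      rpm: 600        # set rpm per instance so the Router can maximize throughput
```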