forked from phoenix/litellm-mirror
(docs) deprecated proxy
This commit is contained in:
parent fbebb28970
commit e2a380b832
1 changed file with 0 additions and 43 deletions
@@ -820,49 +820,6 @@ litellm --model ollama/llama2 \
# OpenAI-compatible server running on http://0.0.0.0:8000
```
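As a quick sanity check that the server really speaks the OpenAI format, a request along these lines should work (a sketch: the route and payload assume the standard OpenAI-style `/chat/completions` contract, and the model name matches the `ollama/llama2` setup above):

```shell
curl http://0.0.0.0:8000/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "ollama/llama2",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
```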
**Across restarts**:

Create a file called `litellm_config.toml` and paste the following into it:
```toml
[model."ollama/llama2"]             # run via `litellm --model ollama/llama2`
max_tokens = 250                    # set max tokens for the model
temperature = 0.5                   # set temperature for the model
api_base = "http://localhost:11434" # set a custom api base for the model
```
Save it to the proxy with:

```shell
$ litellm --config -f ./litellm_config.toml
```
LiteLLM will save a copy of this file in its package, so it can persist these settings across restarts.
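Because a copy is persisted, a later restart should not need the flag again (hypothetical usage, assuming the saved copy is picked up automatically):

```shell
# Start the proxy again without --config; the saved
# max_tokens / temperature / api_base settings should still apply
litellm --model ollama/llama2
```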
[**Complete Config File**](https://github.com/BerriAI/litellm/blob/main/secrets_template.toml)

[**🔥 [Tutorial] modify a model prompt on the proxy**](./tutorials/model_config_proxy.md)
### Track Costs

By default, the LiteLLM proxy writes cost logs to `litellm/proxy/costs.json`:
```json
{
  "Oct-12-2023": {
    "claude-2": {
      "cost": 0.02365918,
      "num_requests": 1
    }
  }
}
```
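Since the log is plain JSON, it is also easy to aggregate outside of LiteLLM; for example, a `jq` one-liner (assuming `jq` is installed and the structure shown above) that sums a day's spend across models:

```shell
# Add up the "cost" field for every model logged on Oct-12-2023
jq '."Oct-12-2023" | [.[].cost] | add' litellm/proxy/costs.json
# => 0.02365918
```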
You can view costs on the CLI using:

```shell
litellm --cost
```
How can the proxy be better? Let us know [here](https://github.com/BerriAI/litellm/issues).
### Performance

We load-tested 500,000 HTTP connections on the FastAPI server for 1 minute, using [wrk](https://github.com/wg/wrk).
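For reference, a representative `wrk` invocation looks like this (the thread and connection counts here are illustrative, not the exact parameters from the test above):

```shell
# 1-minute load test: 12 threads, 500 concurrent connections against the proxy
wrk -t12 -c500 -d60s http://0.0.0.0:8000/
```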