(docs) proxy - advanced caching

2023-12-16 13:52:58 +05:30 · 2023-12-16 13:52:58 +05:30 · 80bf99b1af
commit 80bf99b1af
parent a04f43ef38
1 changed files with 28 additions and 2 deletions
--- a/docs/my-website/docs/proxy/caching.md
+++ b/docs/my-website/docs/proxy/caching.md
@ -79,7 +79,33 @@ curl --location 'http://0.0.0.0:8000/embeddings' \
  }'
 ```
-## Override caching per `chat/completions` request
+## Advanced
 ### Set Cache Params on config.yaml
 ```yaml
 model_list:
  - model_name: gpt-3.5-turbo
    litellm_params:
      model: gpt-3.5-turbo
  - model_name: text-embedding-ada-002
    litellm_params:
      model: text-embedding-ada-002
 litellm_settings:
  set_verbose: True
  cache: True          # set cache responses to True, litellm defaults to using a redis cache
  # cache_params are optional
  cache_params:
    type: "redis"  # The type of cache to initialize. Can be "local" or "redis". Defaults to "local".
    host: "localhost"  # The host address for the Redis cache. Required if type is "redis".
    port: 6379  # The port number for the Redis cache. Required if type is "redis".
    password: "your_password"  # The password for the Redis cache. Required if type is "redis".
    # Optional configurations
    supported_call_types: ["acompletion", "completion", "embedding", "aembedding"] # defaults to all litellm call types
 ```
 ### Override caching per `chat/completions` request
 Caching can be switched on/off per `/chat/completions` request
 - Caching **on** for individual completion - pass `caching=True`:
  ```shell
@ -105,7 +131,7 @@ Caching can be switched on/off per `/chat/completions` request
  ```
-## Override caching per `/embeddings` request
+### Override caching per `/embeddings` request
 Caching can be switched on/off per `/embeddings` request
 - Caching **on** for embedding - pass `caching=True`: