docs(docs/index.md): add proxy details to docs

Krrish Dholakia 2024-01-04 11:20:43 +05:30
parent 0f7d03f761
commit 4946b1ef6d
2 changed files with 81 additions and 1 deletions


@@ -396,6 +396,47 @@ response = completion(
)
```
## OpenAI Proxy
Track spend across multiple projects/people
The proxy provides:
1. [Hooks for auth](https://docs.litellm.ai/docs/proxy/virtual_keys#custom-auth)
2. [Hooks for logging](https://docs.litellm.ai/docs/proxy/logging#step-1---create-your-custom-litellm-callback-class) (see the callback sketch after this list)
3. [Cost tracking](https://docs.litellm.ai/docs/proxy/virtual_keys#tracking-spend)
4. [Rate Limiting](https://docs.litellm.ai/docs/proxy/users#set-rate-limits)
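For example, the logging hooks accept a custom callback class. A minimal sketch, assuming the `CustomLogger` interface described in the linked logging docs; the handler name and file name are placeholders:

```python
# custom_callbacks.py -- a sketch, not the full interface
from litellm.integrations.custom_logger import CustomLogger

class MyCustomHandler(CustomLogger):
    def log_success_event(self, kwargs, response_obj, start_time, end_time):
        # called after each successful LLM call routed through the proxy
        print(f"success: model={kwargs.get('model')} latency={end_time - start_time}")

    def log_failure_event(self, kwargs, response_obj, start_time, end_time):
        # called when an LLM call fails
        print("failure: llm call errored")

# the proxy config would reference this instance, e.g.
#   litellm_settings:
#     callbacks: custom_callbacks.proxy_handler_instance
proxy_handler_instance = MyCustomHandler()
```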
### 📖 Proxy Endpoints - [Swagger Docs](https://litellm-api.up.railway.app/)
### Quick Start Proxy - CLI
```shell
pip install 'litellm[proxy]'
```
#### Step 1: Start litellm proxy
```shell
$ litellm --model huggingface/bigcode/starcoder
#INFO: Proxy running on http://0.0.0.0:8000
```
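Before making requests, you can sanity-check that the proxy is up. A quick sketch, assuming the proxy exposes the OpenAI-compatible `/models` route (not shown in the snippet above):

```python
import openai  # openai v1.0.0+

# any api_key works for a locally started proxy without auth configured
client = openai.OpenAI(api_key="anything", base_url="http://0.0.0.0:8000")

# list the models the proxy is serving; this should include the
# `litellm --model` value from Step 1
for model in client.models.list():
    print(model.id)
```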
#### Step 2: Make ChatCompletions Request to Proxy
```python
import openai  # openai v1.0.0+

# point the client at the proxy via base_url; any api_key works
client = openai.OpenAI(api_key="anything", base_url="http://0.0.0.0:8000")

# request is routed to the model set on the litellm proxy, `litellm --model`
response = client.chat.completions.create(
    model="gpt-3.5-turbo",
    messages=[
        {
            "role": "user",
            "content": "this is a test request, write a short poem"
        }
    ],
)
print(response)
```
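The proxy passes streaming responses through as well. A variant of the request above, using the standard openai v1 streaming interface:

```python
# same client as above; stream tokens instead of waiting for the full response
stream = client.chat.completions.create(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": "this is a test request, write a short poem"}],
    stream=True,
)
for chunk in stream:
    # each chunk carries a delta; content can be None on some chunks
    print(chunk.choices[0].delta.content or "", end="")
```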
## More details
* [exception mapping](./exception_mapping.md)
* [retries + model fallbacks for completion()](./completion/reliable_completions.md)


@@ -375,6 +375,45 @@ response = completion(
Need a dedicated key? Email us @ krrish@berri.ai
## OpenAI Proxy
Track spend across multiple projects/people
The proxy provides:
1. [Hooks for auth](https://docs.litellm.ai/docs/proxy/virtual_keys#custom-auth)
2. [Hooks for logging](https://docs.litellm.ai/docs/proxy/logging#step-1---create-your-custom-litellm-callback-class)
3. [Cost tracking](https://docs.litellm.ai/docs/proxy/virtual_keys#tracking-spend)
4. [Rate Limiting](https://docs.litellm.ai/docs/proxy/users#set-rate-limits)
### 📖 Proxy Endpoints - [Swagger Docs](https://litellm-api.up.railway.app/)
### Quick Start Proxy - CLI
```shell
pip install 'litellm[proxy]'
```
#### Step 1: Start litellm proxy
```shell
$ litellm --model huggingface/bigcode/starcoder
#INFO: Proxy running on http://0.0.0.0:8000
```
#### Step 2: Make ChatCompletions Request to Proxy
```python
import openai  # openai v1.0.0+

# point the client at the proxy via base_url; any api_key works
client = openai.OpenAI(api_key="anything", base_url="http://0.0.0.0:8000")

# request is routed to the model set on the litellm proxy, `litellm --model`
response = client.chat.completions.create(
    model="gpt-3.5-turbo",
    messages=[
        {
            "role": "user",
            "content": "this is a test request, write a short poem"
        }
    ],
)
print(response)
```
## More details
* [exception mapping](./exception_mapping.md)