doc - spend tracking with litellm

2024-06-03 13:39:38 -07:00 · 2024-06-03 13:39:38 -07:00 · 26002006d4
commit 26002006d4
parent 3ef3a7ba5f
2 changed files with 133 additions and 11 deletions
--- a/docs/my-website/docs/proxy/cost_tracking.md
+++ b/docs/my-website/docs/proxy/cost_tracking.md
@ -1,22 +1,140 @@
 import Tabs from '@theme/Tabs';
 import TabItem from '@theme/TabItem';
+import Image from '@theme/IdealImage';

 # 💸 Spend Tracking

 Track spend for keys, users, and teams across 100+ LLMs.

-## Getting Spend Reports - To Charge Other Teams, API Keys
+### How to Track Spend with LiteLLM
+
+👉 [Setup LiteLLM with a Database](https://docs.litellm.ai/docs/proxy/deploy)
+
+That's it, that's all you need to do for basic LLM spend tracking
+
+### What spend gets tracked 
+
+For the Request below LiteLLM will **automatically** track spend for
+- Virtual Key = `sk-1234`
+- Customer = `palantir`
+- Tags = `jobID:214590dsff09fds`, `taskName:run_page_classification`
+
+<Tabs>
+
+
+<TabItem value="openai" label="OpenAI Python v1.0.0+">
+
+```python
+import openai
+client = openai.OpenAI(
+    api_key="sk-1234",
+    base_url="http://0.0.0.0:4000"
+)
+
+# request sent to model set on litellm proxy, `litellm --model`
+response = client.chat.completions.create(
+    model="gpt-3.5-turbo",
+    messages = [
+        {
+            "role": "user",
+            "content": "this is a test request, write a short poem"
+        }
+    ],
+    user="palantir",
+    extra_body={
+        "metadata": {
+            "tags": ["jobID:214590dsff09fds", "taskName:run_page_classification"]
+        }
+    }
+)
+
+print(response)
+```
+</TabItem>
+
+<TabItem value="Curl" label="Curl Request">
+
+Pass `metadata` as part of the request body
+
+```shell
+curl --location 'http://0.0.0.0:4000/chat/completions' \
+    --header 'Content-Type: application/json' \
+    --header 'Authorization: Bearer sk-1234' \
+    --data '{
+    "model": "gpt-3.5-turbo",
+    "messages": [
+        {
+        "role": "user",
+        "content": "what llm are you"
+        }
+    ],
+    "user": "palantir",
+    "metadata": {
+        "tags": ["jobID:214590dsff09fds", "taskName:run_page_classification"]
+    }
+}'
+```
+</TabItem>
+<TabItem value="langchain" label="Langchain">
+
+```python
+from langchain.chat_models import ChatOpenAI
+from langchain.prompts.chat import (
+    ChatPromptTemplate,
+    HumanMessagePromptTemplate,
+    SystemMessagePromptTemplate,
+)
+from langchain.schema import HumanMessage, SystemMessage
+import os
+
+os.environ["OPENAI_API_KEY"] = "sk-1234"
+
+chat = ChatOpenAI(
+    openai_api_base="http://0.0.0.0:4000",
+    model = "gpt-3.5-turbo",
+    user="palantir",
+    extra_body={
+        "metadata": {
+            "tags": ["jobID:214590dsff09fds", "taskName:run_page_classification"]
+        }
+    }
+)
+
+messages = [
+    SystemMessage(
+        content="You are a helpful assistant that im using to make a test request to."
+    ),
+    HumanMessage(
+        content="test from litellm. tell me why it's amazing in 1 sentence"
+    ),
+]
+response = chat(messages)
+
+print(response)
+```
+
+</TabItem>
+</Tabs>
+
+## View Spend on LiteLLM Admin UI
+
+Navigate to the `Usage` Tab on the LiteLLM UI (found on `https://your-proxy-endpoint/ui`)
+
+<Image img={require('../../img/admin_ui_spend.png')} />
+
+## API Endpoints to get Spend
+### Getting Spend Reports - To Charge Other Teams, API Keys

 Use the `/global/spend/report` endpoint to get daily spend per team, with a breakdown of spend per API Key, Model

-### Example Request
+#### Example Request

 ```shell
 curl -X GET 'http://localhost:4000/global/spend/report?start_date=2024-04-01&end_date=2024-06-30' \
  -H 'Authorization: Bearer sk-1234'
 ```

-### Example Response
+#### Example Response
 <Tabs>

 <TabItem value="response" label="Expected Response">
@ -125,7 +243,7 @@ Output from script

 </Tabs>

-## Allowing Non-Proxy Admins to access `/spend` endpoints 
+### Allowing Non-Proxy Admins to access `/spend` endpoints 

 Use this when you want non-proxy admins to access `/spend` endpoints

@ -135,7 +253,7 @@ Schedule a [meeting with us to get your Enterprise License](https://calendly.com

 :::

-### Create Key 
+#### Create Key 
 Create Key with with `permissions={"get_spend_routes": true}` 
 ```shell
 curl --location 'http://0.0.0.0:4000/key/generate' \
@ -146,7 +264,7 @@ curl --location 'http://0.0.0.0:4000/key/generate' \
    }'
 ```

-### Use generated key on `/spend` endpoints
+#### Use generated key on `/spend` endpoints

 Access spend Routes with newly generate keys
 ```shell
@ -156,14 +274,14 @@ curl -X GET 'http://localhost:4000/global/spend/report?start_date=2024-04-01&end



-## Reset Team, API Key Spend - MASTER KEY ONLY
+### Reset Team, API Key Spend - MASTER KEY ONLY

 Use `/global/spend/reset` if you want to:
 - Reset the Spend for all API Keys, Teams. The `spend` for ALL Teams and Keys in `LiteLLM_TeamTable` and `LiteLLM_VerificationToken` will be set to `spend=0`

 - LiteLLM will maintain all the logs in `LiteLLMSpendLogs` for Auditing Purposes

-### Request 
+#### Request 
 Only the `LITELLM_MASTER_KEY` you set can access this route
 ```shell
 curl -X POST \
@ -172,7 +290,7 @@ curl -X POST \
  -H 'Content-Type: application/json'
 ```

-### Expected Responses
+#### Expected Responses

 ```shell
 {"message":"Spend for all API Keys and Teams reset successfully","status":"success"}
@ -181,7 +299,7 @@ curl -X POST \



-## Spend Tracking for Azure
+## Spend Tracking for Azure OpenAI Models

 Set base model for cost tracking azure image-gen call

@ -219,4 +337,8 @@ model_list:
      api_version: "2023-07-01-preview"
    model_info:
      base_model: azure/gpt-4-1106-preview
-```
+```
+
+## Custom Input/Output Pricing
+
+👉 Head to [Custom Input/Output Pricing](https://docs.litellm.ai/docs/proxy/custom_pricing) to setup custom pricing or your models
--- a/docs/my-website/img/admin_ui_spend.png
+++ b/docs/my-website/img/admin_ui_spend.png