import Tabs from '@theme/Tabs';
import TabItem from '@theme/TabItem';
# Using Video Models
## Quick Start
Example of passing a video to a model:
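A minimal sketch of the full flow, assuming a LiteLLM proxy running at `http://localhost:4000` (the default port) with a virtual key `sk-1234`; the file name and file ID are placeholders, and the request shapes mirror the examples later on this page:

```bash
# 1. Upload the video to the target provider's storage (Vertex AI here)
curl http://localhost:4000/v1/files \
  -H "Authorization: Bearer sk-1234" \
  -F purpose="fine-tune" \
  -F file="@myvideo.mp4" \
  -F custom_llm_provider='["vertex_ai"]'

# 2. Reference the returned file ID in a /chat/completions request
curl http://localhost:4000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-1234" \
  -d '{
    "model": "vertex_ai/gemini-pro-vision",
    "input": [
      {
        "role": "user",
        "content": [
          { "type": "input_file", "file_id": "file-abc123" },
          { "type": "input_text", "text": "What is happening in this video?" }
        ]
      }
    ]
  }'
```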
## Proxy Config setup
Admins can configure file storage settings and retention policies in their LiteLLM config:
```yaml
model_list:
  - model_name: vertex_ai/*
    litellm_params:
      model: vertex_ai/*
  - model_name: bedrock/*
    litellm_params:
      model: bedrock/*

files_settings:
  # Configure storage providers and retention
  - custom_llm_provider: azure
    api_base: https://exampleopenaiendpoint-production.up.railway.app
    api_key: fake-key
    api_version: "2023-03-15-preview"
  - custom_llm_provider: openai
    api_key: os.environ/OPENAI_API_KEY
  - custom_llm_provider: bedrock
    api_key: os.environ/BEDROCK_API_KEY
    api_base: https://bedrock.us-east-1.amazonaws.com
    retention_period: 7
  - custom_llm_provider: vertex_ai
    bucket_name: my-vertex_ai-bucket
    retention_period: 7
```
This configuration:
- Sets up storage providers (GCS for Vertex AI, S3 for Bedrock), each with a 7-day retention period
- Configures provider-specific file endpoints for Azure and OpenAI
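With the file above saved as `config.yaml` (path is a placeholder), the proxy is started as usual:

```bash
litellm --config config.yaml
```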
## 1. Upload a Local File and Process on Vertex + Bedrock
When uploading a local file, the process follows these steps:
1. The client makes a POST request to the `/files` endpoint with the local file
2. The client MUST specify the target storage `custom_llm_provider`(s) based on the models it intends to use
3. LiteLLM uploads the file to the specified `custom_llm_provider`(s)
4. A file ID is returned to the client
5. Files have a 7-day retention policy (configured by the admin)
```bash
# custom_llm_provider is required
curl https://api.litellm.ai/v1/files \
  -H "Authorization: Bearer sk-1234" \
  -F purpose="fine-tune" \
  -F file="@mydata.jsonl" \
  -F custom_llm_provider='["vertex_ai"]' # Required: specify ["vertex_ai"], ["bedrock"], or ["vertex_ai", "bedrock"]
```
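The response follows the OpenAI files-API shape; the values below are illustrative only (the `id` is the one reused in the request that follows):

```json
{
  "id": "file-6F2ksmvXxt4VdoqmHRw6kL",
  "object": "file",
  "bytes": 120000,
  "created_at": 1677610602,
  "filename": "mydata.jsonl",
  "purpose": "fine-tune"
}
```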
After uploading, you can use the file_id in a /chat/completions request:
```bash
curl "https://api.litellm.ai/v1/chat/completions" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer sk-1234" \
-d '{
"model": "bedrock/amazon.nova-pro-v1:0",
"input": [
{
"role": "user",
"content": [
{
"type": "input_file",
"file_id": "file-6F2ksmvXxt4VdoqmHRw6kL"
},
{
"type": "input_text",
"text": "What is happening in this video?"
}
]
}
]
}'
```
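If LiteLLM mirrors the rest of the OpenAI files API (an assumption; only upload is shown on this page), you can fetch the file's metadata to confirm it is still within its retention window before sending requests:

```bash
curl https://api.litellm.ai/v1/files/file-6F2ksmvXxt4VdoqmHRw6kL \
  -H "Authorization: Bearer sk-1234"
```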
## 2. Process an Existing S3 File on Vertex + Bedrock
For files already in S3, you can avoid downloading and re-uploading by using a presigned URL:
```bash
# custom_llm_provider is required: specify ["vertex_ai"], ["bedrock"], or both
curl -X POST "https://api.litellm.ai/files" \
  -H "Authorization: Bearer sk-1234" \
  -H "Content-Type: application/json" \
  -d '{
    "source": "https://your-bucket.s3.amazonaws.com/path/to/mydata.jsonl?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=...",
    "custom_llm_provider": ["vertex_ai"]
  }'
```
This performs a direct copy from the S3 presigned URL to the specified custom_llm_provider(s), which is more efficient than downloading and re-uploading.
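One way to generate the presigned URL for the request above is the AWS CLI; the bucket and key here match the placeholders in the example:

```bash
# Produces a presigned GET URL valid for one hour
aws s3 presign s3://your-bucket/path/to/mydata.jsonl --expires-in 3600
```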
After the S3 file is copied, you can use the returned file_id in your /chat/completions request:
```bash
curl "https://api.litellm.ai/v1/chat/completions" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer sk-1234" \
-d '{
"model": "vertex_ai/gemini-pro-vision",
"input": [
{
"role": "user",
"content": [
{
"type": "input_file",
"file_id": "file-8H3jsmwYyt6WepqnKRw9mN"
},
{
"type": "input_text",
"text": "Describe the main events in this video"
}
]
}
]
}'
```
## Best Practices
**Storage Location Selection**:
- You MUST specify `custom_llm_provider` based on your model requirements
- For Vertex AI models, specify `["vertex_ai"]`
- For Bedrock models, specify `["bedrock"]`
- For fallback scenarios, specify both `["vertex_ai", "bedrock"]` (see the sketch after this list)
- Avoid unnecessary copies by selecting only the destinations you need
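For the fallback case, a single upload can target both providers; a sketch reusing the placeholders from section 1:

```bash
curl https://api.litellm.ai/v1/files \
  -H "Authorization: Bearer sk-1234" \
  -F purpose="fine-tune" \
  -F file="@mydata.jsonl" \
  -F custom_llm_provider='["vertex_ai", "bedrock"]'
```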
## Model Compatibility
This file handling approach is compatible with various model backends:
- Vertex AI
- AWS Bedrock
- vLLM (for video processing, using its multimodal input format)

The commit also registers the new page in the docs sidebar (`sidebars.js`):

```diff
@@ -260,6 +260,7 @@ const sidebars = {
       "completion/web_search",
       "completion/document_understanding",
       "completion/vision",
+      "completion/video",
       "completion/json_mode",
       "reasoning_content",
       "completion/prompt_caching",
```