docs(aws_sagemaker.md): cleanup sagemaker messages api docs

2025-04-26 03:04:13 +00:00 · 2024-08-23 21:17:16 -07:00 · 2024-08-23 21:17:16 -07:00 · 74a85fac0e
commit 74a85fac0e
parent cd61ddc610
1 changed files with 75 additions and 1 deletions
--- a/docs/my-website/docs/providers/aws_sagemaker.md
+++ b/docs/my-website/docs/providers/aws_sagemaker.md
@ -415,7 +415,81 @@ response = completion(
 You can also pass in your own [custom prompt template](../completion/prompt_formatting.md#format-prompt-yourself)


-### Completion Models 
+## Sagemaker Messages API 
+
+Use route `sagemaker_chat/*` to route to Sagemaker Messages API
+
+```
+model: sagemaker_chat/<your-endpoint-name>
+```
+
+<Tabs>
+<TabItem value="sdk" label="SDK">
+
+```python
+import os
+import litellm
+from litellm import completion
+
+litellm.set_verbose = True # 👈 SEE RAW REQUEST
+
+os.environ["AWS_ACCESS_KEY_ID"] = ""
+os.environ["AWS_SECRET_ACCESS_KEY"] = ""
+os.environ["AWS_REGION_NAME"] = ""
+
+response = completion(
+            model="sagemaker_chat/<your-endpoint-name>", 
+            messages=[{ "content": "Hello, how are you?","role": "user"}],
+            temperature=0.2,
+            max_tokens=80
+        )
+```
+
+</TabItem>
+<TabItem value="proxy" label="PROXY">
+
+#### 1. Setup config.yaml 
+
+```yaml
+model_list:
+  - model_name: "sagemaker-model"
+    litellm_params:
+      model: "sagemaker_chat/jumpstart-dft-hf-textgeneration1-mp-20240815-185614"
+      aws_access_key_id: os.environ/AWS_ACCESS_KEY_ID
+      aws_secret_access_key: os.environ/AWS_SECRET_ACCESS_KEY
+      aws_region_name: os.environ/AWS_REGION_NAME
+```
+
+#### 2. Start the proxy 
+
+```bash
+litellm --config /path/to/config.yaml
+```
+#### 3. Test it
+
+
+```shell
+curl --location 'http://0.0.0.0:4000/chat/completions' \
+--header 'Content-Type: application/json' \
+--data ' {
+      "model": "sagemaker-model",
+      "messages": [
+        {
+          "role": "user",
+          "content": "what llm are you"
+        }
+      ]
+    }
+'
+```
+
+[**👉 See OpenAI SDK/Langchain/Llamaindex/etc. examples**](../proxy/user_keys.md#chatcompletions)
+
+</TabItem>
+</Tabs>
+
+
+## Completion Models 


 :::tip