add Azure OpenAI entrata id docs (#5985)

2024-09-30 12:17:58 -07:00 · 2024-09-30 12:17:58 -07:00 · ca9c437021
commit ca9c437021
parent 30aa04b8c2
1 changed files with 148 additions and 106 deletions
--- a/docs/my-website/docs/providers/azure.md
+++ b/docs/my-website/docs/providers/azure.md
@ -82,9 +82,6 @@ export AZURE_API_KEY=""

 ### 2. Start the proxy 

-<Tabs>
-<TabItem value="config" label="config.yaml">
-
 ```yaml
 model_list:
  - model_name: gpt-3.5-turbo
@ -94,28 +91,9 @@ model_list:
      api_version: "2023-05-15"
      api_key: os.environ/AZURE_API_KEY # The `os.environ/` prefix tells litellm to read this from the env.
 ```
-</TabItem>
-<TabItem value="config-*" label="config.yaml (Entrata ID) use tenant_id, client_id, client_secret">
-
-
-```yaml
-model_list:
-  - model_name: gpt-3.5-turbo
-    litellm_params:
-      model: azure/chatgpt-v-2
-      api_base: https://openai-gpt-4-test-v-1.openai.azure.com/
-      api_version: "2023-05-15"
-      tenant_id: os.environ/AZURE_TENANT_ID
-      client_id: os.environ/AZURE_CLIENT_ID
-      client_secret: os.environ/AZURE_CLIENT_SECRET
-```
-</TabItem>
-
-</Tabs>

 ### 3. Test it

-
 <Tabs>
 <TabItem value="Curl" label="Curl Request">

@ -360,6 +338,153 @@ response = speech(
 response.stream_to_file(speech_file_path)
 ```

+## **Authentication**
+
+
+### Entrata ID - use `azure_ad_token`
+
+This is a walkthrough on how to use Azure Active Directory Tokens - Microsoft Entra ID to make `litellm.completion()` calls 
+
+Step 1 - Download Azure CLI 
+Installation instructons: https://learn.microsoft.com/en-us/cli/azure/install-azure-cli
+```shell
+brew update && brew install azure-cli
+```
+Step 2 - Sign in using `az`
+```shell
+az login --output table
+```
+
+Step 3 - Generate azure ad token
+```shell
+az account get-access-token --resource https://cognitiveservices.azure.com
+```
+
+In this step you should see an `accessToken` generated
+```shell
+{
+  "accessToken": "eyJ0eXAiOiJKV1QiLCJhbGciOiJSUzI1NiIsIng1dCI6IjlHbW55RlBraGMzaE91UjIybXZTdmduTG83WSIsImtpZCI6IjlHbW55RlBraGMzaE91UjIybXZTdmduTG83WSJ9",
+  "expiresOn": "2023-11-14 15:50:46.000000",
+  "expires_on": 1700005846,
+  "subscription": "db38de1f-4bb3..",
+  "tenant": "bdfd79b3-8401-47..",
+  "tokenType": "Bearer"
+}
+```
+
+Step 4 - Make litellm.completion call with Azure AD token
+
+Set `azure_ad_token` = `accessToken` from step 3 or set `os.environ['AZURE_AD_TOKEN']`
+
+
+<Tabs>
+<TabItem value="sdk" label="SDK">
+
+
+```python
+response = litellm.completion(
+    model = "azure/<your deployment name>",             # model = azure/<your deployment name> 
+    api_base = "",                                      # azure api base
+    api_version = "",                                   # azure api version
+    azure_ad_token="", 									# your accessToken from step 3 
+    messages = [{"role": "user", "content": "good morning"}],
+)
+
+```
+
+</TabItem>
+<TabItem value="proxy" label="PROXY config.yaml">
+
+```yaml
+model_list:
+  - model_name: gpt-3.5-turbo
+    litellm_params:
+      model: azure/chatgpt-v-2
+      api_base: https://openai-gpt-4-test-v-1.openai.azure.com/
+      api_version: "2023-05-15"
+      azure_ad_token: os.environ/AZURE_AD_TOKEN
+```
+
+</TabItem>
+</Tabs>
+
+### Entrata ID - use tenant_id, client_id, client_secret
+
+Here is an example of setting up `tenant_id`, `client_id`, `client_secret` in your litellm proxy `config.yaml`
+```yaml
+model_list:
+  - model_name: gpt-3.5-turbo
+    litellm_params:
+      model: azure/chatgpt-v-2
+      api_base: https://openai-gpt-4-test-v-1.openai.azure.com/
+      api_version: "2023-05-15"
+      tenant_id: os.environ/AZURE_TENANT_ID
+      client_id: os.environ/AZURE_CLIENT_ID
+      client_secret: os.environ/AZURE_CLIENT_SECRET
+```
+
+Test it 
+
+```shell
+curl --location 'http://0.0.0.0:4000/chat/completions' \
+--header 'Content-Type: application/json' \
+--data ' {
+      "model": "gpt-3.5-turbo",
+      "messages": [
+        {
+          "role": "user",
+          "content": "what llm are you"
+        }
+      ]
+    }
+'
+```
+
+Example video of using `tenant_id`, `client_id`, `client_secret` with LiteLLM Proxy Server
+
+<iframe width="840" height="500" src="https://www.loom.com/embed/70d3f219ee7f4e5d84778b7f17bba506?sid=04b8ff29-485f-4cb8-929e-6b392722f36d" frameborder="0" webkitallowfullscreen mozallowfullscreen allowfullscreen></iframe>
+
+### Azure AD Token Refresh - `DefaultAzureCredential`
+
+Use this if you want to use Azure `DefaultAzureCredential` for Authentication on your requests
+
+<Tabs>
+<TabItem value="sdk" label="SDK">
+
+```python
+from litellm import completion
+from azure.identity import DefaultAzureCredential, get_bearer_token_provider
+
+token_provider = get_bearer_token_provider(DefaultAzureCredential(), "https://cognitiveservices.azure.com/.default")
+
+
+response = completion(
+    model = "azure/<your deployment name>",             # model = azure/<your deployment name> 
+    api_base = "",                                      # azure api base
+    api_version = "",                                   # azure api version
+    azure_ad_token_provider=token_provider
+    messages = [{"role": "user", "content": "good morning"}],
+)
+```
+
+</TabItem>
+<TabItem value="proxy" label="PROXY config.yaml">
+
+```yaml
+model_list:
+  - model_name: gpt-3.5-turbo
+    litellm_params:
+      model: azure/your-deployment-name
+      api_base: https://openai-gpt-4-test-v-1.openai.azure.com/
+
+litellm_settings:
+    enable_azure_ad_token_refresh: true # 👈 KEY CHANGE
+```
+
+</TabItem>
+</Tabs>
+
+
 ## Advanced
 ### Azure API Load-Balancing

@ -486,87 +611,4 @@ print("\nLLM Response1:\n", response)
 response_message = response.choices[0].message
 tool_calls = response.choices[0].message.tool_calls
 print("\nTool Choice:\n", tool_calls)
-```
-
-
-### Authentication with Azure Active Directory Tokens (Microsoft Entra ID)
-This is a walkthrough on how to use Azure Active Directory Tokens - Microsoft Entra ID to make `litellm.completion()` calls 
-
-Step 1 - Download Azure CLI 
-Installation instructons: https://learn.microsoft.com/en-us/cli/azure/install-azure-cli
-```shell
-brew update && brew install azure-cli
-```
-Step 2 - Sign in using `az`
-```shell
-az login --output table
-```
-
-Step 3 - Generate azure ad token
-```shell
-az account get-access-token --resource https://cognitiveservices.azure.com
-```
-
-In this step you should see an `accessToken` generated
-```shell
-{
-  "accessToken": "eyJ0eXAiOiJKV1QiLCJhbGciOiJSUzI1NiIsIng1dCI6IjlHbW55RlBraGMzaE91UjIybXZTdmduTG83WSIsImtpZCI6IjlHbW55RlBraGMzaE91UjIybXZTdmduTG83WSJ9",
-  "expiresOn": "2023-11-14 15:50:46.000000",
-  "expires_on": 1700005846,
-  "subscription": "db38de1f-4bb3..",
-  "tenant": "bdfd79b3-8401-47..",
-  "tokenType": "Bearer"
-}
-```
-
-Step 4 - Make litellm.completion call with Azure AD token
-
-Set `azure_ad_token` = `accessToken` from step 3 or set `os.environ['AZURE_AD_TOKEN']`
-
-```python
-response = litellm.completion(
-    model = "azure/<your deployment name>",             # model = azure/<your deployment name> 
-    api_base = "",                                      # azure api base
-    api_version = "",                                   # azure api version
-    azure_ad_token="", 									# your accessToken from step 3 
-    messages = [{"role": "user", "content": "good morning"}],
-)
-
-```
-### Azure AD Token Refresh
-
-<Tabs>
-<TabItem value="sdk" label="SDK">
-
-```python
-from litellm import completion
-from azure.identity import DefaultAzureCredential, get_bearer_token_provider
-
-token_provider = get_bearer_token_provider(DefaultAzureCredential(), "https://cognitiveservices.azure.com/.default")
-
-
-response = completion(
-    model = "azure/<your deployment name>",             # model = azure/<your deployment name> 
-    api_base = "",                                      # azure api base
-    api_version = "",                                   # azure api version
-    azure_ad_token_provider=token_provider
-    messages = [{"role": "user", "content": "good morning"}],
-)
-```
-
-</TabItem>
-<TabItem value="proxy" label="PROXY config.yaml">
-
-```yaml
-model_list:
-  - model_name: gpt-3.5-turbo
-    litellm_params:
-      model: azure/your-deployment-name
-      api_base: https://openai-gpt-4-test-v-1.openai.azure.com/
-
-litellm_settings:
-    enable_azure_ad_token_refresh: true # 👈 KEY CHANGE
-```
-
-</TabItem>
-</Tabs>
+```