docs: Documentation update for NVIDIA Inference Provider (#3840)

# What does this PR do?    - Fix examples in the NVIDIA inference documentation to align with current API requirements. ## Test Plan  N/A
2025-12-08 11:07:22 +00:00 · 2025-10-20 09:51:43 -07:00 · 2025-10-20 09:51:43 -07:00 · 165b8b07f4
commit 165b8b07f4
parent f675fdda0f
2 changed files with 34 additions and 47 deletions
--- a/llama_stack/providers/remote/inference/nvidia/nvidia.py
+++ b/llama_stack/providers/remote/inference/nvidia/nvidia.py
@ -19,15 +19,6 @@ class NVIDIAInferenceAdapter(OpenAIMixin):

    """
    NVIDIA Inference Adapter for Llama Stack.
-
-    Note: The inheritance order is important here. OpenAIMixin must come before
-    ModelRegistryHelper to ensure that OpenAIMixin.check_model_availability()
-    is used instead of ModelRegistryHelper.check_model_availability(). It also
-    must come before Inference to ensure that OpenAIMixin methods are available
-    in the Inference interface.
-
-    - OpenAIMixin.check_model_availability() queries the NVIDIA API to check if a model exists
-    - ModelRegistryHelper.check_model_availability() just returns False and shows a warning
    """

    # source: https://docs.nvidia.com/nim/nemo-retriever/text-embedding/latest/support-matrix.html