Update docs

2025-10-04 12:07:34 +00:00 · 2025-09-12 19:55:04 -07:00 · 2025-09-12 19:55:04 -07:00 · 3538477070
commit 3538477070
parent a0e6e82c1e
7 changed files with 6 additions and 137 deletions
--- a/docs/docs/providers/inference/index.mdx
+++ b/docs/docs/providers/inference/index.mdx
@ -18,6 +18,6 @@ Llama Stack Inference API for generating completions, chat completions, and embe
    This API provides the raw interface to the underlying models. Three kinds of models are supported:
    - LLM models: these models generate "raw" and "chat" (conversational) completions.
    - Embedding models: these models generate embeddings to be used for semantic search.
-    - Rerank models: these models reorder the documents by relevance.
+    - Rerank models: these models reorder the documents based on their relevance to a query.

 This section contains documentation for all available providers for the **inference** API.
--- a/docs/static/llama-stack-spec.html
+++ b/docs/static/llama-stack-spec.html
@ -17875,7 +17875,7 @@
        },
        {
            "name": "Inference",
-            "description": "This API provides the raw interface to the underlying models. Three kinds of models are supported:\n- LLM models: these models generate \"raw\" and \"chat\" (conversational) completions.\n- Embedding models: these models generate embeddings to be used for semantic search.\n- Rerank models: these models reorder the documents by relevance.",
+            "description": "This API provides the raw interface to the underlying models. Three kinds of models are supported:\n- LLM models: these models generate \"raw\" and \"chat\" (conversational) completions.\n- Embedding models: these models generate embeddings to be used for semantic search.\n- Rerank models: these models reorder the documents based on their relevance to a query.",
            "x-displayName": "Llama Stack Inference API for generating completions, chat completions, and embeddings."
        },
        {
--- a/docs/static/llama-stack-spec.yaml
+++ b/docs/static/llama-stack-spec.yaml
@ -13460,7 +13460,8 @@ tags:
      - Embedding models: these models generate embeddings to be used for semantic
      search.

-      - Rerank models: these models reorder the documents by relevance.
+      - Rerank models: these models reorder the documents based on their relevance
+      to a query.
    x-displayName: >-
      Llama Stack Inference API for generating completions, chat completions, and
      embeddings.