Update docs to include rerank models

This commit is contained in:
Jiayi 2025-09-10 11:50:55 -07:00
parent 78375889ec
commit d1b4e090ef
2 changed files with 5 additions and 3 deletions

View file

@ -17875,7 +17875,7 @@
}, },
{ {
"name": "Inference", "name": "Inference",
"description": "This API provides the raw interface to the underlying models. Two kinds of models are supported:\n- LLM models: these models generate \"raw\" and \"chat\" (conversational) completions.\n- Embedding models: these models generate embeddings to be used for semantic search.", "description": "This API provides the raw interface to the underlying models. Three kinds of models are supported:\n- LLM models: these models generate \"raw\" and \"chat\" (conversational) completions.\n- Embedding models: these models generate embeddings to be used for semantic search.\n- Rerank models: these models reorder the documents by relevance.",
"x-displayName": "Llama Stack Inference API for generating completions, chat completions, and embeddings." "x-displayName": "Llama Stack Inference API for generating completions, chat completions, and embeddings."
}, },
{ {

View file

@ -13452,13 +13452,15 @@ tags:
description: '' description: ''
- name: Inference - name: Inference
description: >- description: >-
This API provides the raw interface to the underlying models. Two kinds of models This API provides the raw interface to the underlying models. Three kinds of
are supported: models are supported:
- LLM models: these models generate "raw" and "chat" (conversational) completions. - LLM models: these models generate "raw" and "chat" (conversational) completions.
- Embedding models: these models generate embeddings to be used for semantic - Embedding models: these models generate embeddings to be used for semantic
search. search.
- Rerank models: these models reorder the documents by relevance.
x-displayName: >- x-displayName: >-
Llama Stack Inference API for generating completions, chat completions, and Llama Stack Inference API for generating completions, chat completions, and
embeddings. embeddings.