llama-stack-mirror/docs/source/providers/vector_io/milvus.md
Varsha Prasad Narsing 2cb927b498 Feat: Implement keyword search in milvus
Signed-off-by: Varsha Prasad Narsing <varshaprasad96@gmail.com>
2025-06-16 16:02:18 -07:00

3.7 KiB

orphan
true

Milvus

Milvus is an inline and remote vector database provider for Llama Stack. It allows you to store and query vectors directly within a Milvus database. That means you're not limited to storing vectors in memory or in a separate service.

Features

  • Easy to use
  • Fully integrated with Llama Stack

Usage

To use Milvus in your Llama Stack project, follow these steps:

  1. Install the necessary dependencies.
  2. Configure your Llama Stack project to use Milvus.
  3. Start storing and querying vectors.

Installation

You can install Milvus using pymilvus:

pip install pymilvus

Configuration

In Llama Stack, Milvus can be configured in two ways:

  • Inline (Local) Configuration - Uses Milvus-Lite for local storage
  • Remote Configuration - Connects to a remote Milvus server

Inline (Local) Configuration

The simplest method is local configuration, which requires setting db_path, a path for locally storing Milvus-Lite files:

vector_io:
  - provider_id: milvus
    provider_type: inline::milvus
    config:
      db_path: ~/.llama/distributions/together/milvus_store.db

Remote Configuration

Remote configuration is suitable for larger data storage requirements:

Standard Remote Connection

vector_io:
  - provider_id: milvus
    provider_type: remote::milvus
    config:
      uri: "http://<host>:<port>"
      token: "<user>:<password>"

TLS-Enabled Remote Connection (One-way TLS)

For connections to Milvus instances with one-way TLS enabled:

vector_io:
  - provider_id: milvus
    provider_type: remote::milvus
    config:
      uri: "https://<host>:<port>"
      token: "<user>:<password>"
      secure: True
      server_pem_path: "/path/to/server.pem"

Mutual TLS (mTLS) Remote Connection

For connections to Milvus instances with mutual TLS (mTLS) enabled:

vector_io:
  - provider_id: milvus
    provider_type: remote::milvus
    config:
      uri: "https://<host>:<port>"
      token: "<user>:<password>"
      secure: True
      ca_pem_path: "/path/to/ca.pem"
      client_pem_path: "/path/to/client.pem"
      client_key_path: "/path/to/client.key"

Key Parameters for TLS Configuration

  • secure: Enables TLS encryption when set to true. Defaults to false.
  • server_pem_path: Path to the server certificate for verifying the server's identity (used in one-way TLS).
  • ca_pem_path: Path to the Certificate Authority (CA) certificate for validating the server certificate (required in mTLS).
  • client_pem_path: Path to the client certificate file (required for mTLS).
  • client_key_path: Path to the client private key file (required for mTLS).

Supported Search Modes

The Milvus provider supports both vector-based and keyword-based (full-text) search modes, but with some limitations:

  • Remote Milvus supports both vector-based and keyword-based search modes.
  • Inline Milvus (Milvus-Lite) only supports vector-based search. Keyword search is not supported as Milvus-Lite has not implemented this functionality yet. For updates on this feature, see Milvus GitHub Issue #40848.

When using the RAGTool interface, you can specify the desired search behavior via the mode parameter in RAGQueryConfig. For more details on Milvus's implementation of keyword search modes, refer to the Milvus documentation.

Documentation

See the Milvus documentation for more details about Milvus in general.

For more details on TLS configuration, refer to the TLS setup guide.