RedisVL MCP#

RedisVL includes an MCP server that exposes a Redis-backed retrieval surface through a small, deterministic tool contract. It is designed for AI applications that want to search or maintain data in one or more existing Redis indexes without each client reimplementing Redis query logic.

What RedisVL MCP Does#

The RedisVL MCP server sits between an MCP client and Redis:

  1. It connects to one or more existing Redis Search indexes.

  2. It inspects each index at startup and reconstructs its schema.

  3. It initializes vector capabilities only when the configured search or upsert behavior needs them.

  4. It exposes stable MCP tools for discovery, search, and optionally upsert.

This keeps each Redis index as the source of truth for its search behavior while giving MCP clients a predictable interface.

How RedisVL MCP Runs#

RedisVL MCP works with a focused model:

  • One server process binds to one or several existing Redis indexes, each addressed by a logical id.

  • The server supports stdio (default), Streamable HTTP, and SSE transports.

  • Search behavior is owned by per-index configuration, not by MCP callers.

  • Vector search and server-side embedding are optional capabilities configured explicitly per index.

  • Upsert is optional and can be disabled globally with read-only mode or per index with a read_only flag.

A single-index server remains the simplest deployment: when exactly one index is configured, callers can omit the index selector entirely and every tool call targets that index. Multi-index support is fully formal — it adds discovery and explicit routing without changing the single-index contract.

Config-Owned Search Behavior#

MCP callers can control:

  • query

  • limit

  • offset

  • filter

  • return_fields

These request-time controls are still bounded by runtime config. In particular, deep paging is limited by a configured maximum result window, enforced as offset + limit.

On a multi-index server, callers also choose which index to target through an optional index argument (see Index Selection). Callers do not choose:

  • whether retrieval is vector, fulltext, or hybrid

  • query tuning parameters such as hybrid fusion or vector runtime settings

That behavior lives in the per-index server config under indexes.<id>.search. The response includes search_type as informational metadata, but it is not a request parameter.

Single and Multiple Index Bindings#

The YAML config uses an indexes mapping. Each entry is a logical binding keyed by an id (for example knowledge or tickets) that points to an existing Redis index through redis_name. The mapping may contain one entry or several; each binding is inspected, validated, and given its own search config, runtime limits, and optional vectorizer independently at startup. Startup is all-or-nothing — if any binding fails to initialize, the server does not start.

A single-binding config is the simplest case and behaves exactly as before: the lone binding is the implicit target of every call. With multiple bindings the server stays a single process and endpoint, but callers select a binding per call.

Index Selection and Discovery#

On a multi-index server, every tool call must say which logical index it targets:

  • search-records and upsert-records accept an optional index argument naming the logical id.

  • When exactly one index is configured, index may be omitted and resolves to that sole binding (backward compatible).

  • When multiple indexes are configured, omitting index is an invalid_request; the caller must name one.

  • An unknown logical id is an invalid_request.

  • Both tools echo the resolved index in their response so clients can confirm routing.

Because a client cannot guess the configured logical ids, multi-index servers expose a list-indexes discovery tool. Clients should call list-indexes first to enumerate the available indexes and their filterable fields, then pass the chosen id as index on subsequent calls.

Schema Inspection and Overrides#

RedisVL MCP is inspection-first:

  • the Redis index must already exist

  • the server reconstructs the schema from Redis metadata at startup

  • runtime field mappings remain explicit in config

In some environments, Redis metadata can be incomplete for vector field attributes. When that happens, schema_overrides can patch missing attrs for fields that were already discovered. It does not create new fields or change discovered field identity.

Startup also validates that the inspected schema does not collide with MCP-reserved score metadata field names for the configured search mode.

Read-Only and Read-Write Modes#

RedisVL MCP always registers search-records and list-indexes.

Write availability is enforced at two levels:

  • Global read-only mode disables writes across every binding. It is controlled by the CLI flag --read-only or the environment variable REDISVL_MCP_READ_ONLY=true.

  • Per-index read-only disables writes for a single binding via indexes.<id>.read_only: true, while other bindings stay writable.

These combine into each binding’s effective write availability: a binding is read-only if global read-only is on or that binding sets read_only: true. The upsert-records tool is registered only when at least one binding is writable, so a fully read-only server does not advertise it at all. When the tool is registered, a write to a read-only binding is rejected with forbidden before any data is changed. list-indexes reports each binding’s effective write availability as upsert_available.

Use read-only mode when Redis is serving approved content to assistants and another system owns ingestion — globally when no binding should accept writes, or per index when only some indexes are writable.

Authentication and Authorization#

The HTTP transports can require a JWT bearer token issued by an existing identity provider. The server validates the token signature, issuer, and audience, and can gate read vs write by scope or role claim. This is coarse, per-tool authorization; it does not map token claims to Redis ACL users or per-tenant filters, which remain a gateway concern. The stdio transport is local and is never authenticated.

For configuration and the gateway boundary, see Authenticate RedisVL MCP.

Tool Surface#

RedisVL MCP exposes up to three tools:

  • list-indexes enumerates the configured logical indexes for discovery (always available)

  • search-records searches a selected index using that index’s server-owned search mode

  • upsert-records validates and upserts records into a selected writable index, embedding them only when that capability is configured

These tools follow a stable contract:

  • request validation happens before query or write execution

  • the resolved logical index is echoed in every search-records and upsert-records response

  • filters support either raw strings or a RedisVL-backed JSON DSL

  • on a single-index server, search-records describes the inspected schema by advertising typed JSON DSL filter fields, object-filter exists support, and valid return_fields; on a multi-index server those hints are ambiguous, so the description instead directs clients to call list-indexes and pass index

  • error codes are mapped into a stable set of MCP-facing categories

list-indexes#

list-indexes returns one entry per configured binding so clients can route subsequent calls. Each entry reports:

  • the logical id

  • an optional description (only when configured)

  • upsert_available, reflecting the binding’s effective write availability

  • fields, the filterable fields discovered from the index

  • limits, only the runtime limits that were explicitly configured

The discovery payload is deliberately minimal:

  • the underlying Redis index name (redis_name) is never exposed

  • the vector field and the configured embed-source text field are omitted from fields, since they are implementation inputs rather than fields a client filters on

  • limits shows only explicitly set values (such as max_limit or max_upsert_records); defaults are not echoed

Why Use MCP Instead of Direct RedisVL Calls#

Use RedisVL MCP when you want a standard tool boundary for agent frameworks or assistants that already speak MCP.

Use direct RedisVL client code when your application should own index lifecycle, search construction, data loading, or richer RedisVL features directly in Python.

RedisVL MCP is a good fit when:

  • multiple assistants should share one approved retrieval surface

  • you want search behavior fixed by deployment config

  • you need a read-only or tightly controlled write boundary

  • you want to reuse an existing Redis index without rebuilding retrieval logic in every client

For setup steps, config, commands, and examples, see Run RedisVL MCP.