RedisVL MCP#
RedisVL includes an MCP server that exposes a Redis-backed retrieval surface through a small, deterministic tool contract. It is designed for AI applications that want to search or maintain data in one or more existing Redis indexes without each client reimplementing Redis query logic.
What RedisVL MCP Does#
The RedisVL MCP server sits between an MCP client and Redis:
It connects to one or more existing Redis Search indexes.
It inspects each index at startup and reconstructs its schema.
It initializes vector capabilities only when the configured search or upsert behavior needs them.
It exposes stable MCP tools for discovery, search, and optionally upsert.
This keeps each Redis index as the source of truth for its search behavior while giving MCP clients a predictable interface.
How RedisVL MCP Runs#
RedisVL MCP works with a focused model:
One server process binds to one or several existing Redis indexes, each addressed by a logical id.
The server supports stdio (default), Streamable HTTP, and SSE transports.
Search behavior is owned by per-index configuration, not by MCP callers.
Vector search and server-side embedding are optional capabilities configured explicitly per index.
Upsert is optional and can be disabled globally with read-only mode or per index with a
read_onlyflag.
A single-index server remains the simplest deployment: when exactly one index is configured, callers can omit the index selector entirely and every tool call targets that index. Multi-index support is fully formal — it adds discovery and explicit routing without changing the single-index contract.
Config-Owned Search Behavior#
MCP callers can control:
querylimitoffsetfilterreturn_fields
These request-time controls are still bounded by runtime config. In particular,
deep paging is limited by a configured maximum result window, enforced as
offset + limit.
On a multi-index server, callers also choose which index to target through an optional index argument (see Index Selection). Callers do not choose:
whether retrieval is
vector,fulltext, orhybridquery tuning parameters such as hybrid fusion or vector runtime settings
That behavior lives in the per-index server config under indexes.<id>.search. The response includes search_type as informational metadata, but it is not a request parameter.
Single and Multiple Index Bindings#
The YAML config uses an indexes mapping. Each entry is a logical binding keyed by an id (for example knowledge or tickets) that points to an existing Redis index through redis_name. The mapping may contain one entry or several; each binding is inspected, validated, and given its own search config, runtime limits, and optional vectorizer independently at startup. Startup is all-or-nothing — if any binding fails to initialize, the server does not start.
A single-binding config is the simplest case and behaves exactly as before: the lone binding is the implicit target of every call. With multiple bindings the server stays a single process and endpoint, but callers select a binding per call.
Index Selection and Discovery#
On a multi-index server, every tool call must say which logical index it targets:
search-recordsandupsert-recordsaccept an optionalindexargument naming the logical id.When exactly one index is configured,
indexmay be omitted and resolves to that sole binding (backward compatible).When multiple indexes are configured, omitting
indexis aninvalid_request; the caller must name one.An unknown logical id is an
invalid_request.Both tools echo the resolved
indexin their response so clients can confirm routing.
Because a client cannot guess the configured logical ids, multi-index servers expose a list-indexes discovery tool. Clients should call list-indexes first to enumerate the available indexes and their filterable fields, then pass the chosen id as index on subsequent calls.
Schema Inspection and Overrides#
RedisVL MCP is inspection-first:
the Redis index must already exist
the server reconstructs the schema from Redis metadata at startup
runtime field mappings remain explicit in config
In some environments, Redis metadata can be incomplete for vector field attributes. When that happens, schema_overrides can patch missing attrs for fields that were already discovered. It does not create new fields or change discovered field identity.
Startup also validates that the inspected schema does not collide with MCP-reserved score metadata field names for the configured search mode.
Read-Only and Read-Write Modes#
RedisVL MCP always registers search-records and list-indexes.
Write availability is enforced at two levels:
Global read-only mode disables writes across every binding. It is controlled by the CLI flag
--read-onlyor the environment variableREDISVL_MCP_READ_ONLY=true.Per-index read-only disables writes for a single binding via
indexes.<id>.read_only: true, while other bindings stay writable.
These combine into each binding’s effective write availability: a binding is read-only if global read-only is on or that binding sets read_only: true. The upsert-records tool is registered only when at least one binding is writable, so a fully read-only server does not advertise it at all. When the tool is registered, a write to a read-only binding is rejected with forbidden before any data is changed. list-indexes reports each binding’s effective write availability as upsert_available.
Use read-only mode when Redis is serving approved content to assistants and another system owns ingestion — globally when no binding should accept writes, or per index when only some indexes are writable.
Tool Surface#
RedisVL MCP exposes up to three tools:
list-indexesenumerates the configured logical indexes for discovery (always available)search-recordssearches a selected index using that index’s server-owned search modeupsert-recordsvalidates and upserts records into a selected writable index, embedding them only when that capability is configured
These tools follow a stable contract:
request validation happens before query or write execution
the resolved logical
indexis echoed in everysearch-recordsandupsert-recordsresponsefilters support either raw strings or a RedisVL-backed JSON DSL
on a single-index server,
search-recordsdescribes the inspected schema by advertising typed JSON DSL filter fields, object-filterexistssupport, and validreturn_fields; on a multi-index server those hints are ambiguous, so the description instead directs clients to calllist-indexesand passindexerror codes are mapped into a stable set of MCP-facing categories
list-indexes#
list-indexes returns one entry per configured binding so clients can route subsequent calls. Each entry reports:
the logical
idan optional
description(only when configured)upsert_available, reflecting the binding’s effective write availabilityfields, the filterable fields discovered from the indexlimits, only the runtime limits that were explicitly configured
The discovery payload is deliberately minimal:
the underlying Redis index name (
redis_name) is never exposedthe vector field and the configured embed-source text field are omitted from
fields, since they are implementation inputs rather than fields a client filters onlimitsshows only explicitly set values (such asmax_limitormax_upsert_records); defaults are not echoed
Why Use MCP Instead of Direct RedisVL Calls#
Use RedisVL MCP when you want a standard tool boundary for agent frameworks or assistants that already speak MCP.
Use direct RedisVL client code when your application should own index lifecycle, search construction, data loading, or richer RedisVL features directly in Python.
RedisVL MCP is a good fit when:
multiple assistants should share one approved retrieval surface
you want search behavior fixed by deployment config
you need a read-only or tightly controlled write boundary
you want to reuse an existing Redis index without rebuilding retrieval logic in every client
For setup steps, config, commands, and examples, see Run RedisVL MCP.