Mnemo MCP Server

1

Add it to Claude Code

Run this in a terminal.

Run in terminal
claude mcp add mnemo-mcp -- uvx mnemo-mcp@latest
README.md

Persistent AI memory with hybrid search and embedded sync.

Mnemo MCP Server

mcp-name: io.github.n24q02m/mnemo-mcp

Persistent AI memory with hybrid search and embedded sync. Open, free, unlimited.

Features

  • Hybrid search -- FTS5 full-text + sqlite-vec semantic + reranking for precision
  • Knowledge graph -- Automatic entity extraction and relation tracking across memories
  • Importance scoring -- LLM-scored 0.0-1.0 per memory for smarter retrieval
  • Auto-archive -- Configurable age + importance threshold to keep memory clean
  • STM-to-LTM consolidation -- LLM summarization of related memories in a category
  • Duplicate detection -- Warns before adding semantically similar memories
  • Zero config -- Built-in local Qwen3 embedding + reranking, no API keys needed. Optional cloud providers (Jina AI, Gemini, OpenAI, Cohere)
  • Multi-machine sync -- JSONL-based merge sync via embedded rclone (Google Drive, S3, Dropbox)
  • Proactive memory -- Tool descriptions guide AI to save preferences, decisions, facts

Quick Start

Claude Code Plugin (Recommended)

claude plugin add n24q02m/mnemo-mcp

MCP Server

Option 1: uvx
{
  "mcpServers": {
    "mnemo": {
      "command": "uvx",
      "args": ["mnemo-mcp@latest"],
      "env": {
        // -- optional: cloud embedding + reranking (Jina AI recommended)
        "API_KEYS": "JINA_AI_API_KEY:jina_...",
        // -- or: "API_KEYS": "GOOGLE_API_KEY:AIza...,COHERE_API_KEY:co-...",
        // -- without API_KEYS, uses built-in local Qwen3 ONNX models (CPU, ~570MB first download)
        // -- optional: LiteLLM Proxy (production, selfhosted gateway)
        // "LITELLM_PROXY_URL": "http://10.0.0.20:4000",
        // "LITELLM_PROXY_KEY": "sk-your-virtual-key",
        // -- optional: sync memories across machines via rclone
        "SYNC_ENABLED": "true",                    // default: false
        "SYNC_INTERVAL": "300"                     // auto-sync every 5min (0 = manual only)
      }
    }
  }
}
Option 2: Docker
{
  "mcpServers": {
    "mnemo": {
      "command": "docker",
      "args": [
        "run", "-i", "--rm",
        "--name", "mcp-mnemo",
        "-v", "mnemo-data:/data",
        "-e", "API_KEYS",
        "-e", "SYNC_ENABLED",
        "-e", "SYNC_INTERVAL",
        "n24q02m/mnemo-mcp:latest"
      ],
      "env": {
        "API_KEYS": "JINA_AI_API_KEY:jina_...",
        "SYNC_ENABLED": "true",
        "SYNC_INTERVAL": "300"
      }
    }
  }
}

Pre-install (optional)

# Pre-download embedding model (~570MB) and validate API keys
uvx mnemo-mcp warmup

# With cloud embedding (validates API key, skips local download if cloud works)
API_KEYS="JINA_AI_API_KEY:jina_..." uvx mnemo-mcp warmup

Sync setup

Sync is fully automatic. Just set SYNC_ENABLED=true and the server handles everything:

  1. First sync: rclone is auto-downloaded, a browser opens for OAuth authentication
  2. Token saved: OAuth token is stored locally at ~/.mnemo-mcp/tokens/ (600 permissions)
  3. Subsequent runs: Token is loaded automatically -- no manual steps needed

For non-Google Drive providers, set SYNC_PROVIDER and SYNC_REMOTE:

{
  "SYNC_ENABLED": "true",
  "SYNC_PROVIDER": "dropbox",
  "SYNC_REMOTE": "dropbox"
}

Tools

Tool Actions Description
memory add, search, list, update, delete, export, import, stats, restore, archived, consolidate Core memory CRUD, hybrid search, import/export, archival, and LLM consolidation

Tools (1)

memoryCore memory CRUD, hybrid search, import/export, archival, and LLM consolidation.

Environment Variables

API_KEYSOptional cloud provider API keys for embedding and reranking.
SYNC_ENABLEDEnable memory synchronization across machines.
SYNC_INTERVALAuto-sync interval in seconds.
SYNC_PROVIDERStorage provider for rclone sync (e.g., dropbox, s3).
SYNC_REMOTERemote path configuration for rclone.

Configuration

claude_desktop_config.json
{"mcpServers": {"mnemo": {"command": "uvx", "args": ["mnemo-mcp@latest"], "env": {"SYNC_ENABLED": "true", "SYNC_INTERVAL": "300"}}}}

Try it

Save the fact that I prefer using Python for data analysis tasks to my memory.
Search my memory for previous decisions regarding the project architecture.
Consolidate my recent notes about the Qwen3 model into a summary.
List all archived memories from the last month.

Frequently Asked Questions

What are the key features of Mnemo?

Hybrid search combining FTS5 full-text and sqlite-vec semantic search.. Automatic entity extraction and relation tracking via knowledge graph.. LLM-based importance scoring for smarter memory retrieval.. Multi-machine sync using embedded rclone.. Built-in local Qwen3 embedding model for zero-config operation..

What can I use Mnemo for?

Maintaining a persistent knowledge base of project preferences and facts across different machines.. Automatically archiving old or low-importance memories to keep the workspace clean.. Retrieving context from past conversations or research using semantic search.. Syncing AI memory state between local development environments and cloud-based instances..

How do I install Mnemo?

Install Mnemo by running: claude plugin add n24q02m/mnemo-mcp

What MCP clients work with Mnemo?

Mnemo works with any MCP-compatible client including Claude Desktop, Claude Code, Cursor, and other editors with MCP support.

Turn this server into reusable context

Keep Mnemo docs, env vars, and workflow notes in Conare so your agent carries them across sessions.

Need the old visual installer? Open Conare IDE.
Open Conare