Enhanced fork of the official ElevenLabs MCP server with conversational features
ElevenLabs MCP Enhanced
Enhanced fork of the official ElevenLabs MCP server with additional conversational AI features including conversation history and transcript retrieval.
This enhanced version is developed and maintained by Boris Djordjevic and the 199 Longevity team.
📑 Table of Contents
🚀 What's New in This Fork
This enhanced version adds critical conversational AI features missing from the original:
🤖 AI-Friendly Improvements (v1.0.0)
- ✅ Official v3 API: Now uses official ElevenLabs endpoints - no proxy needed!
- 🎯 Smart Voice Defaults:
search_voices()now returns common working voices instantly - 📚 Educational Error Messages: Errors guide AI agents to success with examples
- 💡 Clear Tool Guidance: No more confusion about single vs multi-speaker tools
- 🎤 Accurate v3 Voice IDs: All 20 v3-optimized voices now have correct IDs and descriptions
- 🏯 Auto-Split Long Dialogues: Automatically splits dialogues over 3000 chars into multiple files
- 🎯 Auto-Adjust Stability: Invalid stability values auto-round to nearest valid option (0.0, 0.5, 1.0)
- 🏷️ Smart Tag Simplification: Complex tags auto-convert to valid v3 tags for better quality
- ⏱️ Dynamic Timeouts: Prevents timeouts on complex dialogues by calculating appropriate wait times
🆕 ElevenLabs v3 Model Support (Official)
- 🎭 Enhanced Expressiveness: Use the official v3 model with
model="v3"parameter - 🎤 Audio Tags: Add emotions and sound effects like
[thoughtful],[crying],[laughing],[piano] - 👥 Multi-Speaker Dialogue: Generate natural conversations between multiple speakers
- ✨ Dialogue Enhancement: Automatically enhance your dialogue with proper formatting and tags
- 🌍 70+ Languages: v3 supports multilingual synthesis with emotional control
- ✅ Official API: Now uses the official ElevenLabs text-to-dialogue endpoint
🎙️ Conversational AI Features
- Conversation History: Retrieve full conversation details including transcripts
- 📝 Transcript Access: Get conversation transcripts in multiple formats (plain, timestamps, JSON)
- ⏳ Real-time Monitoring: Wait for ongoing conversations to complete and retrieve results
- 🔍 Conversation Search: List and filter conversations by agent, status, and more
- 🎨 Improved Formatting: Consistent formatting across all list operations
About
This is an enhanced fork of the official ElevenLabs Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech and audio processing APIs. This server allows MCP clients like Claude Desktop, Cursor, Windsurf, OpenAI Agents and others to generate speech, clone voices, transcribe audio, manage conversational AI agents, and now retrieve conversation history.
🚀 Quick Install
Zero Install (Recommended)
No installation required! Just use npx:
npx elevenlabs-mcp-enhanced --api-key YOUR_API_KEY
Global Install
Install once, use everywhere:
npm install -g elevenlabs-mcp-enhanced
elevenlabs-mcp-enhanced --api-key YOUR_API_KEY
Environment Variable
Set your API key once:
export ELEVENLABS_API_KEY="your-api-key"
npx elevenlabs-mcp-enhanced
📋 Requirements
- Node.js 16+ (for npm/npx)
- Python 3.11+ (automatically managed by the npm package)
- ElevenLabs API Key - Get one at elevenlabs.io
Quickstart with Claude Desktop
Option 1: Using npm/npx (Recommended - No installation required!)
- Get your API key from ElevenLabs. There is a f
Tools (4)
search_voicesReturns common working voices and their IDs.generate_speechGenerates speech from text using specified voice and model parameters.list_conversationsLists and filters conversation history by agent or status.get_transcriptRetrieves conversation transcripts in various formats.Environment Variables
ELEVENLABS_API_KEYrequiredYour ElevenLabs API key for authentication.Configuration
{"mcpServers": {"elevenlabs": {"command": "npx", "args": ["-y", "elevenlabs-mcp-enhanced"], "env": {"ELEVENLABS_API_KEY": "YOUR_API_KEY"}}}}