Adds vision capabilities to any AI model through OpenRouter
Vision MCP Server
Ever wanted to use a model like GLM-4.6 or other great AI models that just don't have vision capabilities? This MCP server solves that problem by adding vision capabilities to any model through OpenRouter's vision models.
The Problem
Some really good AI models don't support vision. You're stuck choosing between your preferred model or vision capabilities. This server bridges that gap by providing seamless vision capabilities through OpenRouter's vision models.
The Solution
This MCP server provides a simple analyze_image tool that can:
- Analyze images from URLs, file paths, or base64 data
- Use any vision model available on OpenRouter (Claude 3.5 Sonnet, GPT-4 Vision, etc.)
- Return detailed analysis results
- Handle errors gracefully with proper validation
System Requirements
Before installing, make sure you have:
- Node.js 18.0.0 or higher (recommended: Node.js 20+)
- npm 8.0.0 or higher (comes with Node.js)
Check Your Versions
node --version # Should show v18.0.0 or higher
npm --version # Should show 8.0.0 or higher
Install/Update Node.js
If you need to install or update Node.js:
- Download from official site: nodejs.org (recommended for beginners)
- Using Node Version Manager (nvm):
# Install nvm first, then: nvm install 20 nvm use 20 - Using package managers:
- macOS:
brew install node - Windows:
winget install OpenJS.NodeJS - Ubuntu/Debian:
sudo apt install nodejs npm
- macOS:
Important: This server is written in TypeScript and uses dependencies (like
node-fetchv3) that require Node.js 18+. Older versions (like Node.js 16 or below) will not work.
Quick Start
Step 1: Get Your OpenRouter API Key
- Go to OpenRouter
- Sign up or log in to your account
- Navigate to "Keys" in your dashboard
- Click "Create Key"
- Copy your API key (starts with
sk-or-v1-...) - Keep this key safe - you'll need it in Step 3
Step 2: Install the MCP Server
Option A: Install from npm (Recommended)
npm install -g @thenomadinorbit/vision-mcp-server
Success! The package is now globally available as
vision-mcpcommand.
Option B: Install from Source (Development)
git clone https://github.com/TheNomadInOrbit/vision-mcp-server.git
cd vision-mcp-server
npm install
npm run build
npm install -g .
Note: Use this method if you want to modify the source code or contribute to the project.
Step 3: Configure Your MCP Client
Add this server configuration to your MCP client:
{
"mcpServers": {
"vision-analyzer": {
"command": "vision-mcp",
"type": "stdio",
"timeout": 60,
"disabled": false,
"autoApprove": [],
"env": {
"OPENROUTER_API_KEY": "your_api_key_here",
"OPENROUTER_MODEL": "anthropic/claude-3-5-sonnet"
}
}
}
}
Step 4: Test Your Installation
Important: The vision-mcp command requires an OpenRouter API key to run. You cannot test it directly without configuration.
Quick Test (with your API key):
OPENROUTER_API_KEY="your_api_key_here" vision-mcp --help
You should see the server start up with logs like:
Application initialized successfully
Starting Vision MCP Server...
MCP server started successfully
Vision MCP Server is running on stdio
Press Ctrl+C to stop the test.
What happens if you run `vision-mcp` without the API key?
vision-mcp
You'll get this error (this is normal and expected):
Error: OPENROUTER_API_KEY environment variable is required
This means the installation worked! The server is just protecting you from running without proper configuration.
Verify Installation Status:
# Check if the command is available
which vision-mcp
# Check if the package is installed
npm list -g @thenomadinorbit/vision-mcp-server
🔧 Configuration Options
Basic Configuration
"vision-analyzer"- Server name (you can change this to anything you like)"command": "vision-mcp"- Required: The global command to run the server"type": "stdio"- Required: Communication protocol for MCP"timeout": 60- Optional: Timeout in seconds (default: 60)"disabled": false- Optional: Set totrueto disable the server
Auto-Approve Settings
Configure which tools can run without asking for permission:
"autoApprove": []
Options:
[](empty) - Requires approval for all tools (safest)["list_models"]- Auto-approve listing available models only["analyze_image"]- Auto-approve vision analysis (convenient but less safe)["analyze_image", "list_models"]- Auto-approve all tools (most convenient)
Mod
Tools (1)
analyze_imageAnalyzes images from URLs, file paths, or base64 data using OpenRouter vision models.Environment Variables
OPENROUTER_API_KEYrequiredYour API key from OpenRouterOPENROUTER_MODELThe specific vision model to use from OpenRouterConfiguration
{
"mcpServers": {
"vision-analyzer": {
"command": "vision-mcp",
"type": "stdio",
"env": {
"OPENROUTER_API_KEY": "your_api_key_here",
"OPENROUTER_MODEL": "anthropic/claude-3-5-sonnet"
}
}
}
}