Question 1

What are the key features of MCP Vision Server?

Accepted Answer

Advanced image content recognition and description. Multi-turn visual dialogue capabilities. Support for local file paths and Base64 encoded images. OpenAI-compatible API integration. Persistent conversation history storage.

Question 2

What can I use MCP Vision Server for?

Accepted Answer

Automating data extraction from scanned documents or invoices. Analyzing UI screenshots for accessibility or design feedback. Transcribing code snippets from images into editable text. Interpreting complex charts and graphs through conversational AI.

Question 3

What tools does MCP Vision Server provide?

Accepted Answer

analyze_image: Analyzes image content and returns a detailed description, including OCR and code recognition.. chat_vision: Conducts multi-turn visual dialogues based on an image.. get_status: Retrieves the current server running status, model information, and session state..

Question 4

How do I install MCP Vision Server?

Accepted Answer

Install MCP Vision Server by running: pip install -e .

Question 5

What are the requirements for MCP Vision Server?

Accepted Answer

MCP Vision Server requires the following environment variables: VISION_API_KEY (required), VISION_BASE_URL (required), VISION_MODEL (required), VISION_MAX_IMAGE_SIZE (optional), VISION_TIMEOUT (optional), VISION_TEMPERATURE (optional), VISION_MAX_TOKENS (optional), VISION_LOG_LEVEL (optional), VISION_MAX_HISTORY (optional), VISION_ENABLE_PERSISTENCE (optional), VISION_HISTORY_PATH (optional). You'll also need a compatible MCP client like Claude Desktop or Claude Code.

Question 6

Is MCP Vision Server free to use?

Accepted Answer

Yes, MCP Vision Server is open source and free to use. You can find the source code on GitHub.

Question 7

What MCP clients support MCP Vision Server?

Accepted Answer

MCP Vision Server works with any MCP-compatible client including Claude Desktop (Anthropic's official desktop app), Claude Code (CLI tool), Cursor, and other editors with MCP support.

Question 8

How do I configure MCP Vision Server?

Accepted Answer

Configure MCP Vision Server by adding it to your MCP client's config file. The setup block at the top of this page generates a ready-to-paste config for Claude Code, Cursor, Codex, Windsurf, and Claude Desktop.

变量名	说明	默认值
`VISION_API_KEY`	API 密钥	-
`VISION_BASE_URL`	API 基础 URL	-
`VISION_MODEL`	模型名称	`glm-4v`
`VISION_MAX_IMAGE_SIZE`	最大图像大小（字节）	`20971520` (20MB)
`VISION_TIMEOUT`	请求超时（秒）	`120`
`VISION_TEMPERATURE`	温度参数	`0.7`
`VISION_MAX_TOKENS`	最大输出 tokens	`4096`
`VISION_LOG_LEVEL`	日志级别	`INFO`
`VISION_MAX_HISTORY`	对话历史最大保存数	`50`
`VISION_ENABLE_PERSISTENCE`	启用持久化	`true`
`VISION_HISTORY_PATH`	历史文件路径	`~/.mcp-vision/history.json`

MCP Vision Server MCP Server

MCP Vision Server - 图像识别 MCP 服务器

特性

安装

配置

使用方法

启动服务器

Web 配置工具

MCP 工具

1. analyze_image - 图像分析

2. chat_vision - 两轮对话

3. get_status - 状态查询

输入格式

1. 本地文件路径

2. Base64 编码

环境变量

支持的图像格式

项目结构

在 Claude Code 中配置

许可证

Tools 3

Environment Variables

Try it

Frequently Asked Questions

Turn this server into reusable context