GLM OCR MCP Server

MCP server for extracting text from images and PDFs using ZhipuAI GLM-OCR.

Usage

Add the server to your MCP client configuration (e.g. claude_desktop_config.json):

{
  "mcpServers": {
    "glm-ocr": {
      "command": "uvx",
      "args": ["glm-ocr-mcp"],
      "env": {
        "ZHIPU_API_KEY": "your_api_key_here",
        "ZHIPU_OCR_API_URL": "https://open.bigmodel.cn/api/paas/v4/layout_parsing"
      }
    }
  }
}

Using with Claude Code

claude mcp add --scope user glm-ocr \
  --env ZHIPU_API_KEY=your_api_key_here \
  --env ZHIPU_OCR_API_URL=https://open.bigmodel.cn/api/paas/v4/layout_parsing \
  -- uvx glm-ocr-mcp

Using with Codex

Add the MCP server with:

codex mcp add glm-ocr \
  --env ZHIPU_API_KEY=your_api_key_here \
  --env ZHIPU_OCR_API_URL=https://open.bigmodel.cn/api/paas/v4/layout_parsing \
  -- uvx glm-ocr-mcp

Tools

The server provides one tool:

  • extract_text: Extract text from a local file or URL (png, jpg/jpeg, pdf)
    • By default, returns Markdown text
    • Set return_json=true to return the structured JSON response without md_results; it contains page parsing details such as bbox_2d, content, label, etc.
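With return_json=true, the result carries per-page layout items with fields like bbox_2d, content, and label. Only those field names come from the description above; the surrounding payload shape in this sketch is an assumption, shown just to illustrate flattening such a result back into plain text:

```python
# Hypothetical payload shape; only the field names bbox_2d, content,
# and label come from the tool description above.
sample_result = {
    "pages": [
        {
            "items": [
                {"bbox_2d": [10, 10, 200, 40], "label": "title", "content": "Invoice #42"},
                {"bbox_2d": [10, 60, 200, 90], "label": "text", "content": "Total: $99.00"},
            ]
        }
    ]
}

def flatten_text(result: dict) -> str:
    """Join the 'content' of every layout item, page by page."""
    lines = []
    for page in result.get("pages", []):
        for item in page.get("items", []):
            lines.append(item["content"])
    return "\n".join(lines)
```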

Parameters:

  • file_path: Local file path or URL for png, jpg/jpeg, or pdf
  • base64_data: Optional data URL/base64 payload (use when file_path is unavailable)
  • start_page_id: Optional PDF start page (1-based, only effective for PDF)
  • end_page_id: Optional PDF end page (1-based, only effective for PDF)
  • return_json: Optional boolean, default false. true returns JSON; false returns Markdown.
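The rules above (exactly one input source, 1-based page range meaningful only for PDFs) can be sketched as a small validator. The helper name normalize_args and its error messages are illustrative, not taken from the server's source:

```python
def normalize_args(file_path=None, base64_data=None,
                   start_page_id=None, end_page_id=None,
                   return_json=False):
    """Validate extract_text-style arguments (illustrative helper).

    Exactly one of file_path / base64_data must be given; page ids are
    1-based and only meaningful together with a PDF input.
    """
    if (file_path is None) == (base64_data is None):
        raise ValueError("provide exactly one of file_path or base64_data")
    for name, page in (("start_page_id", start_page_id),
                       ("end_page_id", end_page_id)):
        if page is not None and page < 1:
            raise ValueError(f"{name} is 1-based and must be >= 1")
    if start_page_id and end_page_id and end_page_id < start_page_id:
        raise ValueError("end_page_id must not precede start_page_id")
    return {"file_path": file_path, "base64_data": base64_data,
            "start_page_id": start_page_id, "end_page_id": end_page_id,
            "return_json": bool(return_json)}
```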

Examples

# Extract text from local image
extract_text(file_path="./screenshot.png")

# Extract text from local PDF
extract_text(file_path="./document.pdf")

# Extract text from URL image
extract_text(file_path="https://example.com/test.jpg")

# Use base64/data URL
extract_text(base64_data="data:image/png;base64,iVBORw0KGgo...")

# Extract structured layout JSON
extract_text(file_path="https://example.com/test.png", return_json=True)
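For the base64_data form, a data URL can be built from a local file like this (the image/png media type is an assumption inferred from the file extension):

```python
import base64
from pathlib import Path

def to_data_url(path: str, media_type: str = "image/png") -> str:
    """Read a local file and wrap it as a data URL for base64_data."""
    payload = base64.b64encode(Path(path).read_bytes()).decode("ascii")
    return f"data:{media_type};base64,{payload}"
```

You could then call, for example, extract_text(base64_data=to_data_url("./screenshot.png")).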

Development

# Create virtual environment
uv venv
source .venv/bin/activate

# Sync dependencies and install current project
uv sync

# Run server for testing
python -m glm_ocr_mcp.server

Windows PowerShell activation:

.venv\Scripts\Activate.ps1

Project Structure

glm-ocr-mcp/
├── pyproject.toml         # Project configuration
├── README.md              # Documentation
├── .env.example           # Environment variable template
├── src/
│   └── glm_ocr_mcp/
│       ├── __init__.py
│       ├── __main__.py    # Entry point
│       ├── ocr.py         # OCR client
│       └── server.py      # MCP server


Environment Variables

  • ZHIPU_API_KEY (required): API key for accessing ZhipuAI services
  • ZHIPU_OCR_API_URL (required): Endpoint URL for the GLM-OCR layout parsing API
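Both variables must be set before the server starts. A minimal startup check might look like the following; the helper name require_env is illustrative, not part of the server:

```python
import os

def require_env(name: str) -> str:
    """Return a required environment variable's value, or fail loudly."""
    value = os.environ.get(name)
    if not value:
        raise RuntimeError(f"{name} is not set; see Environment Variables")
    return value

# Illustrative usage at startup:
# api_key = require_env("ZHIPU_API_KEY")
# api_url = require_env("ZHIPU_OCR_API_URL")
```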


Try it

  • Extract the text from the image located at ./invoice.png.
  • Read the first 5 pages of the document at ./manual.pdf and summarize the content.
  • Extract the text from this image URL: https://example.com/document.jpg
  • Perform OCR on ./report.pdf and return the result as structured JSON.

