MCP Server Scraper MCP Server

Add it to Claude Code

Run this in a terminal.

claude mcp add mcp-server-scraper -- npx -y mcp-server-scraper

mcp-server-scraper

Extract clean, readable content from any URL. Returns markdown text, links, and metadata. No API keys, no config. A free alternative to Firecrawl for scraping docs, blogs, and articles.

npx mcp-server-scraper

Works with Claude Desktop, Cursor, VS Code Copilot, and any MCP client. No accounts or API keys needed.

Why

When you're working with an AI assistant and need to reference a docs page, a blog post, or an API reference, you usually end up copy-pasting content manually. Tools like Firecrawl solve this but require a paid API key. This server does the same thing for free. It fetches a URL, runs it through Mozilla Readability (the same engine behind Firefox Reader View), and returns clean markdown. It works well for server-rendered content like documentation sites, blog posts, and articles. It won't handle JavaScript-heavy SPAs, but for the most common use case of "read this docs page and summarize it," it does the job.

Tools

Tool              What it does
scrape_url        Extract clean text content from a URL (Readability-powered)
extract_links     Get all links with href and anchor text
extract_metadata  Get title, description, OG tags, canonical, favicon
search_page       Search for a query string within the page, return matching lines
scrape_multiple   Batch scrape multiple URLs, get title + excerpt per URL
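
To make the search_page row concrete, here is a minimal TypeScript sketch of line-level matching over text that has already been extracted. It is not the server's code: the function name searchLines, the case-insensitive comparison, and the 1-based line numbers are all assumptions made for illustration.

```typescript
// Hypothetical helper: not part of mcp-server-scraper's published API.
interface LineMatch {
  lineNumber: number; // 1-based position in the extracted text (assumed convention)
  line: string;       // the matching line, trimmed
}

function searchLines(text: string, query: string): LineMatch[] {
  const needle = query.trim().toLowerCase();
  if (needle === "") return []; // an empty query matches nothing

  const matches: LineMatch[] = [];
  text.split(/\r?\n/).forEach((raw, index) => {
    // Case-insensitive substring match (an assumption, not confirmed behavior).
    if (raw.toLowerCase().includes(needle)) {
      matches.push({ lineNumber: index + 1, line: raw.trim() });
    }
  });
  return matches;
}

const sampleText = [
  "# Getting started",
  "Authentication uses a bearer token.",
  "Rate limits apply per key.",
  "See the authentication guide for details.",
].join("\n");

console.log(searchLines(sampleText, "authentication"));
```

Returning whole lines with their positions keeps each hit small enough to quote back in a chat while still pointing at where it sits in the page.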

Quick Start

Cursor

Add to .cursor/mcp.json:

{
  "mcpServers": {
    "scraper": {
      "command": "npx",
      "args": ["-y", "mcp-server-scraper"]
    }
  }
}

Claude Desktop

Add to claude_desktop_config.json:

{
  "mcpServers": {
    "scraper": {
      "command": "npx",
      "args": ["-y", "mcp-server-scraper"]
    }
  }
}

VS Code

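Add to your MCP settings. The "mcp" wrapper below is the settings.json form; a workspace .vscode/mcp.json file puts "servers" at the top level instead: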

{
  "mcp": {
    "servers": {
      "scraper": {
        "command": "npx",
        "args": ["-y", "mcp-server-scraper"]
      }
    }
  }
}
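
If a config file already lists other servers, the scraper entry should be merged in rather than pasted over the whole file. The sketch below shows one way to do that for the "mcpServers" shape used in the Cursor and Claude Desktop snippets. The helper name addServer and the "other" server entry are hypothetical, and reading or writing the file itself is left out.

```typescript
// Shape shared by the Cursor and Claude Desktop snippets.
interface ServerEntry {
  command: string;
  args: string[];
}

interface McpConfig {
  mcpServers?: Record<string, ServerEntry>;
  [key: string]: unknown; // keep any unrelated settings untouched
}

// Hypothetical helper: returns a new config with the entry added,
// leaving servers that are already configured in place.
function addServer(config: McpConfig, name: string, entry: ServerEntry): McpConfig {
  return {
    ...config,
    mcpServers: { ...(config.mcpServers ?? {}), [name]: entry },
  };
}

// Placeholder for whatever is already in the file.
const existingConfig: McpConfig = {
  mcpServers: { other: { command: "node", args: ["other-server.js"] } },
};

const updatedConfig = addServer(existingConfig, "scraper", {
  command: "npx",
  args: ["-y", "mcp-server-scraper"],
});

console.log(JSON.stringify(updatedConfig, null, 2));
```

The helper copies rather than mutates, so the original object can still be compared against the result before anything is written back to disk.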

Examples

  • "Scrape the API docs from https://docs.example.com and summarize them"
  • "Extract all links from this page"
  • "What's the OG image and description for this URL?"
  • "Search this page for mentions of 'authentication'"
  • "Scrape these 5 URLs and give me a summary of each"
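
Behind prompts like these, an MCP client sends the server a JSON-RPC 2.0 tools/call request. A minimal sketch of that message for scrape_url follows. The tools/call method and the name/arguments params come from the MCP specification; the argument key "url" is an assumption, since this README does not list the tools' input schemas.

```typescript
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: string; arguments: Record<string, unknown> };
}

// Build the JSON-RPC message an MCP client would send to invoke a tool.
function buildToolCall(
  id: number,
  name: string,
  args: Record<string, unknown>,
): ToolCallRequest {
  return { jsonrpc: "2.0", id, method: "tools/call", params: { name, arguments: args } };
}

// "url" is an assumed argument name; the real schema is whatever the
// server reports in its tools/list response.
const scrapeRequest = buildToolCall(1, "scrape_url", { url: "https://docs.example.com" });

console.log(JSON.stringify(scrapeRequest));
```

In normal use the client builds this message itself; the sketch only shows what the example prompts turn into on the wire.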

How it works

Uses Mozilla Readability (the engine behind Firefox Reader View) plus linkedom for fast HTML parsing in Node. No headless browser needed. Works best with server-rendered pages: docs, blogs, articles, news sites.
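
The server-rendered caveat follows from fetching without a browser: only the HTML the server sends is available, and scripts never run. The sketch below illustrates this with a deliberately crude regex-based text stripper standing in for a real parser. It is not how Readability or linkedom work internally, and the names visibleText, serverRendered, and spaShell are made up for the example.

```typescript
// Crude stand-in for real HTML parsing: drop script/style blocks, then all tags.
function visibleText(html: string): string {
  return html
    .replace(/<(script|style)\b[^>]*>[\s\S]*?<\/\1>/gi, " ")
    .replace(/<[^>]+>/g, " ")
    .replace(/\s+/g, " ")
    .trim();
}

// A server-rendered page ships its content in the markup.
const serverRendered = `
  <html><body>
    <article><h1>Install</h1><p>Run the installer, then restart.</p></article>
  </body></html>`;

// A typical SPA shell ships an empty mount point plus a script bundle.
const spaShell = `
  <html><body>
    <div id="root"></div>
    <script>document.getElementById("root").textContent = "Install";</script>
  </body></html>`;

console.log(visibleText(serverRendered));  // content is there without running any JS
console.log(visibleText(spaShell).length); // no text to extract until scripts run
```

For pages like the second one, a tool that executes JavaScript would be needed before any extraction step has text to work with.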

Development

npm install
npm run typecheck
npm run build
npm test

See also

More MCP servers and developer tools on my portfolio.

License

MIT © Ofer Shapira

Tools (5)

scrape_url        Extract clean text content from a URL using the Readability engine.
extract_links     Get all links with href and anchor text from a URL.
extract_metadata  Get title, description, OG tags, canonical, and favicon from a URL.
search_page       Search for a query string within the page and return matching lines.
scrape_multiple   Batch scrape multiple URLs to get title and excerpt per URL.

Configuration

claude_desktop_config.json
{"mcpServers": {"scraper": {"command": "npx", "args": ["-y", "mcp-server-scraper"]}}}

Try it

Scrape the API docs from https://docs.example.com and summarize them.
Extract all links from this page.
What's the OG image and description for this URL?
Search this page for mentions of 'authentication'.
Scrape these 5 URLs and give me a summary of each.

Frequently Asked Questions

What are the key features of MCP Server Scraper?

  • Extracts clean markdown from web pages using Mozilla Readability.
  • Retrieves page metadata including OG tags and canonical links.
  • Supports batch scraping of multiple URLs.
  • Performs text searching within page content.
  • No API keys or accounts required.

What can I use MCP Server Scraper for?

  • Summarizing documentation pages or blog posts for AI assistants.
  • Extracting metadata for social media sharing or link previews.
  • Quickly searching for specific information within long articles.
  • Aggregating content from multiple sources into a single context.

How do I install MCP Server Scraper?

Install MCP Server Scraper by running: npx mcp-server-scraper

What MCP clients work with MCP Server Scraper?

MCP Server Scraper works with any MCP-compatible client including Claude Desktop, Claude Code, Cursor, and other editors with MCP support.
