What are the requirements for Site Crawler MCP?

Site Crawler MCP requires a compatible MCP client such as Claude Desktop, Claude Code, or Cursor. No additional environment variables are needed for basic setup.

Is Site Crawler MCP free to use?

Yes, Site Crawler MCP is open source and free to use. You can find the source code on GitHub.

What MCP clients support Site Crawler MCP?

Site Crawler MCP works with any MCP-compatible client including Claude Desktop (Anthropic's official desktop app), Claude Code (CLI tool), Cursor, and other editors with MCP support.

How do I configure Site Crawler MCP?

Configure Site Crawler MCP by adding it to your MCP client's config file. The setup block at the top of this page generates a ready-to-paste config for Claude Code, Cursor, Codex, Windsurf, and Claude Desktop.

MCP server/browser-automation

Site Crawler MCP Server

Q: What tools does Site Crawler MCP provide?

site_crawlAssets: Crawl a website and extract various assets based on specified modes..

Q: How do I install Site Crawler MCP?

Install Site Crawler MCP by running: uvx --from /path/to/site-crawler-mcp site-crawler-mcp

A powerful MCP server for crawling websites and extracting assets.

★ 5 AndacGuven/site-crawler-mcp ↗by AndacGuvenupdated Apr 9, 2026

Manual setup required. The maintainer's config contains paths only you know - edit the placeholders below before adding it to Claude Code.

Prepare the server locally

Run this once before adding it to Claude Code.

pip install site-crawler-mcp

Register it in Claude Code

claude mcp add site-crawler -- uvx --from /path/to/site-crawler-mcp site-crawler-mcp

Replace any placeholder paths in the command with the real path on your machine.

Make your agent remember this setup

site-crawler's config, env vars, and the gotchas you hit — recalled in every future Claude Code, Cursor, and Codex session.

npx conare@latest

Free · one command · indexes the sessions already on disk. Set up in the browser instead →

What it does

12 distinct extraction modes for comprehensive website analysis
Concurrent crawling with rate limiting for performance optimization
Detailed SEO auditing including meta tags and structured data
Security analysis covering HTTPS, SSL/TLS, and security headers
Business intelligence extraction including brand info and contact details

Tools 1

site_crawlAssetsCrawl a website and extract various assets based on specified modes.

Try it

→Crawl https://example.com and extract all images with their metadata.

→Perform a full SEO and security audit on https://example.com with a crawl depth of 2.

→Gather business intelligence for https://example.com including contact info and brand details.

→Check the legal compliance and privacy policy status for https://example.com.

Original README from AndacGuven/site-crawler-mcp

Site Crawler MCP

A powerful Model Context Protocol (MCP) server for crawling websites and extracting assets including images and SEO metadata. Built for e-commerce sites and general web crawling needs.

Features

Comprehensive website analysis: 12 different extraction modes for complete website insights
Multi-mode crawling: Extract multiple data types in a single pass
Smart extraction: Advanced pattern matching for accurate data extraction
Performance optimized: Concurrent crawling with rate limiting
Security analysis: HTTPS, security headers, SSL/TLS information
SEO analysis: Complete SEO audit including meta tags, structured data, and more
Legal compliance: KVKK, GDPR, privacy policy detection
Business intelligence: Brand info, references, contact details extraction

Installation

From PyPI (when published)

pip install site-crawler-mcp

From Source (Development)

Using uv (Recommended)

# Clone the repository
git clone https://github.com/AndacGuven/site-crawler-mcp.git
cd site-crawler-mcp

# Create virtual environment with Python 3.12
uv venv --python 3.12
source .venv/bin/activate  # On Windows: .venv\Scripts\activate

# Install dependencies and package
uv sync

Using pip

# Clone the repository
git clone https://github.com/AndacGuven/site-crawler-mcp.git
cd site-crawler-mcp

# Create virtual environment (recommended)
python -m venv venv

# Activate virtual environment
# On Windows:
venv\Scripts\activate
# On Linux/Mac:
source venv/bin/activate

# Install dependencies
pip install -r requirements.txt

# Install package in development mode
pip install -e .

Usage

As an MCP Server

Add to your MCP configuration file:

Windows: %APPDATA%\Claude\claude_desktop_config.json
macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Linux: ~/.config/Claude/claude_desktop_config.json

Using uvx (Recommended)

{
  "mcpServers": {
    "site-crawler": {
      "command": "uvx",
      "args": ["--from", "/path/to/site-crawler-mcp", "site-crawler-mcp"]
    }
  }
}

Using uv run

{
  "mcpServers": {
    "site-crawler": {
      "command": "uv",
      "args": ["run", "site_crawler"],
      "cwd": "/path/to/site-crawler-mcp"
    }
  }
}

Using python directly

{
  "mcpServers": {
    "site-crawler": {
      "command": "python",
      "args": ["-m", "site_crawler.server"],
      "cwd": "/path/to/site-crawler-mcp/src",
      "env": {
        "PYTHONPATH": "/path/to/site-crawler-mcp/src"
      }
    }
  }
}

Note: Replace /path/to/site-crawler-mcp with your actual project path. On Windows, use backslashes and drive letters (e.g., C:\\Users\\YourName\\site-crawler-mcp).

Available Tools

`site_crawlAssets`

Crawl a website and extract various assets based on specified modes.

Parameters:

url (string, required): The URL to start crawling from
modes (array, required): Array of extraction modes (see below)
depth (number, optional): Crawling depth (default: 1)
max_pages (number, optional): Maximum pages to crawl (default: 50)

Available Modes:

images: Extract all images with metadata (alt text, dimensions, format)
meta: Basic SEO metadata (title, description, H1 tags)
brand: Company branding information (logo, name, about pages)
seo: Comprehensive SEO analysis (meta tags, structured data, open graph)
performance: Page load metrics and performance indicators
security: Security headers and HTTPS configuration
compliance: Accessibility and regulatory compliance checks
infrastructure: Server technology and CDN detection
legal: Privacy policies, terms, KVKK compliance
careers: Job opportunities and career pages
references: Client testimonials and case studies
contact: Contact information (email, phone, social media, address)

Example Requests:

Basic image extraction:

{
  "tool": "site_crawlAssets",
  "arguments": {
    "url": "https://example.com",
    "modes": ["images"],
    "depth": 1
  }
}

Full SEO and security audit:

{
  "tool": "site_crawlAssets",
  "arguments": {
    "url": "https://example.com",
    "modes": ["seo", "security", "performance"],
    "depth": 2
  }
}

Business intelligence gathering:

{
  "tool": "site_crawlAssets",
  "arguments": {
    "url": "https://example.com",
    "modes": ["brand", "contact", "references", "careers"],
    "depth": 3
  }
}

Legal compliance check:

{
  "tool": "site_crawlAssets",
  "arguments": {
    "url": "https://example.com",
    "modes": ["legal", "compliance"],
    "depth": 2
  }
}

Development

Requirements

Python 3.10+
BeautifulSoup4
aiohttp
MCP SDK
uv (recommended for development)

Setup Development Environment

Using uv (Recommended)

# Clone the repository
git clone https://github.com/AndacGuven/site-

Frequently Asked Questions

What are the key features of Site Crawler MCP?

12 distinct extraction modes for comprehensive website analysis. Concurrent crawling with rate limiting for performance optimization. Detailed SEO auditing including meta tags and structured data. Security analysis covering HTTPS, SSL/TLS, and security headers. Business intelligence extraction including brand info and contact details.

What can I use Site Crawler MCP for?

Conducting automated SEO audits for e-commerce websites. Extracting contact and brand information for lead generation. Verifying legal and regulatory compliance across multiple web pages. Analyzing website performance and security headers for technical audits.

How do I install Site Crawler MCP?

Install Site Crawler MCP by running: uvx --from /path/to/site-crawler-mcp site-crawler-mcp

What MCP clients work with Site Crawler MCP?

Site Crawler MCP works with any MCP-compatible client including Claude Desktop, Claude Code, Cursor, and other editors with MCP support.

Conare · memory for coding agents

Turn this server into reusable context

Keep Site Crawler MCP docs, env vars, and workflow notes in Conare so your agent carries them across sessions.

Set up free$npx conare@latest

Prepare the server locally

Register it in Claude Code

Make your agent remember this setup

What it does

Tools 1

Try it

Site Crawler MCP

Features

Installation

From PyPI (when published)

From Source (Development)

Using uv (Recommended)

Using pip

Usage

As an MCP Server

Using uvx (Recommended)

Using uv run

Using python directly

Available Tools

`site_crawlAssets`

Development

Requirements

Setup Development Environment

Using uv (Recommended)

Frequently Asked Questions

Turn this server into reusable context

Related MCP Servers