10 servers curated

Supercharge Your AI Agents with These Web Scraping MCP Servers

Web scraping for AI agents is notoriously brittle, often hampered by dynamic JavaScript rendering, anti-bot protections, and the sheer noise of modern HTML. Extracting clean, context-ready data while managing browser state and session persistence requires robust tooling that can handle the complexities of the DOM without overwhelming the LLM's context window.

Model Context Protocol (MCP) servers bridge this gap by providing standardized interfaces for agents to interact with the web. By offloading the heavy lifting—such as headless browser management, token compression, and visual element mapping—to these servers, agents can focus on reasoning and data synthesis rather than low-level DOM manipulation.

When selecting an MCP server, prioritize those that offer token-efficient output, such as markdown conversion, and those that provide reliable fallback mechanisms for when standard scraping fails. Evaluate the server based on its ability to handle your specific use case, whether that is simple data extraction, complex multi-step browser automation, or platform-specific scraping.

Also Worth Trying

MCP-Crawl4AI

0 stars

Built on the Crawl4AI engine, this server offers full MCP compliance and manages headless Chromium as a singleton. The crawl tool is specifically tuned to output LLM-friendly markdown and plain text, supporting deep site traversal.

2 toolswyattowalsh

Playwright Scraper

0 stars

This server leverages Playwright to handle complex, JS-rendered websites. Its scrape_to_markdown tool uses BeautifulSoup for reliable HTML cleanup, ensuring the resulting content is structured and readable for your agent.

1 toolssudinigoutham

Spectrawl

21 stars

Spectrawl is designed for stealth, integrating Camoufox and Playwright to bypass anti-bot measures. With tools like deepSearch and browse, it provides a unified layer for 24 different platforms, including CAPTCHA solving via Gemini Vision.

3 toolsFayAndXan

BrowseGrab

6 stars

BrowseGrab focuses on token efficiency by utilizing accessibility trees for navigation. Tools like browser_extract_content and browser_snapshot are optimized for local models, featuring a stable element reference system to reduce unnecessary LLM calls.

8 toolsQuartzUnit

Flyto Core

278 stars

Flyto Core is a high-precision automation engine that allows for full execution tracing and replayability. Its tools, such as browser.evaluate and browser.screenshot, provide a transparent audit trail for complex, multi-step agent workflows.

6 toolsflytohub

Skyvern

20.9k stars

Skyvern uses Vision LLMs to map visual elements, allowing it to interact with websites without relying on brittle XPath selectors. The execute_workflow tool enables agents to perform complex, multi-step tasks on sites they have never encountered before.

3 toolsSkyvern-AI

LinkedIn MCP Server

95 stars

This server provides granular access to LinkedIn data, including profiles, company posts, and job listings. Tools like search_people and get_job_details are powered by Patchright, ensuring persistent session management for reliable scraping.

7 toolseliasbiondo

Side-by-Side Comparison

ServerStarsToolsTransportAuthor
1Robot Resources Scraper12stdiorobot-resources
2Titan MCP05stdiomrhavens
3Web Fetch07stdioxiaozhuABCD1234
4MCP-Crawl4AI02stdiowyattowalsh
5Playwright Scraper01stdiosudinigoutham
6Spectrawl213stdioFayAndXan
7BrowseGrab68stdioQuartzUnit
8Flyto Core2786stdioflytohub
9Skyvern20.9k3stdioSkyvern-AI
10LinkedIn MCP Server957stdioeliasbiondo

Keep the winning workflow in memory

Find the right server here, then save the docs, prompts, and setup rules in Conare so your agent can reuse them across clients.

Need the old visual installer? Open Conare IDE.
Open Conare