10 servers curated

Supercharge Your AI Agents with These Web Scraping MCP Servers

Web scraping for AI agents is notoriously difficult due to dynamic JavaScript rendering, anti-bot protections, and the sheer volume of noise in modern HTML. Developers often struggle with brittle selectors, high token costs from bloated DOM structures, and the complexity of maintaining headless browser sessions across different environments.

Model Context Protocol (MCP) servers solve these issues by standardizing how agents interact with the web. By abstracting browser automation into a set of consistent tools, these servers allow agents to handle navigation, content extraction, and data cleaning without requiring custom integration logic for every target site.

When selecting an MCP server, prioritize those that offer token-efficient output, such as markdown conversion, and robust fallback mechanisms for headless rendering. Consider whether your use case requires simple data retrieval or complex, multi-step browser interactions, and always verify the server's ability to handle session persistence if you are targeting authenticated platforms.

Also Worth Trying

MCP-Crawl4AI

0 stars

Built on the Crawl4AI engine, this server is designed for deep site traversal and session-aware workflows. It uses the crawl tool to provide LLM-ready output, managing headless Chromium as a singleton to ensure efficient resource usage during long-running scraping tasks.

2 toolswyattowalsh

Playwright Scraper

0 stars

This server leverages Playwright to handle complex, JS-rendered websites that standard requests cannot parse. The scrape_to_markdown tool uses BeautifulSoup to clean up the DOM, ensuring the agent receives high-quality, readable content rather than raw, messy HTML.

1 toolssudinigoutham

Spectrawl

21 stars

Spectrawl is a robust solution for scraping protected sites, featuring built-in CAPTCHA solving and stealth browsing via Camoufox. Its tools, including deepSearch and browse, are backed by a wide range of platform-specific adapters, making it highly effective for difficult-to-access web data.

3 toolsFayAndXan

BrowseGrab

6 stars

Designed specifically for local LLMs, BrowseGrab uses accessibility trees to keep token usage minimal. Tools like browser_extract_content and browser_snapshot allow for reliable interaction with stable element references, reducing the number of LLM calls needed to navigate and scrape a page.

8 toolsQuartzUnit

Flyto Core

278 stars

Flyto Core provides a highly debuggable environment where every step can be traced and replayed. With tools like browser.evaluate and browser.screenshot, it is the best choice for developers who need to audit agent behavior or ensure consistent, repeatable scraping results.

6 toolsflytohub

Skyvern

20.9k stars

Skyvern moves beyond traditional selectors by using Vision LLMs to interact with websites as a human would. Its execute_workflow tool allows agents to navigate complex, unfamiliar sites by mapping visual elements, making it highly resistant to layout changes that break standard scrapers.

3 toolsSkyvern-AI

LinkedIn MCP Server

95 stars

This server provides specialized tools for scraping professional data, including get_person_profile and search_jobs. It manages persistent sessions via Patchright, allowing agents to perform granular searches and extract structured information from LinkedIn without manual intervention.

7 toolseliasbiondo

Side-by-Side Comparison

ServerStarsToolsTransportAuthor
1Robot Resources Scraper12stdiorobot-resources
2Titan MCP05stdiomrhavens
3Web Fetch07stdioxiaozhuABCD1234
4MCP-Crawl4AI02stdiowyattowalsh
5Playwright Scraper01stdiosudinigoutham
6Spectrawl213stdioFayAndXan
7BrowseGrab68stdioQuartzUnit
8Flyto Core2786stdioflytohub
9Skyvern20.9k3stdioSkyvern-AI
10LinkedIn MCP Server957stdioeliasbiondo

Keep the winning workflow in memory

Find the right server here, then save the docs, prompts, and setup rules in Conare so your agent can reuse them across clients.

Open Conare