Browser Automation MCP Server

Local setup required. This server has to be cloned and prepared on your machine before you register it in Claude Code.
1

Set the server up locally

Run this once to clone and prepare the server before adding it to Claude Code.

Run in terminal
npm install
npm run build
2

Register it in Claude Code

After the local setup is done, run this command to point Claude Code at the built server.

Run in terminal
claude mcp add browser-automation -- node "<FULL_PATH_TO_MCP>/dist/index.js"

Replace <FULL_PATH_TO_MCP>/dist/index.js with the actual folder you prepared in step 1.

README.md

Browser-driven automation, HTTP API health checks, and visual regression testing.

Browser Automation MCP Server

This TypeScript project spins up a Model Context Protocol (MCP) server that exposes three tooling primitives tailored for automation: browser-driven flows, API health checks, and visual comparisons. It listens on STDIO, so any MCP-aware agent (Claude, Cursor, LangChain, etc.) can start the Node process and negotiate tools/list + tools/call automatically.

Setup

  1. npm install
  2. npm run build

Running the server

Execute npm run start (or node build/server.js directly). The server already binds to STDIO, so configure your MCP client to launch the same command and keep the subprocess running while the agent is issuing tool requests.

Tool catalogue

Tool Description Inputs Outputs
run-browser-automation Navigates a Chromium tab, runs scripted actions (click/fill/assert), and optionally returns a final screenshot. url (required), actions array (navigate/click/fill/type/press/assert/wait), viewport/device scale, finalScreenshot flag. steps array, summary, optional screenshot (data:image/png;base64,...).
run-api-check Hits any HTTP endpoint to verify status and capture response snippets. url (required), HTTP method, headers, body, expected status. HTTP status, comparison to expectStatus, response snippet, response headers.
compare-page-screenshots Captures two pages at the same viewport and runs pixelmatch to flag visual regressions. baselineUrl, targetUrl, optional viewport/threshold. Pixel mismatch ratio, diff image (base64 PNG), baseline dimensions.

Example tool call payloads

Browser automation

{
  "url": "https://example.com/login",
  "actions": [
    {"type": "fill", "selector": "#username", "value": "test"},
    {"type": "fill", "selector": "#password", "value": "secret"},
    {"type": "click", "selector": "#submit"},
    {"type": "waitForSelector", "selector": "#welcome"}
  ],
  "finalScreenshot": true
}

Image comparison

{
  "baselineUrl": "https://example.com/home?baseline=1",
  "targetUrl": "https://example.com/home?baseline=2",
  "threshold": 0.04
}

Debugging & exploration

Use the MCP Inspector to validate the server before wiring it to a client. Run npx @modelcontextprotocol/inspector node build/server.js, open the Inspector UI, and review the tools tab, send sample arguments, and inspect logs. The Inspector also proxies your server so you can replicate production-style connections locally.

Tools (3)

run-browser-automationNavigates a Chromium tab, runs scripted actions, and optionally returns a final screenshot.
run-api-checkHits any HTTP endpoint to verify status and capture response snippets.
compare-page-screenshotsCaptures two pages at the same viewport and runs pixelmatch to flag visual regressions.

Configuration

claude_desktop_config.json
{"mcpServers": {"browser-automation": {"command": "node", "args": ["/path/to/build/server.js"]}}}

Try it

Navigate to https://example.com/login, fill in the username and password fields, click submit, and take a screenshot of the result.
Check if the API endpoint at https://api.example.com/health returns a 200 status code.
Compare the visual layout of the homepage between the production URL and the staging URL to check for regressions.
Automate a sequence to click the 'Add to Cart' button on the product page and verify the cart count updates.

Frequently Asked Questions

What are the key features of Browser Automation MCP Server?

Browser-driven automation using Chromium. Scripted UI actions including click, fill, type, and press. HTTP API health check verification. Visual regression testing with pixel comparison. Support for capturing screenshots during automation.

What can I use Browser Automation MCP Server for?

Automating end-to-end testing for web applications. Monitoring the availability and response of critical HTTP APIs. Detecting visual UI regressions after code deployments. Scripting repetitive web interactions for data entry or verification.

How do I install Browser Automation MCP Server?

Install Browser Automation MCP Server by running: npm install && npm run build

What MCP clients work with Browser Automation MCP Server?

Browser Automation MCP Server works with any MCP-compatible client including Claude Desktop, Claude Code, Cursor, and other editors with MCP support.

Turn this server into reusable context

Keep Browser Automation MCP Server docs, env vars, and workflow notes in Conare so your agent carries them across sessions.

Need the old visual installer? Open Conare IDE.
Open Conare