Skyvern MCP Server

Local setup required. This server has to be cloned and prepared on your machine before you register it in Claude Code.
1

Set the server up locally

Run this once to clone and prepare the server before adding it to Claude Code.

Run in terminal
pip install skyvern
skyvern quickstart
2

Register it in Claude Code

After the local setup is done, run this command to point Claude Code at the built server.

Run in terminal
claude mcp add skyvern -- node "<FULL_PATH_TO_SKYVERN>/dist/index.js"

Replace <FULL_PATH_TO_SKYVERN>/dist/index.js with the actual folder you prepared in step 1.

README.md

Automate Browser-based workflows using LLMs and Computer Vision

<source media="(prefers-color-scheme: dark)" srcset="fern/images/skyvern_logo.png"/>

šŸ‰ Automate Browser-based workflows using LLMs and Computer Vision šŸ‰

Skyvern automates browser-based workflows using LLMs and computer vision. It provides a Playwright-compatible SDK that adds AI functionality on top of playwright, as well as a no-code workflow builder to help both technical and non-technical users automate manual workflows on any website, replacing brittle or unreliable automation solutions.

Traditional approaches to browser automations required writing custom scripts for websites, often relying on DOM parsing and XPath-based interactions which would break whenever the website layouts changed.

Instead of only relying on code-defined XPath interactions, Skyvern relies on Vision LLMs to learn and interact with the websites.

How it works

Skyvern was inspired by the Task-Driven autonomous agent design popularized by BabyAGI and AutoGPT -- with one major bonus: we give Skyvern the ability to interact with websites using browser automation libraries like Playwright.

Skyvern uses a swarm of agents to comprehend a website, and plan and execute its actions:

This approach has a few advantages:

  1. Skyvern can operate on websites it's never seen before, as it's able to map visual elements to actions necessary to complete a workflow, without any customized code
  2. Skyvern is resistant to website layout changes, as there are no pre-determined XPaths or other selectors our system is looking for while trying to navigate
  3. Skyvern is able to take a single workflow and apply it to a large number of websites, as it's able to reason through the interactions necessary to complete the workflow A detailed technical report can be found here.

Demo

https://github.com/user-attachments/assets/5cab4668-e8e2-4982-8551-aab05ff73a7f

Quickstart

Skyvern Cloud

Skyvern Cloud is a managed cloud version of Skyvern that allows you to run Skyvern without worrying about the infrastructure. It allows you to run multiple Skyvern instances in parallel and comes bundled with anti-bot detection mechanisms, proxy network, and CAPTCHA solvers.

If you'd like to try it out, navigate to app.skyvern.com and create an account.

Run Locally (UI + Server)

Choose your preferred setup method:

Option A: pip install (Recommended)

Dependencies needed:

Additionally, for Windows:

  • Rust
  • VS Code with C++ dev tools and Windows SDK
1. Install Skyvern
pip install skyvern
2. Run Skyvern
skyvern quickstart

Option B: Docker Compose

  1. Install Docker Desktop
  2. Clone the repository:
    git clone https://github.com/skyvern-ai/skyvern.git && cd skyvern
    
  3. Run quickstart with Docker Compose:
    pip install skyvern && skyvern quickstart
    
    When prompted, choose "Docker Compose"

Tools (3)

navigate_to_urlInstruct the browser to navigate to a specific website URL.
execute_workflowRun a multi-step automation workflow on a website using natural language instructions.
extract_dataExtract structured data from the current page based on provided schema.

Environment Variables

SKYVERN_API_KEYAPI key for accessing Skyvern cloud services if applicable.

Configuration

claude_desktop_config.json
{"mcpServers": {"skyvern": {"command": "skyvern", "args": ["--mcp"]}}}

Try it

→Navigate to the company website and extract the latest press release titles.
→Fill out the contact form on this page with my provided details and submit it.
→Automate the process of logging into the portal and downloading the monthly invoice.
→Find the pricing page on this website and summarize the different subscription tiers.

Frequently Asked Questions

What are the key features of Skyvern?

Uses Vision LLMs to interact with websites without brittle XPath selectors. Supports multi-step automation workflows via natural language. Resistant to website layout changes through visual element mapping. Playwright-compatible SDK for browser-based automation. Capable of operating on websites it has never seen before.

What can I use Skyvern for?

Automating data entry into legacy web portals that lack APIs. Extracting structured data from complex, dynamic websites for research. Performing repetitive multi-step tasks like invoice downloading or form submission. Monitoring website changes or updates across multiple domains.

How do I install Skyvern?

Install Skyvern by running: pip install skyvern && skyvern quickstart

What MCP clients work with Skyvern?

Skyvern works with any MCP-compatible client including Claude Desktop, Claude Code, Cursor, and other editors with MCP support.

Turn this server into reusable context

Keep Skyvern docs, env vars, and workflow notes in Conare so your agent carries them across sessions.

Need the old visual installer? Open Conare IDE.
Open Conare