MCP Desktop Tools MCP Server

An MCP server that gives Claude desktop automation capabilities

README.md

MCP Desktop Tools

An MCP server that gives Claude desktop automation capabilities — browser control, screenshots, mouse/keyboard input, window management, and clipboard access.

Built with TypeScript, Playwright, and native Windows APIs.

Tools

Browser

Tool Description
browser_open Launch Chromium and navigate to a URL
browser_navigate Navigate to a URL with configurable wait conditions
browser_click Click elements by CSS selector
browser_type Type into input fields, optionally clear or press Enter
browser_read Read page content (text, HTML, title, URL, or specific elements)
browser_screenshot Capture viewport or full-page screenshots
browser_close Close the browser

Screenshots

Tool Description
screenshot_fullscreen Capture entire screen (multi-monitor supported)
screenshot_region Capture a rectangular region by coordinates
screenshot_window Capture a specific window by title (partial match)

Desktop

Tool Description
desktop_mouse_click Click at screen coordinates
desktop_mouse_move Move cursor (instant or smooth animation)
desktop_keyboard_type Type text via simulated keystrokes
desktop_keyboard_hotkey Press keyboard shortcuts (e.g. ctrl+c, alt+tab)
desktop_window_list List all visible windows with positions and sizes
desktop_window_focus Focus a window by title
desktop_window_resize Move and/or resize a window
desktop_app_launch Launch apps by path, name, or URI
desktop_clipboard_read Read clipboard text
desktop_clipboard_write Write text to clipboard

Setup

npm install
npm run build
npx playwright install chromium

Claude Code Configuration

Add to your Claude Code MCP settings (~/.claude/settings.json):

{
  "mcpServers": {
    "desktop-tools": {
      "command": "node",
      "args": ["C:/Users/<you>/mcp-desktop-tools/dist/index.js"]
    }
  }
}

Restart Claude Code to pick up the new server.

Requirements

  • Windows 10/11
  • Node.js 18+
  • PowerShell (used for native window/mouse/keyboard operations)

License

MIT

Tools 20

browser_openLaunch Chromium and navigate to a URL
browser_navigateNavigate to a URL with configurable wait conditions
browser_clickClick elements by CSS selector
browser_typeType into input fields, optionally clear or press Enter
browser_readRead page content (text, HTML, title, URL, or specific elements)
browser_screenshotCapture viewport or full-page screenshots
browser_closeClose the browser
screenshot_fullscreenCapture entire screen (multi-monitor supported)
screenshot_regionCapture a rectangular region by coordinates
screenshot_windowCapture a specific window by title (partial match)
desktop_mouse_clickClick at screen coordinates
desktop_mouse_moveMove cursor (instant or smooth animation)
desktop_keyboard_typeType text via simulated keystrokes
desktop_keyboard_hotkeyPress keyboard shortcuts (e.g. ctrl+c, alt+tab)
desktop_window_listList all visible windows with positions and sizes
desktop_window_focusFocus a window by title
desktop_window_resizeMove and/or resize a window
desktop_app_launchLaunch apps by path, name, or URI
desktop_clipboard_readRead clipboard text
desktop_clipboard_writeWrite text to clipboard

Try it

Open a browser to google.com and search for the latest news on MCP servers.
Take a screenshot of the currently active window and save it.
List all open windows and focus the one titled 'Notepad'.
Copy the text currently in my clipboard and summarize it for me.
Launch the Calculator app and type 123 + 456 using the keyboard tools.

Frequently Asked Questions

What are the key features of MCP Desktop Tools?

Full browser automation using Playwright. Multi-monitor screenshot capabilities. Native Windows mouse and keyboard input simulation. Window management including listing, focusing, and resizing. System clipboard read and write access.

What can I use MCP Desktop Tools for?

Automating repetitive web-based data entry tasks on Windows. Creating visual documentation by capturing specific application windows. Managing complex multi-window workflows via natural language commands. Integrating clipboard data directly into Claude's analysis process. Launching and interacting with local desktop applications programmatically.

How do I install MCP Desktop Tools?

Install MCP Desktop Tools by running: npm install && npm run build && npx playwright install chromium

What MCP clients work with MCP Desktop Tools?

MCP Desktop Tools works with any MCP-compatible client including Claude Desktop, Claude Code, Cursor, and other editors with MCP support.

Turn this server into reusable context

Keep MCP Desktop Tools docs, env vars, and workflow notes in Conare so your agent carries them across sessions.

Open Conare