Browser-Use Agent

Any AI agent that can control a Chromium browser can use espctl through esphome.cloud/mcp/esp-idf without installing anything: no binary, no package, no PATH changes.

This page covers setup for agents like browser-use, computer-use, or any framework that drives a browser via CDP, Playwright, or Puppeteer.

Requirements

Requirement	Details
Browser	Chrome, Edge, or another Chromium-based browser
Agent capability	Can navigate to a URL, click elements, read text
Network	HTTPS access to `esphome.cloud`
Install	Nothing — that’s the point

Configuration

No MCP server config needed. The agent opens a browser tab instead of running a binary. Point your agent at:

https://esphome.cloud/mcp/esp-idf

If your agent framework has a “start URL” or “initial page” setting, use that URL. If it needs a task description, tell it:

Open https://esphome.cloud/mcp/esp-idf in Chrome. Sign in if prompted. Click Connect. Then follow the build instructions.
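
If the same agent is reused for different targets, the task description can be generated rather than hard-coded. The following is a minimal sketch, not part of espctl: `build_task_prompt` is a hypothetical helper, and the prompt wording beyond the sentence above is illustrative. Only the URL and the UI step names come from this page.

```python
MCP_URL = "https://esphome.cloud/mcp/esp-idf"


def build_task_prompt(target: str = "esp32", build_type: str = "release") -> str:
    """Compose a natural-language task for a browser-driving agent.

    Hypothetical helper: the parameter names and defaults are assumptions.
    """
    if build_type not in ("release", "debug"):
        raise ValueError(f"build_type must be 'release' or 'debug', got {build_type!r}")
    return (
        f"Open {MCP_URL} in Chrome. Sign in if prompted. Click Connect. "
        f"Pick target chip {target} and build type {build_type}, then click Build "
        "and wait until the status is succeeded or failed."
    )
```

The returned string can be passed to whatever "task" or "instruction" field your agent framework exposes.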

Build flow

The agent follows this sequence in the browser:

Navigate to esphome.cloud/mcp/esp-idf.
Sign in if a sign-in prompt appears.
Click Connect — wait for the green dot.
Pick the target chip from the dropdown (esp32, esp32s3, …).
Pick the IDF version (optional; the default works).
Pick the build type (release or debug).
Click Build — logs scroll live in the panel below.
Wait for the build to finish (status changes to succeeded or failed).
Read results — click Size Report, SBOM, or Diagnostics for post-build analysis.
Download firmware — click the download icon on the firmware card.
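
The "wait for the build to finish" step is the one most likely to need explicit handling in a scripted agent. Below is a minimal polling sketch under stated assumptions: `read_status` is a hypothetical callable that you implement with your own browser driver (CDP, Playwright, Puppeteer) to return the status text shown on the page, and the timeout and poll interval are arbitrary defaults, not values documented for the service.

```python
import time
from typing import Callable

# Terminal states named on this page; any other status text is treated as "still running".
TERMINAL_STATES = {"succeeded", "failed"}


def wait_for_build(
    read_status: Callable[[], str],
    timeout_s: float = 900.0,   # assumed default, tune for your project
    poll_s: float = 2.0,        # assumed default
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Poll the page status until the build succeeds, fails, or times out."""
    deadline = time.monotonic() + timeout_s
    while True:
        status = read_status().strip().lower()
        if status in TERMINAL_STATES:
            return status
        if time.monotonic() >= deadline:
            raise TimeoutError(f"build still {status!r} after {timeout_s} s")
        sleep(poll_s)
```

Injecting `read_status` keeps the loop independent of any particular automation library, so it can be exercised with a fake reader before it is wired to a real page.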

Flash flow (optional)

If the agent has access to a USB-connected ESP device:

Switch to the Flash tab.
Click Connect — pick the USB device from the port list.
Click Flash.

Note: Web Serial requires the browser to have USB access. If the agent runs in a headless or sandboxed environment, flashing may not be available.

Monitor flow (optional)

The Monitor tab works without signing in or connecting to the build server:

Switch to the Monitor tab.
Click Open Monitor — pick the USB device.
Pick a baud rate (the default is 115200).
Read serial output.

Check it’s working

After the agent navigates to the page and clicks Connect, it should see:

A green connection indicator
A Tools Inspector panel listing available tools
The Build configuration controls (target, version, build type)

If the agent sees a sign-in prompt instead, it needs to complete sign-in first.
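
An agent can apply the check above to the text it reads from the page. The sketch below is illustrative only: `classify_page` is a hypothetical helper, and the marker strings are assumptions about the visible text ("Tools Inspector" is the panel name used on this page; the sign-in phrases are guesses and may need adjusting).

```python
# Assumed marker strings; verify them against the live page before relying on them.
TOOLS_PANEL_MARKER = "tools inspector"
SIGN_IN_MARKERS = ("sign in", "sign-in", "log in")


def classify_page(visible_text: str) -> str:
    """Return 'connected', 'needs_sign_in', or 'not_connected' from page text."""
    text = visible_text.lower()
    if TOOLS_PANEL_MARKER in text:
        return "connected"
    if any(marker in text for marker in SIGN_IN_MARKERS):
        return "needs_sign_in"
    return "not_connected"
```

A result of `needs_sign_in` means the agent should complete sign-in and click Connect again; `not_connected` usually means Connect has not been clicked yet or the connection is still being established.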

Compared to local MCP

	This (browser MCP)	Local MCP
Config needed	Just the URL	JSON in settings file + binary install
Agent capability	Browser control	Shell command execution
Works without install	Yes	No
Tools available	All 40, plus browser extras	All 40

Use browser MCP when your agent can’t install binaries. Use local MCP when your agent has shell access and you want tighter, protocol-level MCP integration.

ESP-IDF MCP — User Manual