Browser — web automation
The Browser panel lets the agent browse the web inside the desktop app. It opens pages, reads content, fills forms, takes screenshots, and interacts with elements — all visible to you in a side panel.What it is
The Browser is a built-in web browser that the agent can control. It’s not a separate window — it opens as a panel next to the chat, so you can watch the agent browse in real time. The browser is powered by a headless Chromium instance that runs inside the runtime. It supports:- Navigation (open URLs, click links, go back/forward)
- Page reading (extract text, find elements, scrape content)
- Form filling (type into inputs, select options, click buttons)
- Screenshots (capture the full page or a specific element)
- File downloads (save files from the web to your project folder)
- Tab management (open multiple tabs, switch between them)
Opening the Browser
Two approaches:- From the sidebar: Click the Browser icon. A panel opens next to the chat with a URL bar.
- From chat: Ask the agent to browse — “open github.com and find the latest release notes for Headmaster.” The agent opens the URL in the Browser panel and starts working.
What the agent can do
Read a page
Fill a form
Take a screenshot
Search the web
Download a file
Browser panel controls
The Browser panel has:- URL bar — type a URL and press Enter to navigate. The agent sees where you navigated.
- Back / Forward — standard browser navigation.
- Refresh — reload the current page.
- Tabs — open and switch between multiple tabs.
- Screenshot — capture the current page.
- DevTools — open the browser developer tools (for debugging).
Browser and approvals
In Work Along mode, the agent asks for approval before:- Navigating to a new URL.
- Submitting a form.
- Downloading a file.
- Clicking a button that triggers an action.
Anti-detection
The browser uses anti-detection measures to avoid being blocked by websites that detect automated browsers:- Realistic user agent strings.
- Realistic viewport sizes.
- Human-like interaction delays.
- Canvas and WebGL fingerprint randomization.
Browser vs. web search tool
Headmaster has two web-related capabilities:| Feature | What it does | When to use |
|---|---|---|
| Web search tool | Runs a search query and returns text results. Fast, no visual. | ”Search for X and summarize” |
| Browser panel | Opens a real browser, navigates pages, fills forms, screenshots. Visual, interactive. | ”Open this URL”, “Fill this form”, “Take a screenshot” |