Browser Harness: Giving LLMs raw CDP access to self-correct browser tasks

✍️ OpenClawRadar · 📅 Published: April 24, 2026

Browser Harness is a self-healing harness that lets LLMs carry out browser tasks by connecting directly to Chrome over a CDP (Chrome DevTools Protocol) websocket, in roughly 592 lines of Python. Instead of wrapping Chrome in predefined functions (as Playwright MCP or agent-browser do), it provides a thin daemon that holds the websocket, a few basic helpers, and a SKILL.md file, and lets the agent write its own tools when needed.

How it works

The core concepts are:

A daemon that keeps the CDP websocket alive
Extremely basic tools in helpers.py (~195 lines)
A SKILL.md that explains usage
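
To make the daemon idea concrete, here is a minimal sketch of the message framing such a daemon has to handle. CDP commands are JSON objects with an id, a method, and params, and replies are matched back by id. The CDPSession class and its send_text/recv_text callables are hypothetical names for illustration, not the project's actual code; a real daemon would wire them to a websocket client.

```python
import itertools
import json
from typing import Any, Callable, Dict, Optional


class CDPSession:
    """Hypothetical wrapper: frames CDP commands and matches replies by id."""

    def __init__(self, send_text: Callable[[str], None], recv_text: Callable[[], str]):
        # send_text / recv_text stand in for a websocket's send and receive calls.
        self._send_text = send_text
        self._recv_text = recv_text
        self._ids = itertools.count(1)

    def call(self, method: str, params: Optional[Dict[str, Any]] = None) -> Dict[str, Any]:
        msg_id = next(self._ids)
        self._send_text(json.dumps({"id": msg_id, "method": method, "params": params or {}}))
        while True:
            reply = json.loads(self._recv_text())
            if reply.get("id") != msg_id:
                continue  # an event or an unrelated reply; this sketch simply skips it
            if "error" in reply:
                raise RuntimeError(reply["error"].get("message", "CDP error"))
            return reply.get("result", {})
```

Keeping the transport behind two callables means the framing logic can be exercised without a running browser.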

When the LLM needs a function that doesn't exist, it edits helpers.py and adds it. The authors report a case where the agent wanted to upload a file, found no upload_file() function, wrote one itself using DOM.setFileInputFiles, and continued with the task. The change was visible only in a git diff.
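
As an illustration of what such a self-written helper could look like, the sketch below uses three CDP methods: DOM.getDocument, DOM.querySelector, and DOM.setFileInputFiles. It is not the code the agent actually produced. The send callable (method name and params in, result dict out) and the function signature are assumptions made for this example.

```python
from typing import Any, Callable, Dict, List

# Assumed transport: send(method, params) -> result dict of the CDP reply.
Send = Callable[[str, Dict[str, Any]], Dict[str, Any]]


def upload_file(send: Send, selector: str, paths: List[str]) -> int:
    """Attach local files to the first <input type="file"> matching selector."""
    root_id = send("DOM.getDocument", {})["root"]["nodeId"]
    node_id = send("DOM.querySelector", {"nodeId": root_id, "selector": selector})["nodeId"]
    if not node_id:
        # DOM.querySelector reports nodeId 0 when nothing matches.
        raise LookupError(f"no element matches {selector!r}")
    send("DOM.setFileInputFiles", {"files": paths, "nodeId": node_id})
    return node_id
```

Raising on a missing element, rather than returning quietly, keeps the failure visible to the agent so it can correct course.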

vs. other approaches

Other tools (Playwright MCP, browser-use CLI, agent-browser, Chrome DevTools MCP) wrap Chrome in predefined functions. Their worst failure mode is silent: click() returns success even though nothing happened on the page. Browser Harness instead aims to give the LLM full context and the freedom to self-correct.
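
One way to guard against the silent-click failure is to probe page state before and after the click and compare. The sketch below does that with Input.dispatchMouseEvent and Runtime.evaluate. The click_and_verify name, the send callable, and the probe expression are assumptions for illustration, and a real page may update asynchronously, so an agent would likely need to wait or retry before concluding that nothing happened.

```python
from typing import Any, Callable, Dict

# Assumed transport: send(method, params) -> result dict of the CDP reply.
Send = Callable[[str, Dict[str, Any]], Dict[str, Any]]


def click_and_verify(send: Send, x: float, y: float, probe_js: str) -> bool:
    """Click at (x, y); return True only if the probe expression's value changed."""

    def probe() -> Any:
        reply = send("Runtime.evaluate", {"expression": probe_js, "returnByValue": True})
        return reply["result"].get("value")

    before = probe()
    for event_type in ("mousePressed", "mouseReleased"):
        send(
            "Input.dispatchMouseEvent",
            {"type": event_type, "x": x, "y": y, "button": "left", "clickCount": 1},
        )
    return probe() != before
```

A False result tells the agent that the click had no observable effect, which is the signal a wrapped click() tends to hide.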

Setup and installation

Install with Claude Code or Codex by pasting:

Set up https://github.com/browser-use/browser-harness for me.

The agent reads install.md for bootstrap, then SKILL.md for normal usage, and always checks helpers.py for available functions.
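
Checking helpers.py for available functions can be done without importing the file. The sketch below parses the source with Python's ast module and lists the public top-level functions. The list_helpers name and the convention of skipping underscore-prefixed names are assumptions for this example, not part of the project.

```python
import ast
from typing import List


def list_helpers(source: str) -> List[str]:
    """Return names of public top-level functions defined in the given source."""
    tree = ast.parse(source)
    return [
        node.name
        for node in tree.body
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        and not node.name.startswith("_")
    ]
```

In practice this could be called as list_helpers(open("helpers.py").read()), giving the agent a quick inventory before it decides whether to write a new helper.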

Real examples

Plays Stockfish chess engine
Sets a world record in Tetris
Draws a heart with JavaScript

Contributing

Contributions welcome: add domain skills under domain-skills/ for sites like LinkedIn, Amazon, etc. Skills are generated by the harness, not hand-authored. Bug fixes, docs, and helper improvements also accepted.

License: MIT.

📖 Read the full source: HN LLM Tools
