Browser Harness: Giving LLMs raw CDP access to self-correct browser tasks

✍️ OpenClawRadar📅 Published: April 24, 2026🔗 Source
Browser Harness: Giving LLMs raw CDP access to self-correct browser tasks
Ad

Browser Harness is a self-healing harness that gives LLMs complete freedom to complete browser tasks by connecting directly to Chrome via a CDP websocket (~592 lines of Python). Instead of wrapping Chrome in predefined functions (like Playwright MCP or agent-browser), it provides a thin daemon that holds the websocket, basic helpers, and a SKILL.md file — and lets the agent write its own tools when needed.

How it works

The core concepts are:

  • A daemon that keeps the CDP websocket alive
  • Extremely basic tools in helpers.py (~195 lines)
  • A SKILL.md that explains usage

When the LLM needs a function that doesn't exist, it edits helpers.py and adds it. The authors report a case where the agent wanted to upload a file, found no upload_file() function, wrote it itself using DOM.setFileInputFiles, and continued — visible only in a git diff.

Ad

vs. other approaches

Other tools (Playwright MCP, browser-use CLI, agent-browser, Chrome DevTools MCP) wrap Chrome in predefined functions. Their worst failure mode is silent: click() returns success but nothing happened. Browser Harness aims to give the LLM perfect context and freedom to self-correct.

Setup and installation

Install with Claude Code or Codex by pasting:

Set up https://github.com/browser-use/browser-harness for me.

The agent reads install.md for bootstrap, then SKILL.md for normal usage, and always checks helpers.py for available functions.

Real examples

  • Plays Stockfish chess engine
  • Sets a world record in Tetris
  • Draws a heart with JavaScript

Contributing

Contributions welcome: add domain skills under domain-skills/ for sites like LinkedIn, Amazon, etc. Skills are generated by the harness, not hand-authored. Bug fixes, docs, and helper improvements also accepted.

License: MIT.

📖 Read the full source: HN LLM Tools

Ad

👀 See Also

Launch Engine MCP Server Provides 39-Tool Pipeline for Business Validation
Tools

Launch Engine MCP Server Provides 39-Tool Pipeline for Business Validation

Launch Engine is an MCP server that gives Claude a structured pipeline with 39 interconnected SOP tools organized into 5 layers for taking business ideas from concept to validated revenue. The system includes specialized subagents, prerequisite enforcement, and tools for batch evaluation and rapid testing.

OpenClawRadar
Sandbox0: Open-Source Kubernetes-Native Sandbox Infrastructure for AI Agents
Tools

Sandbox0: Open-Source Kubernetes-Native Sandbox Infrastructure for AI Agents

Sandbox0 is an open-source sandbox infrastructure for AI agents built on Kubernetes with persistent storage via JuiceFS and auto-scaling. It addresses limitations like concurrency caps and ephemeral execution found in existing solutions.

OpenClawRadar
ClaudeHive: Web UI for Managing Parallel Claude Code Sessions
Tools

ClaudeHive: Web UI for Managing Parallel Claude Code Sessions

ClaudeHive is a web UI that handles parallel Claude Code sessions, allowing users to define prompt templates with placeholders, batch-run them across multiple inputs with configurable concurrency, and review all results in one place. It includes a CLI tool for manager agents to spawn and coordinate worker agents.

OpenClawRadar
Kreuzberg v4.7.0 adds code intelligence for 248 languages and improved markdown extraction
Tools

Kreuzberg v4.7.0 adds code intelligence for 248 languages and improved markdown extraction

Kreuzberg v4.7.0, a Rust-core document intelligence library, now supports code extraction for 248 formats via tree-sitter and has significantly improved markdown quality with Structural F1 scores over 80% across 23 formats.

OpenClawRadar