Browser Harness: Giving LLMs raw CDP access to self-correct browser tasks

Browser Harness is a self-healing harness that gives LLMs complete freedom to complete browser tasks by connecting directly to Chrome via a CDP websocket (~592 lines of Python). Instead of wrapping Chrome in predefined functions (like Playwright MCP or agent-browser), it provides a thin daemon that holds the websocket, basic helpers, and a SKILL.md file — and lets the agent write its own tools when needed.
How it works
The core concepts are:
- A daemon that keeps the CDP websocket alive
- Extremely basic tools in
helpers.py(~195 lines) - A
SKILL.mdthat explains usage
When the LLM needs a function that doesn't exist, it edits helpers.py and adds it. The authors report a case where the agent wanted to upload a file, found no upload_file() function, wrote it itself using DOM.setFileInputFiles, and continued — visible only in a git diff.
vs. other approaches
Other tools (Playwright MCP, browser-use CLI, agent-browser, Chrome DevTools MCP) wrap Chrome in predefined functions. Their worst failure mode is silent: click() returns success but nothing happened. Browser Harness aims to give the LLM perfect context and freedom to self-correct.
Setup and installation
Install with Claude Code or Codex by pasting:
Set up https://github.com/browser-use/browser-harness for me.
The agent reads install.md for bootstrap, then SKILL.md for normal usage, and always checks helpers.py for available functions.
Real examples
- Plays Stockfish chess engine
- Sets a world record in Tetris
- Draws a heart with JavaScript
Contributing
Contributions welcome: add domain skills under domain-skills/ for sites like LinkedIn, Amazon, etc. Skills are generated by the harness, not hand-authored. Bug fixes, docs, and helper improvements also accepted.
License: MIT.
📖 Read the full source: HN LLM Tools
👀 See Also

Launch Engine MCP Server Provides 39-Tool Pipeline for Business Validation
Launch Engine is an MCP server that gives Claude a structured pipeline with 39 interconnected SOP tools organized into 5 layers for taking business ideas from concept to validated revenue. The system includes specialized subagents, prerequisite enforcement, and tools for batch evaluation and rapid testing.

Sandbox0: Open-Source Kubernetes-Native Sandbox Infrastructure for AI Agents
Sandbox0 is an open-source sandbox infrastructure for AI agents built on Kubernetes with persistent storage via JuiceFS and auto-scaling. It addresses limitations like concurrency caps and ephemeral execution found in existing solutions.

ClaudeHive: Web UI for Managing Parallel Claude Code Sessions
ClaudeHive is a web UI that handles parallel Claude Code sessions, allowing users to define prompt templates with placeholders, batch-run them across multiple inputs with configurable concurrency, and review all results in one place. It includes a CLI tool for manager agents to spawn and coordinate worker agents.

Kreuzberg v4.7.0 adds code intelligence for 248 languages and improved markdown extraction
Kreuzberg v4.7.0, a Rust-core document intelligence library, now supports code extraction for 248 formats via tree-sitter and has significantly improved markdown quality with Structural F1 scores over 80% across 23 formats.