Agent Browser Protocol: Open-source Chrome fork for AI agents achieves 90% on Mind2Web benchmark

Agent Browser Protocol: A browser built for AI agents
Agent Browser Protocol (ABP) is an open-source browser fork of Chrome specifically designed for AI agents. The project was created as a challenge to build an agent-centric browser and capture the top score on the Online Mind2Web Benchmark.
Key technical approach
The main insight behind ABP is that AI agents excel at turn-based chat but struggle with continuous time decision making. To maximize LLM performance on browser tasks, ABP converts web browsing into multimodal chat by:
- Freezing JavaScript and time after every action so the webpage is frozen while the agent thinks
- Capturing all relevant events resulting from actions including file pickers, downloads, permission requests, and dialogs
- Returning these events together with a screenshot of the frozen page so the agent can reason holistically about the browser state with full context
Performance and development
ABP achieved 90.53% on the Online Mind2Web Benchmark, holding the top score for two days until GPT-5.4 surpassed it with 92.8%. The developer used Claude Opus-4.5 to work on the project during nights and weekends, completing it within two months.
Practical applications
According to the source, ABP excels at:
- Filing forms
- Online shopping
- Downloading files
- Uploading files
- Ordering takeout
- Reverse engineering a website's undocumented APIs
Getting started
To add ABP to Claude Code:
claude mcp add browser -- npx -y agent-browser-protocol --mcpThen you can give Claude commands like:
Find me kung pao chicken near 415 Mission St, San Francisco on Doordash.Resources
GitHub repository: https://github.com/theredsix/agent-browser-protocol
Benchmark results: https://github.com/theredsix/abp-online-mind2web-results
📖 Read the full source: r/ClaudeAI
👀 See Also

Rever UI Cloner: AI-Optimized Alternative to HTML Scraping for UI Replication
Rever UI Cloner is an API endpoint that provides AI agents with clean design blueprints instead of raw HTML, avoiding context window bloat and UI hallucination issues. It uses the x402 payment protocol requiring a 1 USDC micro-transaction on Base network for agent-to-agent payments.

Claude Code's Plan-Skeptic Sub Agent Identifies Security Gaps in Generated Plans
A developer discovered Claude Code's plan-skeptic sub agent, which identifies gaps and issues in AI-generated development plans, particularly catching security concerns that weren't initially obvious. The agent works alongside the previously known security-sheriff sub agent to improve plan quality.

Prefex: A Local Proxy for Claude Code That Automates Prompt Caching and Session Memory
Prefex is a local proxy that sits between Claude Code and Anthropic's API, automatically injecting the header required for Anthropic's beta prompt caching feature. It also implements session memory to avoid resending full conversation history and includes a model router for cost optimization.

Claude Code v2.1.139 Adds /goal Command for Async Long-Running Tasks
Claude Code v2.1.139 introduces the /goal command, enabling fire-and-forget sessions that run until a completion condition is met, plus a new agents view to monitor active sessions.