Qure: Desktop App for Generating E2E Tests from Recorded Browser Flows

Qure is a desktop application built at JetBrains that's currently in closed beta. It generates end-to-end web test code from recordings made in its built-in browser.
How It Works
Instead of describing test flows in text for AI agents, you record your manual QA scenarios by interacting with your product in Qure's built-in browser. The AI then turns that recording into test code.
Key differentiators from typical AI test generation:
- No prompt engineering required
- No MCP setup needed
- No explaining your repository structure in chat - you point it at your project and go
- The AI matches recordings against your existing codebase to find page objects, helpers, and constants
- When tests fail, the agent reads the real failure output and fixes issues using the actual error and application context
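To make the idea of matching a recording against existing page objects more concrete, here is a minimal sketch in TypeScript. It is not Qure's actual algorithm, which the source does not describe. The `RecordedStep` and `KnownHelper` types, the `loginPage` helper names, and the selector-based lookup are all hypothetical and exist only for illustration. The sketch prefers a known page-object helper when a recorded step targets a selector the project already wraps, and falls back to a raw Playwright locator call otherwise.

```typescript
// Hypothetical shapes: none of these names come from Qure itself.
type RecordedStep =
  | { kind: "goto"; url: string }
  | { kind: "fill"; selector: string; value: string }
  | { kind: "click"; selector: string };

// A helper assumed to exist in the project's page objects,
// keyed by the selector it wraps.
interface KnownHelper {
  selector: string;
  call: (value?: string) => string; // returns the source line to emit
}

// Quote a string so it is safe to embed in generated source.
const q = (s: string): string => JSON.stringify(s);

function emitStep(step: RecordedStep, helpers: KnownHelper[]): string {
  if (step.kind === "goto") {
    return `await page.goto(${q(step.url)});`;
  }
  // Prefer an existing page-object helper when one wraps this selector.
  const helper = helpers.find((h) => h.selector === step.selector);
  if (helper) {
    return helper.call(step.kind === "fill" ? step.value : undefined);
  }
  // Otherwise fall back to a plain Playwright locator call.
  return step.kind === "fill"
    ? `await page.locator(${q(step.selector)}).fill(${q(step.value)});`
    : `await page.locator(${q(step.selector)}).click();`;
}

function generateTestBody(steps: RecordedStep[], helpers: KnownHelper[]): string[] {
  return steps.map((step) => emitStep(step, helpers));
}

// Illustrative input: a made-up login recording and two made-up helpers.
const knownHelpers: KnownHelper[] = [
  { selector: "#email", call: (v) => `await loginPage.enterEmail(${q(v ?? "")});` },
  { selector: "#submit", call: () => "await loginPage.submit();" },
];

const recording: RecordedStep[] = [
  { kind: "goto", url: "https://example.test/login" },
  { kind: "fill", selector: "#email", value: "qa@example.test" },
  { kind: "fill", selector: "#password", value: "secret" },
  { kind: "click", selector: "#submit" },
];

const generatedLines = generateTestBody(recording, knownHelpers);
```

In this sketch the email and submit steps reuse the assumed `loginPage` helpers, while the password step, which no helper wraps, becomes a direct `page.locator(...).fill(...)` call. A real tool would presumably need far richer matching than exact selector equality.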
Additional Features
Beyond recording, you can also:
- Refactor existing tests
- Update tests
- Write new tests from descriptions
Technical Details
The tool is web-only and works best with Playwright. According to the developers, if your project only has a few dozen tests, Claude Code might be sufficient. Qure makes more of a difference on larger codebases with existing test infrastructure.
The developers note that this is an experimental product in closed beta, and they welcome questions about where the approach breaks down or falls short.
📖 Read the full source: HN AI Agents