Pair Programmer Plugin Adds Live Screen, Voice, and Audio Context to Claude Code

A developer has released Pair Programmer, a plugin that addresses Claude Code's lack of real-time context by providing live desktop perception. The tool captures three data streams: screen content (with visual indexing generating short scene descriptions), microphone input (transcription plus lightweight intent classification for questions, explanations, or commands), and system audio (indexing meetings, tutorials, or other audio playing on the machine).
Architecture and Implementation
The system uses a multi-agent pipeline rather than a single model approach. It runs specialized agents in parallel:
- Screen reader for visual context
- Voice processor for microphone transcription and intent classification
- Audio classifier for system audio
- Orchestrator that correlates all inputs and synthesizes a single response
The plugin is built on VideoDB infrastructure. While indexing currently uses cloud models, the design is model-agnostic—the Index layer can swap in any VLM or LLM. The developer mentions interest in wiring local models for visual description and transcription layers.
Current Status and Installation
The plugin is currently macOS only. Installation requires three commands. The GitHub repository is available at https://github.com/video-db/claude-code/tree/main.
The developer is seeking feedback on architectural approaches, specifically whether developers prefer the multi-agent pipeline with specialized models and orchestration or pushing toward a single model end-to-end solution for desktop perception systems.
📖 Read the full source: r/ClaudeAI
👀 See Also

OpenClaw Codex-GPT5.4 Task Validation Loop Issue
A developer reports Codex-GPT5.4 through OpenClaw gets stuck in a task validation loop during autonomous project work, repeatedly identifying and confirming tasks without executing them. They've implemented workspace controls including TASKS.md, heartbeat rules, and persona files to address the issue.

Local Behavioral Monitoring System with MCP Pipeline and Claude Code
A developer built a local behavioral monitoring system called BRAIN that tracks app switches, file operations, and dev sessions, piping data through a custom MCP server to Claude Code. The system runs 100% locally with zero cloud dependency.

Claude Code v2.1.141: New Environment Variables, Hooks Enhancement, and Bug Fixes
Anthropic released Claude Code v2.1.141 with new environment variables (CLAUDE_CODE_PLUGIN_PREFER_HTTPS, ANTHROPIC_WORKSPACE_ID), terminalSequence field for hooks, agent listing by cwd, and over 20 bug fixes.

Claude AI Product Launch Skill: Structured Playbooks for AI Product Launches
A free Claude skill provides six battle-tested launch playbooks covering strategy, preparation, messaging, and channel execution for AI product launches. The repository includes English and Chinese materials organized by launch stage.