Google DeepMind AI Pointer: Gemini Mouse for Contextual Commands

Google DeepMind has unveiled the AI-enabled pointer, a prototype that augments the traditional mouse cursor with Gemini-powered context awareness. The core idea: instead of dragging content into an AI tool's window, users can point at anything on screen and issue a natural-language command (e.g., point at a building image and say “Show me directions”). The AI understands both the visual and semantic context, treating pixels as actionable entities (places, dates, objects).

Four Interaction Principles

Maintain the flow: AI works across all apps, not in a separate window. Examples: point at a PDF and ask for a bullet-point summary to paste into an email; hover over a table and request a pie chart; highlight a recipe and say “double all ingredients.”
Show and tell: The pointer captures visual+semantic context, so you don't need a detailed prompt. Just point, and the AI knows which word, paragraph, image part, or code block is relevant.
Embrace the power of 'This' and 'That': Use natural shorthand like “Fix this,” “Move that here,” or “What does this mean?”—the AI combines gesture, context, and speech to infer intent.
Turn pixels into actionable entities: A photo of a scribbled note becomes an interactive to-do list; a paused frame in a travel video becomes a booking link for the shown restaurant.

Integration in Products

DeepMind is rolling out these capabilities in two places:

Chrome (Gemini integration): Point at part of a webpage and ask Gemini about it. Example: select a few products and ask to compare them, or point to where you want to visualize a new couch.
Googlebook (Magic Pointer): A forthcoming feature for the Googlebook laptop that puts Gemini “at your fingertips” for intuitive interactions.

Experimental demos are also available in Google AI Studio for editing images or finding places on a map by pointing and speaking. The team is also testing future concepts via Google Labs’ Disco platform.

Who it's for: Developers building AI-agent interfaces, UX researchers, and anyone working on human-AI interaction patterns.

📖 Read the full source: HN AI Agents

Google DeepMind's AI Pointer: Reimagining the Mouse for Gemini Interactions

Four Interaction Principles

Integration in Products

👀 See Also

Claude AI credited in macOS Tahoe 26.5 update release notes

LLM Spatial Reasoning Tested: Sokoban Benchmark Shows ChatGPT, Qwen3.7-max, Gemini 3.5-thinking Lead

Open-source models match or beat Claude Opus 4.6 on benchmarks

Project Health Check: Bus Factor and Commit Activity Across Claw/Assistant Repos