Google DeepMind's AI Pointer: Reimagining the Mouse for Gemini Interactions
Google DeepMind has unveiled the AI-enabled pointer, a prototype that augments the traditional mouse cursor with Gemini-powered context awareness. The core idea: instead of dragging content into an AI tool's window, users can point at anything on screen and issue a natural-language command (e.g., point at a building image and say “Show me directions”). The AI understands both the visual and semantic context, treating pixels as actionable entities (places, dates, objects).
Four Interaction Principles
- Maintain the flow: AI works across all apps, not in a separate window. Examples: point at a PDF and ask for a bullet-point summary to paste into an email; hover over a table and request a pie chart; highlight a recipe and say “double all ingredients.”
- Show and tell: The pointer captures visual+semantic context, so you don't need a detailed prompt. Just point, and the AI knows which word, paragraph, image part, or code block is relevant.
- Embrace the power of 'This' and 'That': Use natural shorthand like “Fix this,” “Move that here,” or “What does this mean?”—the AI combines gesture, context, and speech to infer intent.
- Turn pixels into actionable entities: A photo of a scribbled note becomes an interactive to-do list; a paused frame in a travel video becomes a booking link for the shown restaurant.
Integration in Products
DeepMind is rolling out these capabilities in two places:
- Chrome (Gemini integration): Point at part of a webpage and ask Gemini about it. Example: select a few products and ask to compare them, or point to where you want to visualize a new couch.
- Googlebook (Magic Pointer): A forthcoming feature for the Googlebook laptop that puts Gemini “at your fingertips” for intuitive interactions.
Experimental demos are also available in Google AI Studio for editing images or finding places on a map by pointing and speaking. The team is also testing future concepts via Google Labs’ Disco platform.
Who it's for: Developers building AI-agent interfaces, UX researchers, and anyone working on human-AI interaction patterns.
📖 Read the full source: HN AI Agents
👀 See Also

M5 Max vs M3 Max Inference Benchmarks for Qwen Models on oMLX
Benchmarks comparing M5 Max and M3 Max MacBook Pros running Qwen 3.5 models via oMLX v0.2.23 show M5 Max delivering 1.4-1.7x faster token generation and up to 4x faster prefill at long contexts.

Claude Code Opus Fails with Rate Limit Error Despite Available Weekly Capacity
A Claude Max subscriber reports that Claude Code Opus returns 'API Error: Rate limit reached' even though their usage dashboard shows 97% of their weekly 'All models' capacity remains unused. The issue occurs specifically in Claude Code while Opus works normally on claude.ai from the same account.

Anthropic Secures 300MW Compute at Colossus 1 with 220,000 NVIDIA GPUs via SpaceX Partnership
Anthropic announced a partnership with SpaceX to use all compute capacity at the Colossus 1 data center, gaining over 300MW and more than 220,000 NVIDIA GPUs within a month.

User Reports Sonnet 4.6 Outperforms Opus 4.6 for Practical Coding Tasks
A developer testing Claude AI models found that Opus 4.6 produced over-engineered solutions with performance gaps, while Sonnet 4.6 delivered more careful, efficient fixes with lower token usage.