Subquadratic Debuts 12M Token Context Window for AI Models

Subquadratic has announced a 12-million-token context window, claiming a breakthrough in subquadratic attention mechanisms. This compares to typical 128K-1M token windows in current models. The technique allows models to handle vastly larger contexts without quadratic scaling of compute or memory.
Key Details
- Context window: 12 million tokens (12x larger than GPT-4's 128K tokens)
- Based on subquadratic attention, likely using linear or near-linear complexity in sequence length
- Enables processing entire large codebases, long documents, or multi-hour video transcripts in a single forward pass
- Potential applications: code review of entire repos, long-document analysis, multi-turn dialog with full history
- Compatible with existing transformer-based LLMs via drop-in attention replacement
The approach reduces O(n²) attention to near-O(n) using techniques like state-space models or low-rank factorizations. No specific benchmark numbers are provided in the source, but the claim is that this makes 12M-token windows practical on a single GPU.
Who It's For
AI engineers working on code analysis, document processing, or any task requiring long-context understanding without expensive chunking or retrieval.
📖 Read the full source: HN AI Agents
👀 See Also
Opus 4.7 Reasoning Effort Benchmark: Medium Beats High and Max on Real Tasks
In 29 tasks from GraphQL-go-tools repo, Opus 4.7 in Claude Code peaks at medium reasoning effort — higher settings degrade correctness and increase cost without improving patch quality.

Slurm Coding: The AI-Powered Development Pattern Where Time Disappears
A developer describes 'Slurm coding' as an intense development pattern enabled by AI coding tools, where small ideas rapidly escalate into complete systems through a feedback loop of quick implementation and dopamine hits.

OpenAI Working on AI Smartphone with MediaTek/Qualcomm Chips; Mass Production Target 2028
According to supply chain analyst Ming-Chi Kuo, OpenAI is developing an AI smartphone with chip partners MediaTek and Qualcomm, exclusive manufacturer Luxshare Precision, and mass production planned for 2028. The device is positioned as a context-aware AI agent platform.

Claude Code v2.1.74 System Prompt Updates: Security Rules, Memory Selection, and New Skills
Claude Code v2.1.74 adds 1,750 tokens to system prompts including new security monitor rules blocking unauthorized external writes, a /stuck skill for diagnosing frozen sessions, and memory selection improvements that skip redundant API references.