Subquadratic Debuts 12M Token Context Window for AI Models

✍️ OpenClawRadar | 📅 Published: May 10, 2026 | 🔗 Source

Subquadratic has announced a 12-million-token context window, claiming a breakthrough in subquadratic attention mechanisms. For comparison, current models typically offer windows of 128K to 1M tokens. According to the company, the technique lets models handle far larger contexts without compute or memory growing quadratically with sequence length.

Key Details

Context window: 12 million tokens (roughly 94x larger than GPT-4 Turbo's 128K tokens, and 12x a 1M-token window)
Based on subquadratic attention, likely using linear or near-linear complexity in sequence length
Enables processing entire large codebases, long documents, or multi-hour video transcripts in a single forward pass
Potential applications: code review of entire repos, long-document analysis, multi-turn dialog with full history
Compatible with existing transformer-based LLMs via drop-in attention replacement
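
To put the window sizes in perspective, here is a back-of-envelope sketch of how standard (quadratic) attention would scale across them. It is not taken from the announcement. It assumes decimal token counts (128K = 128,000), one attention head in one layer, and 2-byte (fp16) scores; the names `WINDOWS`, `BYTES_PER_SCORE`, and the helper functions are hypothetical.

```python
# Back-of-envelope: how full (quadratic) attention scales with context length.
# Assumptions (not from the source): decimal token counts, fp16 scores,
# and a single head in a single layer.

BYTES_PER_SCORE = 2  # assumption: one fp16 value per query-key pair

WINDOWS = {"128K": 128_000, "1M": 1_000_000, "12M": 12_000_000}


def full_attention_scores(n_tokens: int) -> int:
    """Number of query-key pairs that full self-attention scores."""
    return n_tokens * n_tokens


def score_matrix_terabytes(n_tokens: int) -> float:
    """Size of the n x n score matrix in terabytes, if it were materialized."""
    return full_attention_scores(n_tokens) * BYTES_PER_SCORE / 1e12


def summarize():
    """Return (name, length ratio, pairwise-score ratio, TB) relative to 128K."""
    base = WINDOWS["128K"]
    rows = []
    for name, n in WINDOWS.items():
        length_ratio = n / base
        rows.append((name, length_ratio, length_ratio ** 2, score_matrix_terabytes(n)))
    return rows


if __name__ == "__main__":
    for name, length_ratio, cost_ratio, tb in summarize():
        print(f"{name:>4}: {length_ratio:8.2f}x tokens, "
              f"{cost_ratio:10.1f}x pairwise scores, {tb:10.3f} TB per head per layer")
```

Under these assumptions, a 12M-token window is 93.75x the length of a 128K window, and full attention over it would score 144 trillion query-key pairs per head per layer (288 TB if stored at 2 bytes each). Fused kernels avoid storing that matrix, but the arithmetic still grows with n², which is the cost a subquadratic method aims to avoid.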

The approach reportedly reduces the O(n²) cost of attention to near-O(n), likely using techniques such as state-space models or low-rank factorizations. The source provides no benchmark numbers; the claim is that this makes 12M-token windows practical on a single GPU.
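
The source does not say which method Subquadratic uses, so the following is only a generic illustration of how attention can be made linear in sequence length. It sketches kernelized "linear attention", a well-known approximation that replaces softmax with a positive feature map, and is not a state-space model or the company's technique. The function names and the choice of feature map (elu(x) + 1) are assumptions made for this example, and the code is non-causal and in pure Python for clarity rather than speed.

```python
import math


def phi(x: float) -> float:
    """Positive feature map elu(x) + 1 (an assumed, commonly used choice)."""
    return x + 1.0 if x > 0 else math.exp(x)


def feature_map(rows):
    return [[phi(x) for x in row] for row in rows]


def quadratic_attention(Q, K, V):
    """Kernelized attention the direct way: n x n weights, O(n^2) in length."""
    Qf, Kf = feature_map(Q), feature_map(K)
    dv = len(V[0])
    out = []
    for q in Qf:
        # One similarity weight per key: n weights for each of n queries.
        weights = [sum(a * b for a, b in zip(q, k)) for k in Kf]
        total = sum(weights)
        out.append([sum(w * v[j] for w, v in zip(weights, V)) / total
                    for j in range(dv)])
    return out


def linear_attention(Q, K, V):
    """Same result, reordered: summarize keys/values once, O(n) in length."""
    Qf, Kf = feature_map(Q), feature_map(K)
    d, dv = len(Kf[0]), len(V[0])
    S = [[0.0] * dv for _ in range(d)]  # running sum of phi(k) * v^T  (d x dv)
    z = [0.0] * d                       # running sum of phi(k)        (d)
    for k, v in zip(Kf, V):
        for i in range(d):
            z[i] += k[i]
            for j in range(dv):
                S[i][j] += k[i] * v[j]
    out = []
    for q in Qf:
        denom = sum(q[i] * z[i] for i in range(d))
        out.append([sum(q[i] * S[i][j] for i in range(d)) / denom
                    for j in range(dv)])
    return out


if __name__ == "__main__":
    Q = [[0.5, -1.0], [1.5, 0.2], [-0.3, 0.8]]
    K = [[1.0, 0.0], [-0.5, 0.7], [0.3, -1.2]]
    V = [[1.0, 2.0], [0.0, -1.0], [3.0, 0.5]]
    print(quadratic_attention(Q, K, V))
    print(linear_attention(Q, K, V))
```

Both functions compute the same outputs up to floating-point rounding, because matrix multiplication is associative: (φ(Q)φ(K)ᵀ)V equals φ(Q)(φ(K)ᵀV). The second ordering never builds the n × n matrix, so its cost grows linearly with the number of tokens. The trade-off is that it approximates softmax attention rather than reproducing it.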

Who It's For

AI engineers working on code analysis, document processing, or any task requiring long-context understanding without expensive chunking or retrieval.

📖 Read the full source: HN AI Agents
