HN data confirms arXiv paper share dropping, LLM hype peak behind us?

Dylan Castillo wanted to confirm whether he was seeing fewer arXiv papers on Hacker News front pages, so he used Claude to run a quick analysis against the BigQuery HN dataset. The results show a clear trend: the share of arXiv stories on HN has been declining sharply in the last few months.
He also looked at historical peaks. The first peak in 2019 was driven by deep learning papers — 41% of the top 100 upvoted arXiv posts that year were about deep learning. The 2023–2026 period saw an even heavier AI focus: 59% of the top 100 upvoted arXiv stories were about LLMs or AI. In 2019 the standout papers included MuZero (161 pts), EfficientNet (119 pts), XLNet (79 pts), the PyTorch NeurIPS paper (113 pts), and Chollet's “On the Measure of Intelligence” (80 pts).
For the 2023–2026 period, Castillo asked Claude to guess which papers will age well. The picks: DeepSeek-R1 (1,351 pts, open recipe for o1-style reasoning via RL), Generative Agents (391 pts, the “Smallville” paper), The Era of 1-bit LLMs / BitNet b1.58 (1,040 pts), Differential Transformer (562 pts), and the LK-99 cluster (2,408 + 1,690 pts combined, a landmark in open-science replication). The full analysis includes charts for topic distribution and the arXiv share over time.
📖 Read the full source: HN LLM Tools
👀 See Also

Claude Opus 4.7 Flags Hantavirus Vaccine Questions as Safety Risk, Pausing Chats
Asking Claude Opus 4.7 how to develop a hantavirus vaccine triggers safety filters pausing the chat, while Sonnet 4.6 also blocks related predictive modeling.

Qwen3.6-27B Fits on Single 24GB GPU, Beats Former 397B MoE on SWE-bench
Qwen3.6-27B (Apache 2.0, 262K context) runs at Q4_K_M in ~16.8GB, achieving SWE-bench Verified 77.2 — outperforming Qwen3.5-397B-A17B MoE (76.2). Uses Gated DeltaNet linear attention with Thinking Preservation for agent workflows.

Unlocking OpenClaw's Potential: Integrating with CodeX
Discover how OpenClaw users can seamlessly invoke CodeX for enhanced functionality. Explore user discussions and key methods in this engaging tutorial.

SWE-rebench Leaderboard Update: February 2026 Results Show Tight Competition
The SWE-rebench leaderboard has been updated with February 2026 results testing 57 fresh GitHub PR tasks. Claude Opus 4.6 leads with 65.3% resolved rate, but the top six models are within 5 percentage points.