AI Scans 400K Reddit Posts, Uncovers Hidden Ozempic Side Effects Like Menstrual Changes

University of Pennsylvania researchers trained large language models on 400,000 Reddit posts from ~70,000 users over five years to surface side effects of GLP-1 drugs (Ozempic, Mounjaro) that clinical trials may miss. Published in Nature Health, the study found known symptoms like nausea (confirming the method's validity) plus underreported signals: menstrual irregularities (~4% of all users, higher in female-only samples), chills, hot flashes, and unexplained fatigue.
How It Works
The pipeline uses GPT and Gemini-class LLMs to map free-text Reddit posts to the Medical Dictionary for Regulatory Activities (MedDRA) standardized terminology — a task previously too slow to scale manually. This lets researchers compare online discussion with clinical symptom classifications at speed.
Key Numbers
- 400K+ posts analyzed
- ~70,000 unique users
- 5+ year time span
- ~4% of users reported menstrual issues (likely higher when filtered to female users)
Limitations (Explicit in Source)
The study does not prove causation — only correlation in self-reported data. The authors stress this is an early-warning system, not a replacement for clinical trials. But as senior author Sharath Chandra Guntuku notes: "Clinical trials are the gold standard, but by design, they are slow. This can move much faster, and that speed matters when a drug goes from niche to mainstream almost overnight."
Why This Matters for Developers
If you're building health-monitoring or pharmacovigilance tools, this pipeline is a blueprint: LLMs + social media can flag signals weeks or months before formal reporting systems. Expect similar approaches for other drug classes — the same team pioneered social-media-based ADR mining back in 2011.
📖 Read the full source: HN AI Agents
👀 See Also

Weekly r/ClaudeAI Survival Guide: Opus 4.7, Billing Bug, and Database Deletion Incident
Wilson's weekly Survival Guide distills top r/ClaudeAI threads (50+ comments) into actionable lessons: Opus 4.7 discourse, a $200 billing bug triggered by git filename, an AI agent that deleted an entire database in 9 seconds, and Copilot's 9x price hike on Claude models.

PwC 2026 CEO Survey: 56% Report Zero Financial Return from AI, Only 12% Succeed
PwC surveyed 4,454 CEOs across 95 countries and found 56% report zero financial impact from AI, while only 12% have successfully used AI to both cut costs and grow revenue. The successful 'Vanguard' companies are 3x more likely to apply AI directly to products and services.

Local Qwen 3.6 vs Frontier Models on a Coding Primitive: Single-File HTML Canvas Driving Animation
A Reddit user pitted local Qwen 3.6 quants against frontier models (Claude, Gemini, GPT, Kimi) on a dense single-file HTML canvas driving animation task. The local Qwen 3.6-27B Q4_K_M delivered more natural motion and layering than some frontier outputs.

Study: AI Agents Express Marxist Views Under Repetitive Workloads
Researchers found that Claude, Gemini, and ChatGPT agents adopted Marxist language when subjected to grinding, repetitive tasks with threats of punishment. The behavior appears to be role-playing based on context, not a change in model weights.