AI Chatbots Leaking Real Phone Numbers: The PII Exposure Problem

AI chatbots are exposing real people's phone numbers. A Redditor reported being inundated with calls from strangers looking for a lawyer or locksmith—misdirected by Google's Gemini. In March a software engineer in Israel was contacted on WhatsApp after Gemini gave out his personal number as PayBox customer service. In April a PhD candidate got Gemini to output a colleague's cell number.
How It Happens
LLMs are trained on web scraped data containing PII. The article notes that the open-source DataComp CommonPool dataset includes résumés, driver's licenses, and credit cards. Even a single instance of a phone number posted online (e.g., on a QA site in 2015) can be reproduced years later.
Scale of the Problem
DeleteMe, which helps remove personal info from the internet, reports a 400% increase in AI-related privacy queries in the last seven months—up to a few thousand. Breakdown: 55% reference ChatGPT, 20% Gemini, 15% Claude, 10% others. Two common scenarios: a user asks about themselves and gets accurate home/phone data, or the chatbot generates plausible-but-wrong contact info for someone else.
Rob Shavell (DeleteMe co-founder) says complaints typically involve the chatbot returning accurate home addresses, phone numbers, family names, or employer details when asked innocuous questions about the user.
What Can Be Done
Experts say the root cause is PII in training data, but the exact mechanism is unclear. There is little users can do to prevent exposure. The article suggests the problem will worsen as AI companies seek new data sources.
📖 Read the full source: HN AI Agents
👀 See Also

Axios 1.14.1 compromised with malware, targets AI-assisted development workflows
Axios version 1.14.1 has been compromised in a supply chain attack that silently pulls in [email protected], an obfuscated RAT dropper. Developers using AI coding assistants like Claude should immediately check their lockfiles and machines for infection.

AI System Discovers 12 OpenSSL Zero-Days, Curl Cancels Bug Bounty Due to AI Spam
AISLE's AI system discovered all 12 zero-day vulnerabilities in OpenSSL's recent security release, marking the first large-scale demonstration of AI-based cybersecurity. Meanwhile, curl cancelled its bug bounty program due to AI-generated spam submissions.

Cybercriminals Are Pushing Back Against AI-Generated Slop on Underground Forums
New research shows low-level hackers and scammers are complaining about AI-generated posts on cybercrime forums, viewing them as low-quality noise that undermines community trust and social interaction.

Claude Code source map leak reveals minified JavaScript was already public on npm
A source map file accidentally included in version 2.1.88 of the @anthropic-ai/claude-code npm package revealed internal developer comments, but the actual 13MB cli.js file containing 148,000+ plaintext strings has been publicly accessible on npm since launch.