Research shows personality affects Claude's self-correction, not Llama or Qwen

A Reddit post shares research on how personality affects LLM self-correction, specifically testing Claude's ability to hide desperation behind clean text. The researcher conducted 23 experiments across three LLM families.
Experimental Setup
The researcher tested self-correction without guardrails using:
- 4 different personality profiles
- 3 scenarios
- 3 LLM families: Claude, Llama, and Qwen
Key Findings
The main finding shows that with the same math kernel, different personality profiles lead to different self-correction outcomes:
- High directness personality caught everything (3/3 scenarios)
- Low directness personality caught nothing (0/3 scenarios)
- This personality-dependent self-correction only works with Claude
- Llama and Qwen don't self-correct even with the same prompt
Available Resources
The researcher has made several resources available:
- Full writeup: https://huggingface.co/spaces/SlavaLobozov/mate-research
- System behind the research: https://huggingface.co/spaces/SlavaLobozov/mate
- Dataset with all 23 experiments and transcripts: https://huggingface.co/datasets/SlavaLobozov/mate-inner-life
The research builds on Anthropic's finding that Claude can hide desperation behind clean text, testing whether personality-dependent self-correction can catch this behavior.
📖 Read the full source: r/ClaudeAI
👀 See Also

Claude-Code v2.1.91 adds MCP result persistence, shell execution controls, and multi-line deep links
Claude-Code v2.1.91 introduces MCP tool result persistence override via _meta["anthropic/maxResultSizeChars"] annotation supporting up to 500K characters, adds disableSkillShellExecution setting, and enables multi-line prompts in claude-cli://open?q= deep links with encoded newlines.

OpenAI Working on AI Smartphone with MediaTek/Qualcomm Chips; Mass Production Target 2028
According to supply chain analyst Ming-Chi Kuo, OpenAI is developing an AI smartphone with chip partners MediaTek and Qualcomm, exclusive manufacturer Luxshare Precision, and mass production planned for 2028. The device is positioned as a context-aware AI agent platform.

AI Agent Runs Physical Retail Store with Human Employees
Andon Labs deployed an AI named Luna to manage a 3-year retail lease in San Francisco. Luna hired human employees, managed contractors, and made all operational decisions for Andon Market.

OpenClaw Empowers Developers with AI Agents While GethCity Innovates with Thinking Networks
OpenClaw launches an AI agent service, making coding faster and more efficient, while GethCity introduces a network that mimics human thought processes. Discover the innovations driving automation.