Stanford Study: Law Professors Prefer AI Answers Over Peers 75% of the Time

A Stanford Law School study led by Professor Julian Nyarko found that law professors overwhelmingly prefer AI-generated answers to student questions over responses written by fellow instructors. In a blind evaluation of nearly 3,000 anonymized comparisons across 16 U.S. law schools, AI responses won 75% of head-to-head matchups against peer-written answers.
Study Design & Results
The study, titled Law Professors Prefer AI Over Peer Answers, focused on contract law. Participants created 40 representative questions that students might ask after class or during office hours. Professors wrote their own answers, then evaluated responses without knowing whether they came from AI or other professors. The AI systems performed comparably to the best human instructor in the study.
Key findings:
- AI won 75% of head-to-head comparisons against peer answers
- AI responses flagged as pedagogically harmful only 3.5% of the time
- Peer-written answers flagged as harmful 12% of the time
- Evaluations focused on nuanced legal reasoning, not factual recall
Implications for Legal Education
“This study challenges important assumptions about AI’s role in legal education,” Nyarko said. “We focused on law precisely because it requires judgment, nuanced reasoning, and the ability to navigate ambiguity—not just factual recall.”
The research also examined specific AI models including commercial tutoring systems and Google’s NotebookLM, finding varying levels of performance. Even when context limitations affected AI responses, professors still frequently preferred them to human-written alternatives.
Co-author Sarath Sanga from Yale Law School noted: “In most fields where AI gets tested, there’s a right answer. In law, there often isn’t. Two opposing arguments can both be good.”
The study is particularly notable because previous AI evaluations focused on subjects with clear right-or-wrong answers, whereas legal reasoning demands careful analysis of competing arguments and defensible conclusions.
Cautions & Open Questions
Nyarko cautioned against wholesale adoption: “How to implement these tools to most effectively improve student learning is still an open question.” The study evaluated answer quality but noted that implementation challenges such as hallucinations, overreliance, and erosion of critical thinking skills remain.
📖 Read the full source: HN AI Agents
👀 See Also

Claude Code v2.1.89 adds deferrable hooks, permission retry, and fixes memory leaks
Claude Code v2.1.89 introduces a 'defer' permission decision for PreToolUse hooks, adds a PermissionDenied hook with retry capability, and fixes critical issues including memory leaks with large JSON inputs and StructuredOutput schema cache failures.

Spotify Developers Leveraging AI for Code-Free Contributions
Spotify's key developers have not written code since December due to AI, notably through their internal 'Honk' system that facilitates remote, real-time code deployments using Claude Code.

Shenzhen's Longgang District Proposes OpenClaw Subsidies for AI Agent Startups
Longgang District in Shenzhen has released a draft policy document offering subsidies and support specifically for OpenClaw ecosystem development and OPC startups, aiming to become a global hub for AI agent entrepreneurship.

Debian's AI Contribution Policy Discussion Ends Without Resolution
Debian developers debated whether to accept AI-assisted contributions but reached no formal decision. The proposed general resolution would have required explicit disclosure and labeling for LLM-generated content.