arXiv to Ban Authors for a Year Over Unchecked AI Use in Research Papers
The new policy reinforces that authors remain fully responsible for the integrity of their work, even if parts of it were AI-generated.
News
- Anthropic to Brief Financial Stability Board on Mythos Cyber Findings
- OpenAI Explores Legal Options Amid Tensions With Apple
- AI Has Outpaced How Companies Measure Developer Productivity, Report Finds
- AI Dispatch | May 8 - 14
- du Launches Sovereign Industrial AI Platform for UAE Manufacturers
- Huang Foundation Donates $108M CoreWeave Compute for AI Research
[Image: ChetanJha/MITSMR Middle East]
The scientific community is drawing sharper lines around accountability as the use of generative AI in science spikes. The latest signal comes from arXiv, the popular preprint open repository that has become central to how research circulates in fields such as computer science, mathematics, and physics.
Although papers uploaded to arXiv are not peer-reviewed, the platform plays an outsized role in shaping scientific discourse. Researchers often use it to rapidly disseminate findings, establish precedence, and track emerging trends long before journal publication. Increasingly, however, it is also confronting the unintended consequences of widespread large language model (LLM) adoption.
The repository has already introduced measures to limit low-quality or machine-generated submissions, including requiring first-time contributors to obtain endorsements from established authors. Now, arXiv moderators are signaling a tougher stance on what they see as careless or unchecked use of generative AI in research papers.
In a recent policy note, Thomas Dietterich, chair of arXiv’s computer science section, stated that submissions containing “incontrovertible evidence” that authors failed to verify LLM-generated content would face significant penalties. Such evidence could include hallucinated citations, fabricated references, or residual prompts and comments copied directly from AI systems into manuscripts.
Authors whose papers violate the rule could face a one-year ban from arXiv. After that period, any future submissions would need to be accepted first by a reputable peer-reviewed venue before appearing on the repository.
The policy does not prohibit researchers from using LLMs. Instead, it reinforces a principle increasingly emphasized across academia: authors remain fully responsible for the integrity of their work, regardless of whether parts of it were generated by AI. Researchers cannot deflect blame for factual errors, plagiarism, biased language, or fabricated citations by attributing them to a chatbot.
Dietterich said the policy would operate as a “one-strike” system, though enforcement would include multiple layers of review. Moderators must first flag problematic submissions, after which section chairs confirm the evidence before penalties are imposed. Authors will also retain the right to appeal.
The move highlights concerns about the reliability of AI-assisted scholarship. Recent peer-reviewed studies have documented a rise in fabricated citations in biomedical research, a trend many researchers attribute to generative AI systems that confidently invent sources.
