ArXiv will issue a one-year submission ban to authors whose preprints carry visible evidence of unchecked large-language-model output, according to a clarified policy posted on X by Thomas Dietterich, chair of the platform’s computer science section. After serving the year, an author’s next submission must first clear peer review at a reputable venue before normal preprint posting resumes.
The penalty is narrower than the headlines suggest. Moderators are looking for in-manuscript artifacts: hallucinated references, leftover model meta-comments, placeholder tables that any human reader would catch on a single read. The shape of that bar is what makes the rule both useful and limited.
What ArXiv Will Treat as Incontrovertible
The policy hinges on a single phrase Dietterich used in his clarification: incontrovertible evidence that the authors did not check the results of LLM generation. That phrase carves the rule’s perimeter. Routine AI-assisted editing, code generation, proof checking, and idea drafting all stay legal. Pasting model output without reading it is what trips the wire.
The mechanism is two-step. A moderator flags the submission. The section chair, Dietterich himself for computer science, must confirm the evidence before the suspension applies. Authors retain an appeal window between flag and confirmation.
| Submission Behavior | Outcome |
|---|---|
| Hallucinated references that point to non-existent papers | One-year suspension |
| Visible model meta-comments left in the body (“would you like me to make any changes?”) | One-year suspension |
| Placeholder text in tables or figures (“fill in with real numbers”) | One-year suspension |
| LLM used for drafting, with author verification of every output | No penalty |
| LLM used for code, proofs, or simulations | No penalty |
| LLM used for grammar or proofreading | No penalty |
The rule does not require detection software. It requires a human moderator to read the file and find an artifact a co-author should have caught.
The Three Tells That End a Submission
ArXiv’s moderators do not run authorship classifiers across every submission. They look for three specific kinds of manuscript-side evidence, all of them visible to anyone reading the PDF.
- Fabricated references. Citations that point to papers nobody wrote, often with a plausible journal name and a DOI-shaped string that resolves to nothing.
- Leftover model dialogue. Phrases such as “here is a 200-word summary, would you like me to make any changes?” or “I cannot help with that” sitting inside the manuscript body, captions, or footnotes.
- Placeholder instructions. Notes like “fill in with real numbers from your experiments” or “insert citation here” embedded in tables, figure legends, or methods sections.
Each is something a co-author skimming the PDF would catch in thirty seconds. That is the deliberate design. The bar for evidence is what any reasonable reader could verify, which is also what makes the suspension defensible on appeal.
Why the Crackdown Lands in May
The clarification follows a Lancet audit of fabricated biomedical citations published on May 7 by researchers at Columbia University’s School of Nursing and Data Science Institute. The team scanned 2.5 million PubMed Central papers published between January 2023 and February 2026, verifying roughly 97.1 million references against the published record.
The headline finding is a twelvefold rise in fabrication, with the steepest jump beginning in mid-2024 when writing-grade language models became broadly accessible.
- 1 in 277 PubMed-indexed papers in the first seven weeks of 2026 cited at least one paper that does not exist.
- 1 in 458 was the rate across 2025.
- 1 in 2,828 was the rate during 2023, when the audit window opened.
- 98.4 percent of affected papers had received no correction or retraction from their publishers when the audit closed in February.
Lead author Maxim Topaz, associate professor at Columbia’s School of Nursing, told the institution’s press desk that the rate tracks the mid-2024 mainstreaming of generative writing tools. One paper in the audit sample carried 18 fabricated references out of 30, several of which had already been cited by downstream work shaping clinical practice. The pattern is the broader context here, and it sits alongside the school-level problem covered in how schools are struggling with student AI use. The arXiv decision falls seven days after the Lancet letter landed. The two are not coincidental.
The Pressure Now Shifts to Peer Review
The rule’s most consequential feature has barely registered in early coverage. The re-entry condition turns peer review into the gatekeeper for any author who has served the suspension. That assumes peer review is in shape to absorb the load. The Columbia audit suggests it is not.
Why the Bar to Re-Enter Just Rose
A suspended author cannot wait out twelve months and simply resume posting. The next paper must first be accepted at a recognized peer-reviewed journal or conference. Computer-science conference review cycles run on six-to-nine-month timelines for the major venues, and those venues are themselves dealing with hallucinated-citation outbreaks. GPTZero’s screening tool flagged more than 100 fabricated references across 53 accepted NeurIPS 2025 papers and surfaced confirmed fabrications in 50 of 300 ICLR submissions it sampled. The gatekeeper is also leaking.
What This Means for Conference Pipelines
Section chairs at the preprint server and program chairs at top conferences now share an enforcement burden the literature was not designed to carry. The repository’s October 2025 update to its computer-science review policy already required that review articles and position papers in the CS category clear peer review before posting. The May rule extends the pattern: anything flagged for unchecked LLM artifacts is funneled into the same bottleneck. Conferences absorb the verification cost of work that would previously have circulated as a preprint. Processing queues get longer for everyone.
Where the Burden Lands Next
Funding agencies, institutional review boards, and journal editors are the next nodes likely to feel the pressure. If the preprint server’s filter pushes flagged work into peer review, and peer review’s filter is itself porous, the responsibility for catching fabrication migrates toward the funder or the employer. None of those institutions has a standing process for detecting hallucinated citations in real time. The infrastructure to verify that every reference in a million-paper-a-year output points to a real document does not yet exist outside small audits like the one Topaz published.
What the Rule Leaves Untouched
The policy as described does not catch the polished bad actor. A researcher who pastes generated output, then sweeps the manuscript for meta-comments, swaps the obviously fake references for real ones formatted correctly, and removes placeholder text walks past the bar. The substance of the LLM-generated argument stays in the paper. Nothing in Dietterich’s clarification suggests moderators will go looking for it.
By signing your name as an author of a paper, each author takes full responsibility for all its contents, irrespective of how the contents are generated.
That language sits in the EurekAlert release accompanying the Lancet audit and echoes Dietterich’s framing. It puts the moral burden on the author. The technical filter, by contrast, only catches authors who did not bother to read what they pasted. The two move in opposite directions. The careful user of unverified tools is invisible to this rule, which is why the Columbia trend curve is unlikely to bend from the preprint-server side alone.
Frequently Asked Questions
Does ArXiv Ban Authors for Using AI to Write Any Part of a Paper?
No. The policy targets unchecked output only. Using a model for drafting, code generation, proof checking, or proofreading remains permitted as long as the author verifies every result before submission and removes any model-side artifacts.
What Counts as Incontrovertible Evidence?
Three categories: hallucinated references that point to non-existent papers, visible LLM meta-comments left in the body of the manuscript, and placeholder text in tables or figures. A moderator flags the issue; the section chair must confirm the evidence before any suspension applies.
Can a Banned Author Appeal?
Yes. The appeal window sits between the moderator flag and the section chair’s confirmation. Authors can contest the evidence and offer documentation that the artifact was inadvertent or that the references resolve correctly.
What Happens After the One-Year Ban Expires?
The author’s next paper must be accepted at a reputable peer-reviewed journal or conference before standard arXiv posting resumes. Conference acceptance counts. Journal acceptance counts. A second preprint posting does not.
Does This Affect Papers Already Posted to ArXiv?
No retroactive removals have been announced. The clarified rule applies to new submissions reviewed by moderators going forward, beginning with manuscripts received after the May posting.
If enforcement leans on the visible artifacts any reader can catch, the careless authors disappear from the upload queue and the underlying fabrication rate the Columbia team is tracking keeps climbing. If section chairs broaden the evidentiary bar to subtler signals, the appeals docket will tell us by late summer.





