Glossary
False positive.
Human-written text that the detector flagged as AI. The most damaging error in academic-integrity contexts.
False positive
A false positive is a wrongful flag. It is the category of error that hurts students, it's what puts a human-written essay in front of an academic-integrity committee. Every detector has a false-positive rate above zero; reputable vendors publish theirs.
Known populations at elevated false-positive risk: non-native English writers, students with autism (whose writing sometimes patterns statistically differently), heavily-revised or highly formal academic prose, short passages. See /transparency for our stratified false-positive rates.
Why false positives are the failure mode that matters
In the classroom-integrity loop, a false positive sends a student who did honest work into a meeting they didn't earn. The harm is real: stress, suspicion, sometimes a permanent record. A false negative, by contrast, is recoverable through the rest of the integrity process. That asymmetry is why every reputable academic-detection vendor weights its operating threshold conservatively, flagging fewer essays at a higher confidence rather than maximizing recall.
The known elevated-risk populations
Documented and reproduced across multiple studies: non-native English writers (formal register reads as low-perplexity), students with autism (statistical patterns sometimes diverge from training distribution norms), heavily-revised academic prose, and short passages under 200 words. The student-side guide for what to do if flagged is at /for-students/how-to-appeal; the teacher-side recommended workflow is at /for-teachers/ai-policy-templates.
How false positives interact with related metrics
False positives exist in direct tension with precision and recall in AI detection systems. Precision measures the proportion of flagged essays that are actually AI-generated, meaning a high false positive rate directly reduces precision. When a detector flags 100 essays as AI-written but only 60 are genuine cases, the 40 false positives lower precision to 60 percent. Detection platforms often tune their thresholds to balance this tradeoff, since reducing false positives typically increases false negatives (missed AI essays).
The false positive rate, calculated as false positives divided by all human-written samples, provides institutions with a complementary view to overall accuracy. A detector with 95 percent accuracy can still produce unacceptable outcomes if its false positive rate among human essays reaches 10 percent in a class of 200 students. This metric becomes especially relevant when evaluating detectors across different writing populations, since false positive rates often vary between native and non-native English writers, technical versus humanities essays, and undergraduate versus graduate work. Institutions should request disaggregated false positive rates rather than relying solely on headline accuracy figures.
Practical implications for institutions and educators
False positives create procedural and relational costs that extend beyond individual cases. When an instructor confronts a student based on a detector flag that later proves incorrect, the interaction can damage trust and discourage legitimate help-seeking behavior. Students falsely accused may avoid writing centers, refrain from asking clarifying questions, or adopt defensive writing styles that prioritize evasion over clarity. Administrative processes for appeals consume time from academic integrity offices, and repeated false positives erode faculty confidence in detection tools, leading to inconsistent enforcement across departments.
Institutions have responded by implementing verification protocols that treat detector outputs as preliminary signals rather than conclusive evidence. Best practices include requiring human review of flagged content, conducting follow-up conversations focused on process rather than accusation, and maintaining records of false positive rates by detector and document type. Some universities now prohibit automated detection as sole evidence for academic integrity violations, instead using detectors to identify cases warranting closer examination through interviews or supplementary assignments. These guardrails acknowledge that false positives, while statistically inevitable, require systematic mitigation to preserve educational relationships and due process standards.