Glossary

Threshold.

The score cutoff above which a detector flags a passage as AI. Movable, with tradeoffs.

Every detector produces a continuous probability (say, 0.73). The threshold is where you draw the line between "flag" and "don't flag". A threshold of 0.5 is the default; higher thresholds reduce false positives at the cost of missing more AI.

Published performance metrics are always threshold-specific. When comparing detectors, use AUC (threshold-independent) or compare precision/recall at matched thresholds.

What the threshold knob actually controls

The classifier outputs a continuous probability for every input, say 0.73 for one essay, 0.31 for another. The threshold is the cutoff that turns those continuous numbers into binary "flag" or "don't flag" decisions. Set the threshold at 0.5 and 0.73 flags, 0.31 doesn't. Set it at 0.8 and only the very-high-confidence cases get flagged.

Why threshold choice is a deployment decision, not a tool decision

The detector's classifier doesn't know the deployment context, classroom integrity, editorial screening, content moderation each have different cost structures. Reputable detectors expose the threshold as a configurable so institutions can tune to their tolerance, and publish performance metrics at multiple operating points. Our recommended thresholds for each use case (classroom, editorial, mass-market) are documented on /methodology.

Where This Concept Is Most Often Misunderstood

The most common misunderstanding about thresholds in AI detection is the belief that a single universal cutoff value applies across all contexts and models. In practice, threshold values are model-specific and dataset-dependent. A threshold of 0.5 in one detection system may correspond to vastly different false positive rates compared to another system using the same numeric value. Institutions frequently import threshold recommendations from one platform directly into another without recognizing that these values are calibrated against different training data, different feature sets, and different underlying architectures.

Another persistent confusion involves the relationship between threshold and certainty. A detection score above threshold does not mean the system is certain about AI authorship. It means the score exceeds the predetermined decision boundary. A text flagged at 0.51 when the threshold is 0.50 carries nearly identical statistical properties to one scored at 0.49, yet one triggers intervention and the other does not. This binary outcome from a continuous probability distribution leads educators to overweight small numerical differences that fall near the boundary, treating marginally different scores as categorically distinct evidence.

Practical Implications for Institutions and Teachers

Setting institutional thresholds requires balancing investigative resources against tolerance for undetected AI use. A lower threshold increases sensitivity but generates more cases requiring manual review, creating workload burdens that smaller institutions may lack capacity to address. Research from academic integrity offices in 2024 and 2025 indicates that thresholds below 0.6 in most commercial systems produce review queues that exceed available staff hours by factors of three to five. Conversely, high thresholds above 0.8 reduce false positives but allow substantial AI-assisted work to pass unexamined, particularly in cases where students blend AI output with original writing.

Teachers should document threshold values and apply them consistently across all students in a course. Adjusting the threshold mid-semester or applying different standards to different assignments introduces bias and undermines procedural fairness. Some learning management systems allow per-assignment threshold configuration, but variance in these settings often reflects instructor uncertainty rather than pedagogical intent. Best practice involves establishing a single institutional or departmental threshold with explicit guidelines for when human reviewers should escalate borderline cases, rather than relying on ad hoc threshold adjustments to manage ambiguous results.

Back to the full glossary.

All terms