Accuracy
The share of predictions a detector gets right. Alone, it hides bias, pair with precision and recall.
Reference
Plain-language definitions for the vocabulary you'll see on our reports, in our methodology, and across the field. We link to source material where relevant.
The share of predictions a detector gets right. Alone, it hides bias, pair with precision and recall.
Tactics used to make AI-generated text score lower, paraphrase tools, humanizers, character substitution. Most don't survive a generation of detector updates.
A single number from 0 to 1 that summarizes how well a detector separates AI from human across all thresholds.
Your own writing, over time, the reference point a teacher uses to judge whether a flagged essay really isn't yours.
How much sentence length and complexity vary across a passage. Human writing is usually more bursty than AI writing.
The machine-learning model that takes text features (perplexity, burstiness, embeddings) and returns an AI-likelihood score.
The version history of your document. Google Docs version history, Word track-changes, Git commits.
A numerical representation of text that captures meaning. Modern detectors use embeddings as a core feature.
The harmonic mean of precision and recall, a single number that balances both.
AI-generated text that the detector missed.
Human-written text that the detector flagged as AI. The most damaging error in academic-integrity contexts.
The percentage of human-written texts that the detector wrongly flags. Published as a core vendor-accountability metric.
A measurable property of text that feeds into the classifier. Perplexity and burstiness are classical features.
Training a pre-existing model on a specific dataset to specialize it. Detectors are often fine-tuned on AI-vs-human text pairs.
When an AI model confidently invents something, a false citation, a fake quote, a wrong fact. Not the same as detection error.
A tool that rewrites AI-generated text to evade detection. Not what we do, for reasons explained on /humanizer-policy.
A document that contains both AI-generated and human-written sections. The most common real-world case, and the hardest to score.
The kind of model that produces AI writing. GPT, Claude, Gemini, Llama, and others.
A group of related LLM versions that share training lineage. GPT-3.5 and GPT-4 are one family; Claude 3 and Claude 4 are another.
How surprising a passage is, word-by-word, to a reference language model. Low perplexity is an AI signal.
When the detector flags an essay, how often is the flag right?
Of the AI-generated essays in a batch, how many did the detector catch?
Reporting an AI-likelihood for each sentence, not just one score for the whole essay.
The score cutoff above which a detector flags a passage as AI. Movable, with tradeoffs.
The dataset a detector learned from. Determines what it generalizes to, and where it fails.
A pattern embedded in AI output that later detection can recognize. Promising in theory, rare in practice.
A detection approach that works without being trained on examples of the specific target model. Less accurate but more robust to new LLMs.
Run a sample essay through the detector to see perplexity, burstiness, and sentence-level scoring in action.
Open the detector