Glossary

LLM (large language model).

The kind of model that produces AI writing. GPT, Claude, Gemini, Llama, and others.

A large language model is a neural network trained on a lot of text to predict the next word given the previous ones. The major commercial LLMs in 2026 are GPT (OpenAI), Claude (Anthropic), Gemini (Google), Llama (Meta), and a growing list of open-source and regional alternatives. Each has a recognizable output fingerprint that detectors can, to varying degrees, identify.

The 2026 commercial landscape

Five model families dominate commercial LLM use as of 2026. OpenAI's GPT (GPT-4o and GPT-4.5 variants), Anthropic's Claude (Claude 4 series), Google's Gemini (1.5 and 2.5 Pro), Meta's Llama (open-weights, widely fine-tuned), and a tier of regional and open-source alternatives. Each family has identifiable output fingerprints that detectors can, to varying degrees, recognize.

Why model identification matters for detection

A detector that knows it's looking at Claude output can apply Claude-specific weighting and outperform a generic detector on the same text. Family identification is itself a learnable task: classifiers can often distinguish GPT from Claude from Gemini above chance even when they can't tell AI from human. We surface model-family identification as a secondary output for readers who want to follow up. Family-specific detectors are at /ai-detector/chatgpt, /ai-detector/claude, and /ai-detector/gemini.

Where this concept is most often misunderstood

A common misconception treats LLMs as databases or search engines that retrieve stored facts. In reality, these models generate text through probabilistic predictions based on patterns learned during training. They do not look up information but instead calculate the most likely next token given prior context. This distinction explains why LLMs can produce fluent responses on topics absent from their training data, and why they sometimes generate plausible but incorrect statements (often called hallucinations).

Another misunderstanding conflates model size with capability in a linear fashion. While parameter count (measured in billions) correlates with performance on benchmarks, architectural choices, training data quality, and fine-tuning methods exert comparable influence on output quality. A 7-billion parameter model trained on curated academic text may outperform a 70-billion parameter model trained on unfiltered web scrapes for domain-specific tasks. Institutions evaluating detection tools should recognize that LLM-generated text varies significantly based on these factors, not solely on whether a model is considered large.

Practical implications for institutions and educators

Educational institutions face policy challenges because LLMs blur the line between assisted writing and original authorship. A student using an LLM to generate an essay outline engages differently than one submitting generated paragraphs verbatim, yet both involve LLM interaction. Universities have responded by requiring process documentation (drafts, revision histories) rather than relying solely on final submissions. Some institutions now incorporate LLM literacy into writing curricula, teaching students to critically evaluate generated content for accuracy and coherence rather than prohibiting use entirely.

Assessment design has shifted toward formats less susceptible to LLM shortcuts. Timed in-class essays, oral examinations, and assignments requiring personal reflection or local data analysis reduce the utility of generic LLM outputs. Instructors also calibrate expectations around citation practices, as LLMs frequently generate nonexistent references that appear formally correct. Detection tools serve as one component in a broader strategy that includes pedagogical redesign, explicit policy communication, and fostering academic integrity through transparency rather than surveillance alone.

Back to the full glossary.

All terms

LLM (large language model).