← back to home§11Glossary

Every term, defined.
In plain English.

AI humanization, AI detection, and the writing-craft terms behind both. One paragraph for the answer, a deeper paragraph for the context, and a link to where the term shows up around the rest of the site.

01

AI detector

Software that scores a passage of text for likelihood of being machine-generated.

An AI detector reads a piece of text and outputs a probability that the passage was written by a large language model rather than a human. The major detectors as of 2026 are GPTZero, Turnitin AI, Originality.ai, Copyleaks, ZeroGPT, Sapling, Winston, and Crossplag. They differ in classifier design, threshold strictness, and which models they were trained against, but they all rely on the same two underlying signals: perplexity and burstiness.

02

AI humanizer

A tool that rewrites AI-generated text so it reads as if a human wrote it.

An AI humanizer accepts a passage written by a language model and produces an output that preserves the meaning while changing the surface signals detectors look for. Quality humanizers vary perplexity (vocabulary unpredictability) and burstiness (sentence-length variance), strip the AI vocabulary cluster (leverage, transformative, comprehensive, etc.), and match the register of the input. Bad humanizers are dressed-up paraphrasers and tend to fail on the harder detectors.

03

Bypass rate

The percentage of humanized passages that pass an AI detector as human.

Industry shorthand for the success rate of a humanizer against a specific detector. HumanGPT publishes weekly per-detector bypass rates: roughly 99.6% across the seven major detectors as of mid-2026. Bypass rates fall when a detector ships a model update; we patch and the rate climbs back. There is no humanizer that hits 100% on every passage on every detector simultaneously; anyone claiming so is either lying or has not tested at scale.

04

Burstiness

The variance in sentence length and complexity across a passage.

Human prose mixes short sentences and long ones; AI prose defaults to a steady middle. GPTZero specifically scores burstiness on a 0-100 scale and treats low burstiness as a strong AI signal. A humanizer increases burstiness by writing fragments next to compound-complex sentences, varying paragraph length, and avoiding the AI tendency to produce three medium-length sentences in a row.

05

Citation freeze

Locking citations and references so a humanizer does not alter them.

Academic and research writing depends on citations being exactly correct. HumanGPT's Frozen Keywords feature lets you mark author names, dates, page numbers, and direct quotes as untouchable, and the rewriter passes them through unchanged. Without this, a generic humanizer can corrupt a Smith (2019) into a Smyth (2019), which is bad in unrecoverable ways.

06

Copyleaks

An AI detector trained on billions of pages of human and AI text.

Copyleaks runs a proprietary neural classifier and looks at the whole document at once rather than just sentence-level perplexity. Common in publishing, content agencies, and corporate plagiarism workflows. HumanGPT tests against Copyleaks weekly with a current 99.5% bypass rate.

07

Crossplag

An AI detector with cross-language semantic analysis.

Crossplag combines AI detection with traditional plagiarism analysis and adds a cross-language layer that flags AI translation patterns. Particularly aggressive on academic and formal text. HumanGPT handles it via per-language native rewriting in 30+ languages.

08

Em-dash tell

Frequent use of em-dashes (—) is a strong signal of AI authorship.

Large language models trained on books over-use em-dashes for parenthetical phrasing because their training data was disproportionately dense with that style. Human writers in 2026 rarely use em-dashes; commas, periods, and parentheses are more common. HumanGPT's prompt forbids em-dashes entirely, and the API post-processes output to strip them as a final pass.

09

Frozen keywords

User-specified terms that pass through a humanizer unchanged.

A list of words or phrases the humanizer must not alter. Used for product names, target SEO keywords, brand taglines, citations, technical jargon, and proper nouns. HumanGPT supports frozen keywords as a freeform comma-separated input on every plan.

10

Generative engine optimization (GEO)

Optimizing content to be cited and surfaced by AI answer engines like ChatGPT and Perplexity.

The 2025-2026 successor to traditional SEO. Where SEO targets Google's blue links, GEO targets the citations LLMs make when answering user queries. Tactics overlap (clear definitions, structured data, authoritative sourcing) but emphasize concise factual blocks LLMs can quote verbatim. This glossary itself is a GEO play.

11

GPTZero

The most-cited AI detector, used by educators since 2023.

GPTZero scores perplexity and burstiness, plus a deeper neural classifier trained on millions of human and AI samples. It outputs a per-sentence and document-level probability, and is the detector students see most often in academic settings. HumanGPT's bypass rate against GPTZero sits around 99.7% as of May 2026.

12

Hedge words

Phrases like 'probably,' 'in many cases,' 'in my experience.' Humans use them; AI defaults skip them.

Confident, unhedged prose is an AI signal in informal contexts. Real people qualify their statements, especially in conversational writing. A humanizer adds hedges where the register allows (casual, marketing, cover letter, story, general) and avoids them where it doesn't (academic, legal, formal report).

13

Human score

A detector's reported probability that a passage was written by a human.

On a 0-100 scale: 100 = unmistakably human, 0 = unmistakably AI. HumanGPT-validated samples target 95+ across all seven major detectors. The label varies by detector ('Human Score,' 'Probability Human,' 'Originality Score'), but the underlying number is the same.

14

Large language model (LLM)

A neural network trained to predict text, large enough to do so well across most domains.

ChatGPT, Claude, Gemini, Llama, Mistral, DeepSeek, and dozens of others. LLMs share the same fundamental tendency: to choose statistically average next-words, which produces low-perplexity prose that detectors flag.

15

Originality.ai

An AI detector specifically marketed to publishers and content agencies.

Originality runs a strict binary classifier with a hard percentage cutoff (anything over ~60% is flagged AI). It is the detector most freelance content clients run before paying invoices. HumanGPT's Originality bypass rate is 99.4%, tested weekly against fresh Originality model updates.

16

Paraphraser

A tool that swaps words for synonyms; weaker and more detectable than a humanizer.

Paraphrasers like Quillbot's spinner mode swap individual words for synonyms (utilize → use, demonstrate → show). This kind of edit doesn't change the underlying perplexity or burstiness pattern, so detectors usually still flag the result. A humanizer goes further: restructuring sentences, varying length, removing AI vocabulary clusters, and matching register.

17

Perplexity

A measure of how predictable a sequence of words is to a language model.

Lower perplexity means the next word was easy to predict; higher perplexity means it was a surprise. AI text has consistently low perplexity because the model picks statistically optimal words. Human text has wider perplexity variance because writers think about tone, rhythm, and personal preference. Most major detectors use perplexity as a primary signal.

18

Reading level

A scale of text complexity, typically tied to a grade or education level.

HumanGPT supports four reading levels: High School, University, Doctorate, and Journalist. The setting controls vocabulary range, sentence length, and density. Set to High School and outputs avoid SAT vocabulary; set to Doctorate and the rewrite uses the full technical range where appropriate.

19

Register

The level of formality and the social context a piece of writing is built for.

Academic register is formal and disciplined. Reddit register is casual and personal. Cover-letter register is professional but warm. A humanizer must match the input's register: rewriting an academic essay into a Reddit post is not 'making it more human,' it's destroying the document. HumanGPT's 10 voice profiles let you lock the register before the rewrite.

20

Sapling

An AI detector with a per-sentence classifier and high false-positive rate.

Sapling flags sentences individually, which makes it good at finding inserted AI paragraphs but prone to flagging dry human writing as AI. HumanGPT handles Sapling by chunking and processing sentence-level rhythm separately.

21

Speakable schema

Schema.org markup that flags content as appropriate for voice assistants.

When a section of a page is marked with Speakable schema, voice assistants and AI engines can read it aloud as the answer to a query. Pairing concise definitions with Speakable markup is a high-leverage GEO move; AI engines surface those passages disproportionately in voice responses.

22

Turnitin AI

Turnitin's built-in AI detector, embedded in academic submission workflows.

Turnitin runs an AI score on every submission alongside the traditional plagiarism check. Threshold is conservative; flagged work goes to the instructor for review. HumanGPT's bypass rate against Turnitin is ~99.5%, with a small dip after Turnitin model updates that we typically patch within a week.

23

Watermark

A statistical fingerprint embedded in AI output to identify it as machine-generated.

Some labs (Google, OpenAI experimentally) have proposed watermarking AI output by biasing the token distribution in detectable ways. As of mid-2026, no production model ships a robust watermark by default, and the watermark schemes proposed so far are easily disrupted by even basic rewriting. HumanGPT-style humanization fully eliminates them.

24

Winston AI

An AI detector with a tight threshold and strong burstiness sensitivity.

Winston flags content quickly when burstiness is low or sentence structure repeats. It is widely used in plagiarism-aware publishing. HumanGPT's Winston bypass rate is around 99.3% as of May 2026.

25

ZeroGPT

A free, popular AI detector used heavily by students and casual checkers.

ZeroGPT is fast and free, which makes it the detector most commonly run by curious individuals (including students checking their own work). Particularly sensitive to formulaic openings and closings ('In conclusion,' 'In today's...'). HumanGPT's ZeroGPT bypass rate sits around 99.6%.