What is perplexity in AI detection?

Perplexity measures how predictable your next word is. Low perplexity means the words follow each other in expected ways, which is how AI text tends to read. High perplexity means more surprising, varied word choices, which is more typical of humans. It's the original signal GPTZero was built on, created by Edward Tian at Princeton in January 2023. Detectors pair it with burstiness, the variation in your sentence lengths.

Are non-native English writers flagged more often?

Yes, and the gap is large. A 2023 Stanford study led by Weixin Liang found GPT detectors flagged 61.3% of TOEFL essays by non-native English speakers as AI, compared with 5.1% for essays by US-born students. The cause is mechanical: second-language writers often use simpler, steadier sentence structures, which detectors read as low perplexity, the same signal machine text produces. It's a known bias in the tools, not a sign of cheating.

Does changing a few words bypass AI detection?

Usually not. Swapping synonyms changes the surface but leaves the deep patterns intact, especially sentence-length variance and burstiness, which are the strongest residual signals. The Pangram Labs DAMAGE paper (2025) showed several stylometric features survive even a full humanizing pass. Word-swapping shifts your AI focal vocabulary score a little, but a good detector still reads the rhythm of your sentences underneath. Structure matters far more than vocabulary here.

Which AI detector is the most accurate?

There's no single answer, because vendor benchmarks aren't comparable. Originality.ai reports about 96.7% accuracy on the RAID benchmark for paraphrased text, and HumanGPT's detector hits a held-out AUROC of 0.987. But Scribbr's 2024 third-party test put real-world accuracy near 76% across tools. Accuracy also drops once text is edited or translated, per Weber-Wulff et al. (2023). Treat any single percentage with caution and never rely on one tool alone.

What does AUROC 0.987 actually mean?

AUROC measures how well a detector separates AI text from human text across every possible threshold. A score of 0.5 is a coin flip and 1.0 is perfect, so 0.987 is strong. Close, not perfect. For HumanGPT's detector, that translates to catching 95% of AI text at a 5% false-positive rate, and 77% at a stricter 1% false-positive rate. Held-out means the score reflects text the model never saw in training, not memorization.

What is burstiness in writing?

Burstiness is the ratio of how much your sentence lengths vary to their average length. Human writing wobbles: a short sentence next to a long one, landing roughly in a 0.6 to 1.2 band. AI text sits below that, too even, because models produce consistent, smoothed-out sentences. Along with perplexity, burstiness is one of the two core signals AI detectors rely on. Varying your sentence rhythm raises it toward the human range.

Why do detectors flag transition words like 'however' and 'moreover'?

Because RLHF-tuned models overuse them. Real human prose uses transition adverbs (however, moreover, furthermore, thus, therefore) in under 0.5% of words. AI text hits 1% to 5%, several times the human rate. It's a small but reliable tell. The fix is simple: cut most of them. Your writing reads less stiff, the rhythm loosens, and you drop a signal detectors key on. It also tends to be better prose, and it sounds more like you.

Can teachers rely on AI detectors to catch cheating?

Not on their own. A detector score is a probability, not proof, and the false-positive rate is uneven: 61.3% of non-native English essays flagged versus 5.1% for US-born writers in the 2023 Stanford study. Turnitin shipped AI detection in April 2023, but third-party testing puts real-world accuracy near 76%. Use detection as one input alongside drafts, version history, and conversation. Never as a standalone verdict about a student.

How can I make my writing pass as human?

Focus on structure, not synonyms. The strongest tell is sentence-length variance: human prose runs a standard deviation of 10 to 15 words, while AI clusters at 3 to 7, so put a four-word sentence next to a thirty-word one. Cut transition adverbs below 0.5% of words and add specific, unpredictable detail. No tool guarantees a result, but varied rhythm moves the read far more than swapping flagged words ever does.

What Triggers AI Detectors? 7 Patterns From 2026 Data

Q: Why is my text flagged as AI when I wrote it myself?

Because detectors measure patterns, not authorship. They reward unpredictable word choice and varied sentence lengths. If you write in a clean, even, plainspoken style, your text can score as low-perplexity, the same signal AI produces. Same look, different author. A 2023 Stanford study found detectors flagged 61.3% of essays by non-native English speakers as AI, versus 5.1% for US-born writers. A flag is a probability, not proof you used a machine.

Q: Does humanizing AI text actually work?

Sometimes, with no guarantees. Tools that only swap synonyms shuffle surface vocabulary while leaving sentence-length variance and burstiness untouched, so a good detector still reads the structure. What moves a score is genuine restructuring: varying sentence lengths, widening vocabulary, adding specific detail. Weber-Wulff et al. (2023) found detectors grow unreliable once text is meaningfully edited, which cuts both ways. Anyone promising you'll definitely pass is selling certainty that doesn't exist. Treat any tool as one help, not a guarantee.

Human writing breathes. You write a long, winding sentence that piles clause on clause and runs out almost where you started, and then you stop. Short. The standard deviation of your sentence lengths lands somewhere around 10 to 15 words. AI text doesn't do that. It stays flat, every sentence near the same length, and it stays flat even after a humanizing pass. Once you see writing the way a detector sees it, the flags stop feeling random. They start looking like fingerprints, not noise. So here's the honest version. What the data shows, where the numbers come from, and why your own writing sometimes gets caught in the net.

What actually triggers an AI detector?

AI detectors are triggered mainly by two things: how predictable your next word is (perplexity) and how much your sentence lengths vary (burstiness). Detectors don't understand what you wrote. They measure those two signals above all else, and everything else is a refinement of them.

Human writing is messy in a very specific way. You ramble, then clip yourself off. You start a sentence with But. You repeat a word because you forgot you used it two lines up. You reach for an odd verb. Models, especially ones tuned with reinforcement learning from human feedback, smooth all of that out. They produce text that's clean, even, and slightly too consistent. That consistency is the signal.

So when a detector flags something, it isn't accusing you of anything. It's reporting that your text looks statistically more like the patterns it learned from machine output than from human output. That's a probability, not a verdict. And as you'll see, the math gets people wrong all the time.

Where these numbers come from

These numbers come from HumanGPT's own validated detector plus published outside research, not guesswork. Here's the methodology behind them, in plain terms.

Our own detector, validated

HumanGPT runs an in-house AI-text detector, version 5, validated at a held-out AUROC of 0.987. In practice that means it catches about 95% of AI-generated text when you set the false-positive rate at 5%, and 77% when you tighten the false-positive rate to 1%. Held-out means the validation text was never used in training, so the score reflects real performance on text the model hadn't seen, not memorization.

A consensus, not a single oracle

No single detector is right all the time, and we don't pretend ours is. Alongside our own model we run a multi-detector consensus and analyze the stylometric features of the text we process: sentence-length distributions, lexical diversity, transition-word rates, and word frequencies. When several independent signals agree, the read is far more trustworthy than any one of them alone. When they disagree, that disagreement tells you how borderline a piece of text really is.

Outside research we leaned on

Many of the feature rankings line up with the Pangram Labs "DAMAGE" paper (arXiv:2501.03437, January 2025), which studied which stylometric signals survive even after AI text is run through a humanizer. The false-positive and bias figures come from a 2023 Stanford study led by Weixin Liang. Competitor accuracy numbers come from Originality.ai's reported RAID benchmark results, Scribbr's 2024 third-party testing, and Weber-Wulff et al. (2023). Where a number comes from someone else's work, we say so.

The 7 patterns that trigger AI detectors, ranked

Here are the seven signals that separate human from AI text most cleanly, ranked by how reliably they hold up, including after a humanizing pass. The ranges are real measurements from our detector and the DAMAGE paper. Read the table, then we'll talk about which ones matter most.

A notebook page showing seven small dial gauges beside a ruler, an open book, a fountain pen, and a metronome, standing in for the signals an AI detector reads. — What a detector actually measures: sentence rhythm, length, vocabulary, and how predictable your next word is.

Rank	Pattern	What the AI habit looks like	The human fix
1	Sentence-length variance	AI sentences cluster tight, a standard deviation of about 3 to 7 words, even after humanizing.	Human prose runs a standard deviation of 10 to 15 words. Put a 4-word sentence next to a 30-word one.
2	Lexical diversity (MTLD)	AI and humanized AI plateau around an MTLD of 75 to 90 over a passage.	Human prose over 250 words scores about 120 to 160. Use a wider, less repetitive vocabulary.
3	Transition-adverb rate	RLHF-tuned models use however, moreover, furthermore, thus in 1% to 5% of all words.	Real human prose keeps these under 0.5% of words. Cut most of them and the rhythm loosens up.
4	AI focal vocabulary	Words that spiked after ChatGPT (delve, tapestry, realm, leverage) appear 2 to 8 times per 1000 words.	Humans use them about 0.5 times per 1000 words. If a word feels like a press release, swap it.
5	Burstiness	AI sits below the human band, too even, with little variation around its mean sentence length.	Human writing lands roughly in a 0.6 to 1.2 band of variation-to-mean. Let your pacing wobble.
6	Sentence-initial conjunctions	Models almost never open a sentence with But, And, or So.	Humans do this 5% to 12% of the time. Starting a sentence with And is fine. Honestly, it helps.
7	Low perplexity	AI text is far more predictable word to word. This is the original GPTZero signal.	Human word choice is less predictable. Specific detail and the occasional odd phrasing raise perplexity.

Which fixes move the needle most?

Not all seven patterns carry equal weight. If you only changed one thing, you'd want to know which one.

Variance beats vocabulary

Swapping out a few flagged words feels productive, but it barely moves the score. Word-swapping only touches the surface. The skeleton underneath stays exactly where it was. Sentence-length variance is the strongest residual signal in the data, and it survives almost every humanizing pass we tested. If your sentences all breathe at the same length, a smarter vocabulary won't save you. You're moving signals one, five, and seven all at once when you vary the lengths, because they're three views of the same underlying property.

Rhythm and diversity together

The two fixes that actually shift the read are sentence-length variance (signal 1) and lexical diversity (signal 2). They're related. When you vary your sentence rhythm, you naturally pull in a wider range of words, because long sentences and short ones want different vocabulary. Get those two right and the transition-adverb rate and burstiness tend to fall into a human range on their own. That's where the real payoff sits.

Why does my own writing get flagged as AI?

This is the part nobody warns you about, and it's the part that hurts. Detectors get human writing wrong, and they get it wrong unevenly. A 2023 Stanford study led by Weixin Liang found that GPT detectors flagged 61.3% of TOEFL essays written by non-native English speakers as AI-generated, compared with just 5.1% for essays written by US-born students. Read that again. Same task, wildly different treatment, and the difference tracks with how you learned English, not whether you used a machine.

The reason is mechanical, not moral. Detectors reward unpredictability. Writers using a second language often lean on simpler, more common words and steadier sentence structures, which reads as low perplexity, the same signal AI produces. Same look, different author. So clean, careful, plainspoken human writing can score as machine-like. If English isn't your first language, or you just write in a tidy, even style, you're more exposed to a false flag through no fault of your own. That's a limitation of the tools, and any honest detector vendor should say so out loud.

Does humanizing AI text actually work, or just shuffle the patterns?

Honest answer: it depends what the humanizer changes, and nothing about it is guaranteed. A lot of tools just swap synonyms and call it a day. That shuffles the surface vocabulary while leaving the deep patterns, sentence-length variance and burstiness, exactly where they were. The DAMAGE paper makes this point clearly: several stylometric signals survive humanizing because they live in the structure of the text, not its word choices. If a humanizer only touches words, a good detector still reads the rhythm underneath.

What does move the score is genuinely restructuring the writing: changing sentence lengths, breaking up uniform paragraphs, widening vocabulary, and adding the specific, slightly unpredictable detail a person would include. That's harder than synonym-swapping, and it's why results vary. Weber-Wulff et al. (2023) found detectors become unreliable once text is meaningfully edited or translated, which cuts both ways. It means humanizing can change a read, and it also means no tool can promise an outcome. Anyone who guarantees you'll "pass" is selling certainty that doesn't exist.

How the major AI detectors compare

Detector	What it keys on most	False-positive risk	Notes
GPTZero	Perplexity and burstiness	Moderate to high on clean human text	Built by Edward Tian at Princeton in January 2023. The tool that put perplexity on the map.
Turnitin	Sentence-level prediction patterns	Moderate, and consequential in academic settings	Shipped AI detection in April 2023. Integrated into existing plagiarism workflows.
Originality.ai	Trained classifier on AI vs human corpora	Lower on its own benchmarks, real-world varies	Reports about 96.7% accuracy on the RAID benchmark for paraphrased and humanized text.
Copyleaks	Classifier plus stylometric features	Moderate, varies by text type	Third-party tests show accuracy drops once text is edited or translated.
HumanGPT detector v5	Stylometric features plus multi-detector consensus	Tunable: 95% catch at 5% FPR, 77% at 1% FPR	Held-out AUROC 0.987. Built to report a probability, not a verdict, and to flag uncertainty.

One caveat reframes that whole table. Originality's 96.7% is measured on the RAID benchmark, which is a controlled dataset, not your professor's inbox. Scribbr's 2024 third-party test put real-world detector accuracy closer to 76%. That gap between lab and life is the whole story. Vendor accuracy numbers come from each tool's own benchmarks and aren't directly comparable, so treat any single percentage with care and never lean on one tool alone.

What this means for writers, students, and teams

What it means depends on your role, but the through-line is the same: a detector score is a probability, never a verdict. Here's how that lands for three groups.

If you're a student

A detector flag is a probability, not proof. If your own work gets flagged, that's not rare, and it's more likely if you write in plain, even English or learned the language later. Keep your drafts, version history, and notes. They're the strongest evidence you actually wrote the thing. And if you write in tidy, uniform sentences, know that the style itself can read as machine-like.

If you're a writer or editor

The patterns that flag AI are mostly the patterns of dull writing. Even sentence lengths, narrow vocabulary, a pile of howevers and moreovers. Fixing them makes your prose better whether or not a detector is watching. Vary your rhythm. Cut the transition adverbs. Use the specific word, not the safe one.

If you run a team

Don't make a single detector score a disciplinary trigger. The false-positive data, 61.3% versus 5.1% across writer backgrounds, should worry anyone using these tools to make decisions about people. And with real-world accuracy near 76%, roughly one in four calls can be wrong in the field. Use detection as one input, weighted alongside context, drafts, and conversation. Never as a verdict on its own.

The bottom line

AI detectors measure patterns, not honesty. The clearest tells are structural: how much your sentences vary, how diverse your vocabulary is, how predictable your next word is. The same machinery that catches AI also catches careful, plainspoken, second-language human writers, which is why no score should ever stand alone as proof. Understand what these tools measure and the flags stop being mysterious. They become something you can read. If you want to see how your own text scores, you can try HumanGPT free, 200 words a day, no signup required.

What Triggers AI Detectors? 7 Patterns From 2026 Data

What actually triggers an AI detector?

Where these numbers come from

Our own detector, validated

A consensus, not a single oracle

Outside research we leaned on

The 7 patterns that trigger AI detectors, ranked

Which fixes move the needle most?

Variance beats vocabulary

Rhythm and diversity together

Why does my own writing get flagged as AI?

Does humanizing AI text actually work, or just shuffle the patterns?

How the major AI detectors compare

What this means for writers, students, and teams

If you're a student

If you're a writer or editor

If you run a team

The bottom line

Frequently asked questions

01Why is my text flagged as AI when I wrote it myself?

02What is perplexity in AI detection?

03Are non-native English writers flagged more often?

04Does changing a few words bypass AI detection?

05Which AI detector is the most accurate?

06What does AUROC 0.987 actually mean?

07What is burstiness in writing?

08Why do detectors flag transition words like 'however' and 'moreover'?

09Does humanizing AI text actually work?

10Can teachers rely on AI detectors to catch cheating?

11How can I make my writing pass as human?