~150 word-level tells across 7 categories: LLM verbs, cliché metaphors, intensifiers, sycophancy, weasels, connectors, spike words.

AI Slop Vocabulary — The Cut List

The phrases below are among the most recognizable AI tells. Each one is documented in published research, AI-detector vendor methodology, or community-sourced lists with high agreement. The mechanical scanner (scripts/scan.py) flags each instance; this file is the human-readable reference with replacements and rationale.

Use this as a search-and-destroy list during a final pass. For each hit, ask: does this word survive the context? In almost every case, the answer is no — cut it.

How severity works

  • H (always cut): the phrase is essentially never the right choice. Cut without exception unless ironic / scare-quoted.
  • M (usually cut): the phrase survives in narrow bands. Default is to cut; keep only if doing specific work.
  • L (context-dependent): weak tell on its own. Note in audit but don't down-score.

Density matters more than individual instances. See calibration.md.
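A minimal sketch of how tiers and density could be tracked together. This is not the code in scripts/scan.py: the `CUT_LIST` excerpt, the function names, and the per-100-words measure are all assumptions made for illustration, and the real thresholds live in calibration.md.

```python
import re
from collections import Counter

# Hypothetical excerpt of the cut list: term -> severity tier (H, M, or L).
CUT_LIST = {
    "delve": "H",
    "leverage": "H",
    "crucial": "H",
    "roadmap": "M",
    "craft": "L",
}

def scan(text):
    """Return one (term, severity) pair per whole-word, case-insensitive match."""
    hits = []
    for term, severity in CUT_LIST.items():
        pattern = re.compile(r"\b" + re.escape(term) + r"\b", re.IGNORECASE)
        hits.extend((term, severity) for _ in pattern.finditer(text))
    return hits

def density_per_100_words(text, hits):
    """Count H and M hits per 100 words; L hits are noted but not scored."""
    words = len(text.split())
    scored = sum(1 for _, severity in hits if severity in ("H", "M"))
    return 100.0 * scored / words if words else 0.0

sample = "We leverage a crucial roadmap to craft crucial plans."
hits = scan(sample)
tiers = Counter(severity for _, severity in hits)
```

Whole-word matching on base forms misses inflections such as "delves" or "leveraging"; a fuller list would add those variants explicitly.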


2A. LLM-favored verbs

These are the verbs models reach for first. They sound active and serious without committing to a specific action. Most can be swapped for a one-syllable Anglo-Saxon verb that does the same work with less smell.

| Word/Phrase | Why it's a tell | Replacement | Severity |
|---|---|---|---|
| delve / delve into | "Delves" appeared 6,697%+ more in 2024 PubMed vs 2020; the flagship AI verb | look at, get into, study | H |
| leverage | Corporate cliche; appears in every blacklist | use | H |
| harness | Rare in human prose, ubiquitous in AI | use, channel | H |
| foster | Corporate-NGO speak | encourage, build | H |
| empower | Marketing fluff | help, give X to | H |
| unlock | "Unlock the potential of" is a top-10 GPT phrase | reveal, find | H |
| elevate | Empty intensifier | raise, lift, improve | H |
| streamline | McKinsey-speak | simplify, cut | H |
| revolutionize | Hyperbole baseline | change | H |
| transform | Same | change, rebuild | H |
| underscore / underscores | 904% spike post-ChatGPT (PubMed study) | show, prove | H |
| illuminate | Decorative academic | clarify, show | H |
| navigate | "Navigate the complexities of": top-10 cliche | handle, manage, work through | H |
| garner | Spike-word in academic writing studies | get, win, attract | H |
| utilize | Sounds smart, means "use" | use | H |
| facilitate | Same | help | H |
| optimize | Tech-jargon default | improve, tune | H |
| enhance | Generic intensifier verb | improve, sharpen | H |
| embark / embark on | "Embark on a journey" is iconic AI | start | H |
| showcase / showcasing | 9.2x more frequent in AI than human (GPTZero) | show, display | H |
| boast / boasts | Wikipedia-flagged copula avoidance | has | H |
| demystify | LinkedIn cliche | explain | M |
| ignite | Copy-cliche | start, spark | M |
| supercharge | Marketing-speak | speed up | M |
| unleash | Same | release | M |
| unveil | Press-release verb | reveal, show | M |
| explore | Used to fill space ("we'll explore") | look at, go through | M |
| dive into | "Let's dive into": sycophant cluster | start, look at | H |
| resonate / resonates | Vague impact-word | match, connect | M |
| reverberate | Even more decorative | echo | M |
| transcend / transcends | Often "transcends mere X" | go beyond | M |
| spearhead | Press-release cliche | lead | M |
| reimagine | Tech-deck cliche | redesign | M |
| craft | Overused as a verb | make, write | L |
| pave the way | Cliche metaphor | enable, set up | H |
| shed light on | Cliche | explain, show | H |
shed light onClicheexplain, showH

Audit instruction: any H-tier verb is a hit. For M-tier, swap in the replacement and re-read; if the meaning is unchanged or the sentence is sharper, the verb was AI smell. Cut it. L-tier hits (craft) are noted, not scored.
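The swap-and-re-read step can be set up mechanically, leaving only the judgment to the reader. A rough sketch, assuming a hypothetical `REPLACEMENTS` mapping built from a few rows of the table above; the function name is illustrative and is not part of scripts/scan.py.

```python
import re

# Hypothetical subset of the 2A table: verb -> suggested replacements.
REPLACEMENTS = {
    "utilize": ["use"],
    "unveil": ["reveal", "show"],
    "spearhead": ["lead"],
}

def replacement_variants(sentence):
    """Return one rewritten sentence per suggested replacement for each listed verb found."""
    variants = []
    for verb, options in REPLACEMENTS.items():
        pattern = re.compile(r"\b" + re.escape(verb) + r"\b", re.IGNORECASE)
        if pattern.search(sentence):
            variants.extend(pattern.sub(option, sentence) for option in options)
    return variants
```

Read each variant against the original. The sketch handles base forms only and does not preserve capitalization, so inflected verbs and sentence-initial hits still need a manual pass.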


2B. Cliché metaphors and grandiose nouns

These nouns convert a small subject into an epic one. Models default to them because they are the most common metaphors in the training data; humans use them sparingly because they sound like a press release.

| Word/Phrase | Why it's a tell | Replacement | Severity |
|---|---|---|---|
| tapestry | Iconic AI metaphor | mix, weave (sparingly), or specific noun | H |
| landscape | "The landscape of X": top cliche | field, market, world | H |
| realm | Decorative for "area" | area, field | H |
| beacon | "A beacon of X" | example, leader | H |
| treasure trove | Always cut | collection, source | H |
| symphony | Pretentious metaphor | combination, mix | H |
| journey | Especially "embark on a journey" | path, process | H |
| roadmap | Tech-deck cliche | plan, steps | M |
| ecosystem | Tech-cliche for "industry" | industry, network | M |
| paradigm / paradigm shift | Overused | change, shift | H |
| testament | "A testament to" | proof, evidence | H |
| cornerstone | Cliche | basis, anchor | M |
| crucible | Decorative | test, trial | M |
| labyrinth | Decorative | maze, complexity | M |
| metropolis | Travel-guide cliche | city | M |
| enigma | Decorative | mystery, puzzle | M |
| myriad / a myriad of | Inflated "many" | many, lots of | H |
| plethora | Same | lots of, too many | H |
| kaleidoscope | Decorative metaphor | mix, range | M |
| arena | Cliche metaphor | field | M |
| arsenal | Cliche metaphor | toolkit | M |

Audit instruction: if you find one of these, ask whether the sentence needs a metaphor at all. Most don't. When the answer is yes, pick a domain-specific image — the model defaulted to the most-trained metaphor; you can do better.


2C. Empty intensifiers / hedges / vague adjectives

The biggest category by far. These adjectives are all reach and no grip; they assert importance without earning it. Density is what makes them lethal: one "crucial" survives, three in a paragraph read as machine output.

| Word/Phrase | Why it's a tell | Replacement | Severity |
|---|---|---|---|
| crucial | Top-3 spike word | important, key, or cut | H |
| essential | Same | needed, central | H |
| vital | Same | needed | H |
| pivotal | Same | key, decisive | H |
| paramount | Even more inflated | most important | H |
| profound | "A profound impact" | big, deep | M |
| robust | Tech-cliche | strong, reliable | H |
| seamless | Tech-cliche | smooth | H |
| comprehensive | Used to inflate completeness | full, complete | H |
| holistic | Buzz-word | whole, full | M |
| multifaceted | Always inflated | complex, many-sided | H |
| nuanced | Used as a flex | complex, subtle | M |
| intricate / intricacies | 611% spike post-ChatGPT | complex, detail | H |
| meticulous / meticulously | Top spike word | careful | H |
| compelling | Marketing-cliche | strong, persuasive | M |
| commendable | Pretentious | good, deserving praise | M |
| insightful | Often empty praise | useful, sharp | M |
| invaluable | Hyperbole | useful, important | M |
| unwavering | Always inflated | steady, firm | H |
| transformative | Always inflated | major | H |
| groundbreaking | Always inflated | new, original | H |
| cutting-edge | Cliche | new, current | H |
| state-of-the-art | Same | new, top | H |
| game-changer / game-changing | Cliche | big change | H |
| next-generation | Tech-cliche | new | M |
| future-proof | Tech-cliche | lasting | M |
| dynamic | Vague intensifier | active, fast-changing | M |
| vibrant | Travel-guide cliche | lively, busy | M |
| bustling | Same | busy | M |
| daunting | Cliche | hard, intimidating | M |
| ever-evolving / ever-changing | "In the ever-evolving landscape" | changing, shifting | H |
| ever-expanding | Same family | growing | M |
| timeless | Inflated cliche | lasting | M |
| enduring | Often empty | lasting | M |
| diverse / diverse array of | Empty filler | mixed, varied | M |
| unique blend | Marketing cliche | mix | M |
| fast-paced | "In today's fast-paced..." 107x AI | fast, busy | H |
| hyper-connected | Cliche | connected | M |
| modern / today's | Often filler | now, current | L |

Audit instruction: for each instance, try deleting it. If the sentence is fine or stronger without the word, it was filler. If a replacement is needed, prefer the shortest, most concrete word in the column.
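The deletion test is easy to stage so both versions sit side by side. A small sketch under stated assumptions: `INTENSIFIERS` is a hypothetical four-word sample of this category, and `deletion_candidates` is an illustrative helper, not something the scanner is known to provide.

```python
import re

# Hypothetical sample of category 2C terms; the full list is in the table above.
INTENSIFIERS = ["crucial", "robust", "seamless", "comprehensive"]

def deletion_candidates(sentence):
    """Yield (term, sentence_without_term) for each listed term found in the sentence."""
    for term in INTENSIFIERS:
        pattern = re.compile(r"\b" + re.escape(term) + r"\b\s*", re.IGNORECASE)
        if pattern.search(sentence):
            trimmed = pattern.sub("", sentence)
            # Collapse any doubled spaces left behind by the deletion.
            yield term, re.sub(r"\s{2,}", " ", trimmed).strip()
```

The output is a prompt for the re-read, not a finished edit: articles ("a robust API" becomes "a API") and stranded punctuation still need fixing by hand.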


2D. Sycophantic openers / closers

Direct RLHF artifacts. Every one of these is a model performing helpfulness instead of being helpful. Even when they leak into prose meant for publication, they read as machine-trained politeness.

| Phrase | Why it's a tell | Replacement | Severity |
|---|---|---|---|
| Great question! | RLHF flattery | delete | H |
| Excellent question! | Same | delete | H |
| I'd be happy to help | Same | delete and answer | H |
| Absolutely! | Opener flattery | delete | H |
| Certainly! | Same | delete | H |
| Of course! | Same | delete | H |
| Sure! Here's... | Same | delete the opener, keep "Here's" if needed | H |
| I hope this helps! | Closing flattery | delete | H |
| Let me know if you have any questions | Same | delete | H |
| Feel free to reach out | Same | delete | H |
| Don't hesitate to ask | Same | delete | H |
| Is there anything else I can help you with? | Same | delete | H |
| I hope this answers your question | Same | delete | H |
| Happy to clarify | Same | delete | H |
| Let me know if you'd like me to elaborate | Same | delete | H |

Audit instruction: every instance is a high-severity violation. Cut without exception. End on the last load-bearing sentence; openers go in the trash.
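Because these phrases sit at the edges of a text, trimming them can be sketched as a loop over prefixes and suffixes. The `OPENERS` and `CLOSERS` lists below are a hypothetical subset of the table, with trailing punctuation assumed; the function is illustrative only.

```python
# Hypothetical subset of the 2D phrases; extend from the table above.
OPENERS = ["Great question!", "Excellent question!", "Absolutely!",
           "Certainly!", "Of course!", "Sure!"]
CLOSERS = ["I hope this helps!", "Feel free to reach out.",
           "Let me know if you have any questions."]

def strip_openers_and_closers(text):
    """Remove listed phrases from the start and end of the text until none remain."""
    text = text.strip()
    changed = True
    while changed:
        changed = False
        for phrase in OPENERS:
            if text.lower().startswith(phrase.lower()):
                text = text[len(phrase):].lstrip()
                changed = True
        for phrase in CLOSERS:
            if text.lower().endswith(phrase.lower()):
                text = text[:-len(phrase)].rstrip()
                changed = True
    return text
```

The loop repeats because these phrases stack: "Great question! Sure!" needs two passes at the front. Matching is exact apart from case, so a closer with different punctuation slips through.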


2E. Vague-authority phrases

Wikipedia's number-one content-pattern flag. These phrases assert evidence without citing any. AI defaults to them when it lacks specifics; humans either name a source or admit they're guessing.

| Phrase | Why it's a tell | Replacement | Severity |
|---|---|---|---|
| Studies show | Uncited authority claim | name the study or cut | H |
| Research suggests | Same | name the research or cut | H |
| Many experts agree | Same | name the experts or cut | H |
| Industry reports indicate | Same | name the report or cut | H |
| It is widely understood | Wikipedia-flagged weasel | cut or attribute | H |
| Observers have noted | Anonymous authority | name the observer or cut | H |
| Some critics argue | Same | name the critic or cut | H |
| Generally speaking | Hedge filler | cut | M |
| In many cases | Hedge filler | cut or specify | M |
| It is commonly known | Weasel + filler | cut | M |

Audit instruction: for each hit, either supply the citation (link, name, source) or cut the claim. "I think" beats "experts say" every time.
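One way to triage these hits is to separate claims that already carry a citation from those that do not. The sketch below is a heuristic, not a documented scanner feature: the phrase subset, the `CITATION` pattern (a URL, a bracketed number, or a parenthetical ending in a year), and the naive sentence splitter are all assumptions.

```python
import re

# Hypothetical subset of the 2E phrases, lowercased for matching.
VAGUE_AUTHORITY = ["studies show", "research suggests",
                   "many experts agree", "some critics argue"]

# Assumed citation signals: a URL, a reference like [3], or "(... 2021)".
CITATION = re.compile(r"https?://|\[\d+\]|\(\s*[^()]*\b(19|20)\d{2}\)")

def uncited_claims(text):
    """Return sentences that use a vague-authority phrase with no citation signal."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    flagged = []
    for sentence in sentences:
        lowered = sentence.lower()
        has_phrase = any(phrase in lowered for phrase in VAGUE_AUTHORITY)
        if has_phrase and not CITATION.search(sentence):
            flagged.append(sentence)
    return flagged
```

A sentence that passes this check still deserves a look: a year in parentheses is a signal that a source was named, not proof that the source says what the sentence claims.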


2F. Closing / connector clichés

The signposting words. AI was trained on five-paragraph essays and op-eds; it defaults to scaffolding even in short pieces. Real prose flows; it doesn't announce its turns.

| Phrase | Why it's a tell | Replacement | Severity |
|---|---|---|---|
| In conclusion | Compulsive summary | cut | H |
| To conclude | Same | cut | H |
| In summary | Same | cut | H |
| To summarize | Same | cut | H |
| Overall | Same | cut | H |
| Ultimately | Filler closer | cut | M |
| All things considered | Same | cut | M |
| At the end of the day | Filler | cut | H |
| In essence | Restatement | cut | H |
| To put it simply | Restatement | cut | H |
| In a nutshell | Same | cut | M |
| Furthermore | Stock connector | period or "Also" | H |
| Moreover | Same | period or "Also" | H |
| Additionally | Same | period or "Also" | H |
| First and foremost | Listicle cliche | "First" or cut | H |
| Last but not least | Listicle cliche | "Finally" or cut | H |
| On the other hand | Stock contrast | "But" | M |
| That being said | Stock contrast | "Still" or cut | M |
| With that in mind | Filler transition | cut | M |
| Notably | Throat-clearing | cut | M |
| Indeed | Filler | cut | M |

Audit instruction: most of these are deletable. End sentences with periods, not signposts. If a transition is genuinely needed, "But" / "Also" / "Still" carry their weight without smelling AI.
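A rough sketch of that cut-or-swap rule applied to sentence openers. The `CONNECTORS` mapping is a hypothetical subset of the table, and the helper only fires when the connector opens the sentence and is followed by a comma, an assumption that keeps it from touching uses like "Overall performance improved."

```python
import re

# Hypothetical subset of 2F: sentence-opening connector -> replacement ("" means cut).
CONNECTORS = {
    "In conclusion": "",
    "Overall": "",
    "Furthermore": "Also, ",
    "Moreover": "Also, ",
    "On the other hand": "But ",
    "That being said": "Still, ",
}

def rewrite_connector(sentence):
    """Swap or cut a listed connector when it opens the sentence and is followed by a comma."""
    for phrase, replacement in CONNECTORS.items():
        match = re.match(re.escape(phrase) + r",\s+", sentence, re.IGNORECASE)
        if match:
            rest = sentence[match.end():]
            if replacement:
                return replacement + rest
            # Connector cut outright: re-capitalize the new first word.
            return rest[:1].upper() + rest[1:]
    return sentence
```

Even the swaps are a fallback. If the sentence reads fine with no transition at all, drop the "Also" too.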


2G. Academically-validated spike words

These are the highest-confidence subset in the entire vocabulary list. Each one is statistically validated against pre-2022 baselines via published research; the timing of each spike points to LLMs as the cause of the surge in usage.

| Word/Phrase | Spike data | Source |
|---|---|---|
| delves | 6,697% increase 2020 to 2024 | arXiv |
| underscores | 904% increase | arXiv |
| intricate | 611% increase | arXiv |
| showcasing | r=9.2 ratio AI vs human | arXiv |
| meticulous, meticulously | spike-confirmed | PubMed |
| pivotal | spike-confirmed | PubMed |
| commendable | spike-confirmed | PubMed |
| garnered | top-21 focal word | arXiv 2412.11385 |
| boasts | top-21 focal word | arXiv 2412.11385 |
| groundbreaking | top-21 focal word | arXiv 2412.11385 |
| advancements | top-21 focal word | arXiv 2412.11385 |
| aligns / aligns with | 16x more frequent in AI | GPTZero |
| surpassing | 12x more frequent in AI | GPTZero |
| impacting | 11x more frequent in AI | GPTZero |
| play a significant role in shaping | 182x more frequent in AI | GPTZero |
| today's fast-paced world | 107x more frequent in AI | GPTZero |
| notable works include | 120x more frequent in AI | GPTZero |
| aims to explore | 50x more frequent in AI | GPTZero |
| objective study aimed | 269x more frequent in AI | Atlas |
| research needed to understand | 235x more frequent in AI | Atlas |
A 2024 study of PubMed abstracts estimated that at least 13.5% of 2024 biomedical abstracts were processed with LLMs, based on this excess vocabulary. After viral attention in early 2024, "delve" frequency in arXiv abstracts dropped sharply, which suggests the words function as a fingerprint authors can sand off.

Audit instruction: treat any hit in this category as a strong AI marker. The evidence is statistical, not stylistic: these words are flagged not because they sound bad but because their frequency jumped once LLMs were in wide use. One hit cannot prove authorship, but nothing is lost by removing it. Cut without exception.
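The table mixes two units: percent increases and "Nx more frequent" ratios. The arithmetic below puts them on one scale. The function names are illustrative, and the counts passed to `frequency_ratio` in any real use would have to come from the cited sources; none are supplied here.

```python
def percent_increase_to_ratio(percent_increase):
    """A 100% increase means the word appears twice as often (ratio 2.0)."""
    return 1 + percent_increase / 100

def ratio_to_percent_increase(ratio):
    """A word that is 16x more frequent has risen by 1,500%."""
    return (ratio - 1) * 100

def frequency_ratio(count_after, total_after, count_before, total_before):
    """Ratio of relative frequencies between two corpora of different sizes."""
    return (count_after / total_after) / (count_before / total_before)
```

On this scale a 6,697% increase is roughly a 68x ratio and a 904% increase roughly 10x. The conversion only aligns the units. A year-over-year spike and an AI-versus-human ratio remain different measurements.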


How to use this list

  1. Run the scanner first — scripts/scan.py flags every instance mechanically across all categories.
  2. Walk the always-cut items (Severity H): each hit is a high-severity violation. Cut unless the phrase is ironic or scare-quoted.
  3. Walk the usually-cut items (Severity M): apply judgment. Default is to cut; keep only if doing specific work.
  4. Note the context-dependent items (Severity L) — these flag stylistic register but don't down-score the verdict.
  5. The list isn't exhaustive. New LLM tells emerge over time. If a phrase reads as engineered, it probably is. The default action is to cut.
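The steps above can be sketched as a worklist that orders hits by tier and attaches the default action. The input format (a list of `(term, severity)` pairs), the action labels, and both function names are assumptions; the actual output of scripts/scan.py may look different.

```python
# Hypothetical action labels for each tier, following the steps above.
ACTIONS = {
    "H": "cut",
    "M": "review: cut unless doing specific work",
    "L": "note only",
}
ORDER = {"H": 0, "M": 1, "L": 2}

def build_worklist(hits):
    """Sort (term, severity) hits so H comes first, then M, then L, and attach an action."""
    ranked = sorted(hits, key=lambda hit: ORDER[hit[1]])
    return [(term, severity, ACTIONS[severity]) for term, severity in ranked]

def counts_against_verdict(hits):
    """Return the terms that affect the score; L-tier hits are excluded."""
    return [term for term, severity in hits if severity != "L"]
```

For example, `build_worklist([("craft", "L"), ("roadmap", "M"), ("delve", "H")])` puts "delve" first with the action "cut" and leaves "craft" last as a note.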

The 2G category (academically-validated spike words) is the highest-confidence subset — these are statistically validated against pre-2022 baselines via published research.