Find duplicate words in a list

Find duplicate words across an entire text - the op splits on whitespace (`/\s+/`) across the full input, not per line. Output is `count<tab>word` for each word appearing 2+ times, sorted by frequency descending. Single-occurrence words are dropped. Case-insensitive by default.

Input
Ready
Output
Live

Whitespace-split word frequency with duplicates-only filter

The op uses `input.split(/\s+/)` which splits on any whitespace run - spaces, tabs, newlines. The whole input is treated as one stream of words, so line boundaries are irrelevant. For line-based counting, use Find most frequent instead.

Words appearing exactly once are filtered out - only duplicates (count ≥ 2) make the output. Each line of output is `count\tword`, tab-separated for paste-into-spreadsheet compatibility.

Sort by Frequency (default) lists most-common first with alphabetical tiebreak. Sort by Alphabetical orders A-Z ignoring counts. Case sensitive off (default) merges `Apple` and `apple` under the lowercase key.

How to use find duplicate words in a list

  1. 1Paste text into the input panel - spaces, tabs, newlines all split into words
  2. 2Toggle Case sensitive to distinguish cased variants (default: off)
  3. 3Pick Sort by: Frequency (default, most common first) or Alphabetical
  4. 4Output is `count<tab>word` per line for words appearing 2+ times
  5. 5For single-occurrence words too, use Find most frequent

Keyboard shortcuts

Drive ListShift without touching the mouse.

Shortcut Action
Ctrl ZUndo last input change
Ctrl Shift ZRedo
Ctrl Shift EnterToggle fullscreen focus on the editor
EscExit fullscreen
Ctrl KOpen the command palette to jump to any tool
Ctrl SSave current pipeline draft Plus
Ctrl PRun a saved pipeline Plus

What this tool actually does

Whitespace-split word tally, filter to count ≥ 2.

Splits on any whitespace run

Regex `/\s+/` catches spaces, tabs, and newlines in any combination. Line boundaries are ignored - `word1 word2\nword3` tokenizes to three words.

Only duplicates emitted

Words appearing exactly once are dropped. Only words with count ≥ 2 appear in the output. For full frequency tables including singletons, use Find most frequent.

Tab-separated output

Each line is `count<tab>word`. Pastes cleanly into Excel / Sheets as two columns.

Case-insensitive by default

Words lowercased before tallying. `Apple`, `apple`, `APPLE` all count together as three occurrences of `apple`. Toggle Case sensitive to separate them.

No stopword filter

Every duplicate word is listed - including `the`, `and`, `of`. Chain Filter before or after to drop stopwords.

Worked example

Seven words total; `apple` appears 3 times, `banana` appears 2 times. Single-occurrence words (`grape`, `orange`) are filtered out.

Input
apple banana apple grape
banana orange apple
Output
3	apple
2	banana

Settings reference

How each option shapes the output using the sample above.

Setting What it does Effect on the sample
Sort by: Frequency (default) Most common duplicate first, alphabetical tiebreak `3\tapple` / `2\tbanana`
Sort by: Alphabetical A → Z regardless of count `3\tapple` / `2\tbanana` (same here; different if more dupes)
Case sensitive: on Separates cased variants `Apple` and `apple` become two entries if present
Single-occurrence words (automatic) Filtered out `grape` and `orange` dropped

FAQ

How does the 'Case sensitive' option work?
When enabled, words with different cases are treated as distinct. 'Apple' and 'apple' would be separate.
Can I download the results?
Yes, you can download the duplicate words as a plain text file for further use.
How are duplicates sorted?
Duplicates can be sorted alphabetically or by frequency, depending on your selection.
What input formats are supported?
You can input plain text lists. The tool processes words separated by spaces or line breaks.