Whitespace-split word frequency with duplicates-only filter
The tool uses `input.split(/\s+/)`, which splits on any run of whitespace: spaces, tabs, or newlines. The whole input is treated as one stream of words, so line boundaries are irrelevant. For line-based counting, use Find most frequent instead.
Words that appear exactly once are filtered out; only duplicates (count ≥ 2) reach the output. Each output line is `count\tword`, tab-separated so it pastes cleanly into a spreadsheet.
Sort by Frequency (default) lists most-common first with alphabetical tiebreak. Sort by Alphabetical orders A-Z ignoring counts. Case sensitive off (default) merges `Apple` and `apple` under the lowercase key.
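The behavior described above can be approximated in a few lines. This is a minimal sketch, not the tool's actual source: the function name `findDuplicateWords`, its options object, the dropping of empty tokens, and the code-unit string comparison used for alphabetical order are all assumptions made for illustration.

```javascript
// Hypothetical helper: the name, option names, and defaults are assumptions.
function findDuplicateWords(input, { caseSensitive = false, sortBy = "frequency" } = {}) {
  const counts = new Map();
  // Split on whitespace runs; skip empty tokens caused by leading/trailing whitespace
  // (assumed behavior, the tool may handle this differently).
  for (const token of input.split(/\s+/)) {
    if (token === "") continue;
    const key = caseSensitive ? token : token.toLowerCase();
    counts.set(key, (counts.get(key) ?? 0) + 1);
  }
  // Keep only words seen at least twice.
  const dupes = [...counts].filter(([, n]) => n >= 2);
  // Plain code-unit comparison (assumption; a locale-aware compare is also plausible).
  const alpha = (a, b) => (a < b ? -1 : a > b ? 1 : 0);
  dupes.sort(([wa, na], [wb, nb]) =>
    sortBy === "alphabetical" ? alpha(wa, wb) : nb - na || alpha(wa, wb)
  );
  // One "count<TAB>word" entry per line.
  return dupes.map(([word, n]) => `${n}\t${word}`).join("\n");
}

console.log(findDuplicateWords("apple banana apple grape banana orange apple"));
```

Using a `Map` keeps the tally in a single pass, and the sort runs only over the words that survive the count ≥ 2 filter.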
How to use Find duplicate words in a list
1. Paste text into the input panel - spaces, tabs, newlines all split into words
2. Toggle Case sensitive to distinguish cased variants (default: off)
3. Pick Sort by: Frequency (default, most common first) or Alphabetical
4. Output is `count<tab>word` per line for words appearing 2+ times
5. For single-occurrence words too, use Find most frequent
What this tool actually does
Whitespace-split word tally, filter to count ≥ 2.
Splits on any whitespace run
Regex `/\s+/` catches spaces, tabs, and newlines in any combination. Line boundaries are ignored - `word1 word2\nword3` tokenizes to three words.
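To see the tokenization concretely, here is a short sketch using the standard `String.prototype.split`. It also shows a general JavaScript edge case: leading or trailing whitespace produces an empty string at that end of the array. Whether the tool strips those empty tokens is not stated here, so the `filter` step is an assumption.

```javascript
// Mixed separators collapse into single split points.
const tokens = "word1 word2\nword3".split(/\s+/);
console.log(tokens.length); // 3

// Leading/trailing whitespace leaves an empty string at each end.
const padded = "  a\t\tb \n".split(/\s+/);
console.log(padded); // [ '', 'a', 'b', '' ]

// Assumed cleanup step: drop the empty tokens before counting.
const cleaned = padded.filter((t) => t !== "");
console.log(cleaned); // [ 'a', 'b' ]
```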
Only duplicates emitted
Words appearing exactly once are dropped. Only words with count ≥ 2 appear in the output. For full frequency tables including singletons, use Find most frequent.
Tab-separated output
Each line is `count<tab>word`. Pastes cleanly into Excel / Sheets as two columns.
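As an illustration of the output shape, the sketch below formats a list of pairs as tab-separated lines and then splits them back into columns, which is roughly what a spreadsheet does on paste. The `entries` array is hypothetical sample data, not something the tool exposes.

```javascript
// Hypothetical [word, count] pairs, as a tally might produce them.
const entries = [["apple", 3], ["banana", 2]];

// Count first, then a tab, then the word; one entry per line.
const output = entries.map(([word, count]) => `${count}\t${word}`).join("\n");

// Splitting each line on the tab recovers the two columns.
const rows = output.split("\n").map((line) => line.split("\t"));
console.log(rows); // [ [ '3', 'apple' ], [ '2', 'banana' ] ]
```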
Case-insensitive by default
Words are lowercased before tallying. `Apple`, `apple`, `APPLE` all count together as three occurrences of `apple`. Toggle Case sensitive to separate them.
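The effect of the toggle can be sketched with a small tally function. The name `tally` and its boolean parameter are assumptions for illustration, not the tool's API.

```javascript
// Hypothetical helper: counts words, optionally folding case first.
function tally(words, caseSensitive) {
  const counts = new Map();
  for (const w of words) {
    const key = caseSensitive ? w : w.toLowerCase();
    counts.set(key, (counts.get(key) ?? 0) + 1);
  }
  return counts;
}

const words = ["Apple", "apple", "APPLE"];
const merged = tally(words, false);  // one key: 'apple' with count 3
const separate = tally(words, true); // three keys, each with count 1
console.log(merged.get("apple"), separate.size); // 3 3
```

Note that with Case sensitive on, each variant in this sample occurs only once, so under the count ≥ 2 rule none of them would appear in the output.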
No stopword filter
Every duplicate word is listed - including `the`, `and`, `of`. Chain Filter before or after to drop stopwords.
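If you post-process the output in your own script rather than chaining Filter, a stopword pass might look like the sketch below. The `STOPWORDS` set and the `dropStopwords` function are hypothetical; the three-word list is illustrative only.

```javascript
// Illustrative stopword list; a real one would be much longer.
const STOPWORDS = new Set(["the", "and", "of"]);

// Hypothetical helper: removes "count<TAB>word" lines whose word is a stopword.
function dropStopwords(tsv, stopwords = STOPWORDS) {
  return tsv
    .split("\n")
    .filter((line) => !stopwords.has(line.split("\t")[1]))
    .join("\n");
}

const filtered = dropStopwords("5\tthe\n3\tapple\n2\tof");
console.log(filtered); // "3\tapple"
```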
Worked example
Seven words total; `apple` appears 3 times, `banana` appears 2 times. Single-occurrence words (`grape`, `orange`) are filtered out.
Input: `apple banana apple grape banana orange apple`
Output (tab-separated, one entry per line):
`3\tapple`
`2\tbanana`
Settings reference
How each option shapes the output using the sample above.
| Setting | What it does | Effect on the sample |
|---|---|---|
| Sort by: Frequency (default) | Most common duplicate first, alphabetical tiebreak | `3\tapple` / `2\tbanana` |
| Sort by: Alphabetical | A → Z regardless of count | `3\tapple` / `2\tbanana` (same order here; differs when a word later in the alphabet has the higher count) |
| Case sensitive: on | Separates cased variants | `Apple` and `apple` become two entries if present |
| Single-occurrence words (automatic) | Filtered out | `grape` and `orange` dropped |
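
Because the sample produces the same order under both sort modes, a different data set shows the distinction better. This sketch assumes plain code-unit string comparison for the alphabetical order; the comparator names and the `sample` data are hypothetical.

```javascript
// Entries are [word, count] pairs.
const byAlpha = (a, b) => (a[0] < b[0] ? -1 : a[0] > b[0] ? 1 : 0);
// Higher count first; fall back to alphabetical order on ties.
const byFrequency = (a, b) => b[1] - a[1] || byAlpha(a, b);

const sample = [["pear", 2], ["zebra", 4], ["apple", 2]];
const freqOrder = [...sample].sort(byFrequency).map(([w]) => w);
const alphaOrder = [...sample].sort(byAlpha).map(([w]) => w);

console.log(freqOrder);  // [ 'zebra', 'apple', 'pear' ]
console.log(alphaOrder); // [ 'apple', 'pear', 'zebra' ]
```

Copying the array with `[...sample]` before sorting keeps the original order intact, since `Array.prototype.sort` sorts in place.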