Convert a Text List to Binary

Per-character binary encoder

Each character in each non-blank line becomes a binary number of at least 8 bits, space-separated within a line. Lines stay on their own lines so the output lines up one-to-one with the input. `a` (97) becomes `01100001`; a space (32) becomes `00100000`. Code points above 255 produce more than 8 bits (see next paragraph).

ASCII characters (0-127) fit cleanly in 8 bits. Latin-1 characters (128-255) also fit in 8 bits - `é` (233) becomes `11101001`. Above 255 - CJK characters like `中` (20013), mathematical symbols, most emoji - the binary is wider than 8 bits to fit the bigger number. The 8-bit padding is a minimum; it never truncates.

The encoding is per-character, not per-byte - `中` produces the raw binary of its code point (15 bits), not its UTF-8 byte sequence (3 bytes = 24 bits). For a byte-exact encoding, you need to UTF-8-encode the text first in another tool, then run the bytes through here.

How to use convert a text list to binary

1Paste your text into the input panel, one string per line
2Each character becomes at least 8 binary digits (wider for characters above code point 255), space-separated within a line
3Each input line stays on its own line - so one string per row stays paired up
4Copy the binary, or download it as a .txt file from the output panel
5Reverse the direction with Convert binary to text

Keyboard shortcuts

Drive ListShift without touching the mouse.

Shortcut	Action
`Ctrl` `Z`	Undo last input change
`Ctrl` `Shift` `Z`	Redo
`Ctrl` `Shift` `Enter`	Toggle fullscreen focus on the editor
`Esc`	Exit fullscreen
`Ctrl` `K`	Open the command palette to jump to any tool
`Ctrl` `S`	Save current pipeline draft Plus
`Ctrl` `P`	Run a saved pipeline Plus

What this encoder actually does

No options - paste text, get binary per character (at least 8 bits, wider for characters above code point 255).

8-bit output for ASCII and Latin-1

Every character with code point 0-255 produces exactly 8 binary digits, left-padded with zeros where needed. `a` → `01100001`; space → `00100000`; `é` → `11101001`.

Wider output for higher code points

Characters above 255 (CJK, most emoji, mathematical symbols) produce as many bits as needed for their code point - no truncation. `中` → 15 bits. If exact 8-bit bytes matter, UTF-8-encode the text first.

Per-character, not per-byte

`中` is emitted as the raw binary of its code point (20013 → 15 bits), NOT its UTF-8 byte sequence (`11100100 10111000 10101101`). For byte-accurate UTF-8 binary, encode to bytes elsewhere first.

Line-preserving layout

Groups within a line are space-separated; newlines between input lines stay as newlines. Three input lines produce three output lines - keeps the decoder round-trip unambiguous.

Blank lines dropped

Empty and whitespace-only input lines do not produce empty output rows. A trailing blank in the paste does not leave a dangling empty line.

Worked example

Each character of each line becomes at least 8 binary digits (all sample characters are ASCII, so they fit in exactly 8); characters separated by spaces, lines preserved. Characters above code point 255 would produce wider groups.

Input

apple
banana
cherry

Output

01100001 01110000 01110000 01101100 01100101
01100010 01100001 01101110 01100001 01101110 01100001
01100011 01101000 01100101 01110010 01110010 01111001

Behavior reference

No user options. These are the fixed rules the encoder applies.

Rule	What it does	Example
Per-character encoding	Each character becomes one binary group (8 bits minimum)	`a` (97) → `01100001`
8-bit minimum padding	Values below 256 are padded with leading zeros to reach 8 digits	`\n` (10) → `00001010`, not `1010`
Non-Latin-1 widening	Values above 255 produce more than 8 digits (padding is a minimum, not a cap)	`中` (20013) → `100111000101101` (15 bits)
Group separator	Space between every encoded character within a line	`ab` → `01100001 01100010`
Line separator	Newlines between input lines are preserved in the output	Two input lines always produce two output lines
Blank lines (automatic)	Empty and whitespace-only input lines are skipped	Trailing blank line produces no extra output row

FAQ

Are characters always encoded as exactly 8 bits?

Values 0-255 yes, higher code points no. The 8-bit padding is a minimum - it pads with leading zeros to reach 8 but never truncates. `é` (233) fits in 8 bits; `中` (20013) and most emoji produce more. For byte-exact output, UTF-8-encode the text first.

What happens to non-ASCII characters like `é` or `中`?

`é` (U+00E9 = 233) becomes `11101001` (8 bits). `中` (U+4E2D = 20013) becomes `100111000101101` (15 bits) - the raw binary of the code point, NOT its UTF-8 byte sequence. For UTF-8 byte encoding, convert to bytes elsewhere first.

Why is there a space between every character?

Unambiguous decoding. Without spaces, `01100001 01100010` would collide with `01100001 011000 10` and the reverse direction could not tell groups apart. Space-separated groups make the decoder work with any group length, not just 8-bit.

What happens to blank lines in my input?

They are skipped. A blank input line does not produce an empty output row. If you want to preserve them as empty rows, insert a placeholder character in your source.

Can I decode binary back into text?

Yes, use Convert binary to text - it accepts any whitespace-separated binary groups (not just 8-bit) and reverses the encoding per character.

Also known as

text to binary binary conversion encode text convert list text encoding binary list ASCII binary text to bits

Per-character binary encoder

How to use convert a text list to binary

Keyboard shortcuts

What this encoder actually does

8-bit output for ASCII and Latin-1

Wider output for higher code points

Per-character, not per-byte

Line-preserving layout

Blank lines dropped

Worked example

Behavior reference

FAQ

Also known as

Explore another workspace

Sort

Count

Filter

Clean

Compare