0xxxxxxx
110xxxxx
10xxxxxx
1110xxxx
11110xxx
The 127 1-byte codes are compatible with ASCII
The 2048 2-byte codes include most Latin-script alphabets
The 65536 3-byte codes include most Asian languages
The 2097152 4-byte codes include symbols and emojis and ...