12
Strings
(cont)
UTF-8
Most common Unicode standard
Specifies mapping of around 150,000 characters
ASCII-compatible
0b
0
xxxxxxx
ASCII letters 0-127
Two, three and four byte long characters
0b
110
xxxxxxxxxxxxx
most Latin-script alphabets
0x
E
XXXXX
Chinese, Japanese, Korean characters
0x
F
XXXXXXX
mathematical symbols, emojis
unicode.org/emoji/charts/full-emoji-list.html