[Links]   [Notes]   [ISO 8859-x to Latin y Conversion]   [UTF-8 Bit Distribution]  


Links

Current Site   External *
Character Classes   Miscellaneous
unicode.org
* External links open in new browser window.

Notes


ISO 8859-x to Latin y Conversion

ISO 8859-x   Latin y   Reference
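
No rows of this table survive here, so the following is only a minimal sketch of the x-to-y lookup. It assumes the usual titles of the ISO/IEC 8859 parts ("Latin alphabet No. y") and is not taken from this page. The names ISO_8859_TO_LATIN and latin_name are hypothetical.

```python
# Assumption: mapping taken from the ISO/IEC 8859 part titles, not from this page.
# Key: part number x of ISO 8859-x. Value: number y of the Latin alphabet.
ISO_8859_TO_LATIN = {
    1: 1, 2: 2, 3: 3, 4: 4,   # parts 1-4 are Latin-1 to Latin-4
    9: 5, 10: 6,              # parts 5-8 are not Latin alphabets
    13: 7, 14: 8, 15: 9, 16: 10,
}


def latin_name(part: int) -> str:
    """Return 'Latin-y' for ISO 8859-<part>, or raise ValueError."""
    if part not in ISO_8859_TO_LATIN:
        raise ValueError(f"ISO 8859-{part} does not define a Latin alphabet")
    return f"Latin-{ISO_8859_TO_LATIN[part]}"


print(latin_name(15))  # Latin-9
```

Parts 5 to 8 and 11 (Cyrillic, Arabic, Greek, Hebrew, Thai) have no Latin number, so the sketch rejects them instead of guessing.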

UTF-8 Bit Distribution

Unicode Scalar Value                                         Byte 1      Byte 2      Byte 3      Byte 4
0 0000 0000 0000 0abc defg (0–0x7F)                          0abc defg
0 0000 0000 0abc defg hijk (0x80–0x7FF)                      110a bcde   10fg hijk
0 0000 abcd efgh ijkl mnop (0x800–0xD7FF, 0xE000–0xFFFF *)   1110 abcd   10ef ghij   10kl mnop
a bcde fghi jklm nopq rstu (0x10000–0x10FFFF)                1111 0abc   10de fghi   10jk lmno   10pq rstu
* Code points in the range 0xD800–0xDFFF are reserved for UTF-16 surrogate pairs and do not yield valid UTF-8 sequences, so in the three-byte form abcde must never equal 11011.
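
The bit distribution above can be sketched as shifts and masks. This is an illustrative encoder only, not code from this page. The function name encode_utf8 is hypothetical, and Python's built-in str.encode is used purely as a cross-check.

```python
def encode_utf8(scalar: int) -> bytes:
    """Encode one Unicode scalar value using the bit layout in the table."""
    if not 0 <= scalar <= 0x10FFFF:
        raise ValueError("outside the Unicode range 0-0x10FFFF")
    if 0xD800 <= scalar <= 0xDFFF:
        # The abcde == 11011 case from the footnote.
        raise ValueError("surrogate code points have no UTF-8 form")
    if scalar <= 0x7F:
        return bytes([scalar])                      # 0abc defg
    if scalar <= 0x7FF:
        return bytes([
            0xC0 | (scalar >> 6),                   # 110a bcde
            0x80 | (scalar & 0x3F),                 # 10fg hijk
        ])
    if scalar <= 0xFFFF:
        return bytes([
            0xE0 | (scalar >> 12),                  # 1110 abcd
            0x80 | ((scalar >> 6) & 0x3F),          # 10ef ghij
            0x80 | (scalar & 0x3F),                 # 10kl mnop
        ])
    return bytes([
        0xF0 | (scalar >> 18),                      # 1111 0abc
        0x80 | ((scalar >> 12) & 0x3F),             # 10de fghi
        0x80 | ((scalar >> 6) & 0x3F),              # 10jk lmno
        0x80 | (scalar & 0x3F),                     # 10pq rstu
    ])


# Cross-check one value from each row against the built-in encoder.
for cp in (0x41, 0xE9, 0x20AC, 0x1F600):
    assert encode_utf8(cp) == chr(cp).encode("utf-8")

print(encode_utf8(0x20AC).hex())  # e282ac
```

Each branch corresponds to one row of the table: the lead byte carries the length marker plus the highest bits, and every continuation byte carries six payload bits behind the 10 prefix.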