User:Utopes/unicode

Holding onto the Unicode/Versions article which is currently written like a changelog. Changelogs written as such are not suitable for standalone Wikipedia articles. But, while passing by its AfD on September 2nd, at a glance there seemed to be potentially useful information here to reference during general Wikipedia activities.

Contents below.

This page is about each version specification, and the differences between the versions.

Unicode 1.0
Unicode 1.0 was the first version of Unicode, released October 1991. It encoded 7,161 new characters.

"Blocks"
This version of Unicode did not formally group characters in blocks. But in comparison with version 2.0, the following "blocks" were available: U+0000-U+FFFD 51 Blocks


 * Basic Latin (U+0000-U+007F), containing 128 characters.
 * Latin-1 Supplement (U+0080-U+00FF), containing 128 characters.
 * Latin Extended-A (U+0100-U+017F), containing 127 characters.
 * Latin Extended-B (U+0180-U+01FF), containing 113 characters.
 * IPA Extensions (U+0250-U+02AF), containing 89 characters.
 * Spacing Modifier Letters (U+02B0-U+02FF), containing 57 characters.
 * Combining Diacritical Marks (U+0300-U+036F), containing 66 characters.
 * Greek and Coptic (U+0370-U+03FF), containing 112 characters.
 * Cyrillic (U+0400-U+04FF), containing 192 characters.
 * Armenian (U+0530-U+058F), containing 84 characters.
 * Hebrew (U+0590-U+05FF), containing 52 characters.
 * Arabic (U+0600-U+06FF), containing 169 characters.
 * Devanagari (U+0900-U+097F), containing 104 characters.
 * Bengali (U+0980-U+09FF), containing 89 characters.
 * Gurmukhi (U+0A00-U+0A7F), containing 74 characters.
 * Gujarati (U+0A80-U+0AFF), containing 75 characters.
 * Oriya (U+0B00-U+0B7F), containing 78 characters.
 * Tamil (U+0B80-U+0BFF), containing 61 characters.
 * Telugu (U+0C00-U+0C7F), containing 80 characters.
 * Kannada (U+0C80-U+0CFF), containing 80 characters.
 * Malayalam (U+0D00-U+0D7F), containing 78 characters.
 * Thai (U+0E00-U+0E7F), containing 92 characters.
 * Lao (U+0E80-U+0EFF), containing 70 characters.
 * Tibetan (U+1000-U+105F), containing 71 characters.
 * Georgian (U+10A0-U+10FF), containing 78 characters.
 * General Punctuation (U+2000-U+206F), containing 67 characters.
 * Superscripts and Subscripts (U+2070-U+209F), containing 28 characters.
 * Currency Symbols (U+20A0-U+20CF), containing 11 characters.
 * Combining Marks for Symbols (U+20D0-U+20FF), containing 18 characters.
 * Letterlike Symbols (U+2100-U+214F), containing 57 characters.
 * Number Forms (U+2150-U+218F), containing 48 characters.
 * Arrows (U+2190-U+21FF), containing 91 characters.
 * Mathematical Operators (U+2200-U+22FF), containing 242 characters.
 * Miscellaneous Technical (U+2300-U+23FF), containing 43 characters.
 * Control Pictures (U+2400-U+243F), containing 37 characters.
 * Optical Character Recognition (U+2440-U+245F), containing 11 characters.
 * Enclosed Alphanumerics (U+2460-U+24FF), containing 139 characters.
 * Box Drawing (U+2500-U+257F), containing 128 characters.
 * Block Elements (U+2580-U+259F), containing 22 characters.
 * Geometric Shapes (U+25A0-U+25FF), containing 79 characters.
 * Miscellaneous Symbols (U+2600-U+26FF), containing 106 characters.
 * Dingbats (U+2700-U+27BF), containing 160 characters.
 * CJK Symbols and Punctuation (U+3000-U+303F), containing 56 characters.
 * Hiragana (U+3040-U+309F), containing 90 characters.
 * Katakana (U+30A0-U+30FF), containing 90 characters.
 * Bopomofo (U+3100-U+312F), containing 40 characters.
 * Hangul Compatibility Jamo (U+3130-U+318F), containing 94 characters.
 * Kanbun (U+3190-U+31FF), containing 16 characters.
 * Enclosed CJK Letters and Months (U+3200-U+32FF), containing 191 characters.
 * CJK Compatibility (U+3300-U+33FF), containing 187 characters.
 * Hangul (U+3400-U+3D2D), containing 2,350 characters.
 * Private Use Area (U+E000-U+FDFF), reserved for 5,632 characters.
 * CJK Compatibility Forms (U+FE30-U+FE4F), containing 28 characters.
 * Small Form Variants (U+FE50-U+FE6F), containing 26 characters.
 * Arabic Presentation Forms-B (U+FE70-U+FEFF), containing 140 characters.
 * Halfwidth and Fullwidth Forms (U+FF00-U+FFEF), containing 216 characters.
 * Specials (U+FFF0-U+FFFF), containing 1 character.

Unicode 1.0.1
Unicode 1.0.1 was released June 1992. It encoded 28,365 characters, adding 21,204 new characters, removing 96 characters.

New blocks

 * CJK Unified Ideographs (U+4E00-U+9FFF), containing 20,902 Han Ideographs for Chinese, Japanese and Korean, was added.
 * CJK Compatibility Ideographs (U+F900-U+FAFF), containing 302 Han Ideographs for compatibility with existing character sets, was added.

Removed blocks

 * Tibetan, containing 71 letters for the Tibetan script, was removed from the Unicode standard.

Removed characters

 * Phonetic Order Vowel Signs (total 5 characters) were removed from Thai. (U+0E70-U+0E74)
 * Phonetic Order Vowel Signs (total 5 characters) were removed from Lao. (U+0EF0-U+0EF4)
 * APL Compose Operator and APL Out (total 2 characters) were removed from Miscellaneous Technical. (U+2300-U+2301)
 * (total 7 characters) were removed from Greek and Coptic. (U+03DB, U+03DD, U+03DF, U+03E1, U+0371-U+0372 and U+0374)
 * Letters Ka and Kha with Ogonek (total 4 characters) were removed from Cyrillic. (U+04C5-U+04C6 and U+04C9-U+04CA)
 * An Ideographic Ditto Mark (total 1 character) was removed from CJK Symbols and Punctuation (U+3004) and merged with CJK Unified Ideograph-4EDD.

Rearranged characters

 * Circled Katakana: The characters well be arranged in modern order: e.g., A, I, U, E, O, KA, KI (U+32D0-U+32FE)
 * Basic Glyphs For Arabic Language: The character shapes will be arranged in different order: Isolate, Final, Initial and Medial (U+FE80-FEFC)
 * A Japanese Industrial Standard symbol (〄) was moved from Enclosed CJK Letters and Months (U+32FF) to CJK Symbols and Punctuation. (U+3004)

Characters with semantics changed

 * Zero Width Non-Joiner [ZWNJ] (U+20DC)
 * Zero Width Joiner [ZWJ] (U+20DD)

Unicode 1.1
Unicode 1.1 was released June 1993. It encoded 34,233 characters, adding 5,939 new characters. It finalized the long anticipated Han Unification.

New blocks

 * Hangul Jamo (U+1100-U+11FF), containing 240 jamo for the Hangul script, was added.
 * Latin Extended Additional (U+1E00-U+1EFF), containing 245 precomposed characters for transliteration and Vietnamese, was added.
 * Greek Extended (U+1F00-U+1FFF), containing 233 precomposed characters for polytonic Greek, was added.
 * Hangul Supplementary-A (U+3D2E-U+44B7), containing 1,930 precomposed syllables for the Hangul script, was added.
 * Hangul Supplementary-B (U+44B8-U+4DFF), containing 2,376 precomposed syllables for the Hangul script, was added.
 * Alphabetic Presentation Forms (U+FB00-U+FB4F), containing 57 precomposed characters and ligatures, was added.
 * Arabic Presentation Forms-A (U+FB50-U+FDFF), containing 593 combinations of Arabic letters, was added.
 * Combining Half Marks (U+FE20-U+FE2F), containing 4 halves of diacritical marks, was added.

Extended blocks

 * The long S (ſ) (total 1 character) was added to Latin Extended-A. (U+017F)
 * The Hungarian Dz, characters for transliteration purposes and precomposed characters with double grave and inverted breve (total 35 characters) were added to Latin Extended-B (U+01F1-U+01F5 and U+01FA-U+0217). The block was expanded from (U+0180-U+01FF) to (U+0180-U+024F)
 * Diacritics for polytonic Greek and double width diacritics (total 6 characters) were added to Combining Diacritical Marks. (U+0342-U+0345 and U+0360-U+0361)
 * Compatibility characters now deprecated, Ano Teleia and Small Letter Yot (total 5 characters) were added to Greek and Coptic (U+0374-U+0375, U+037A, U+037E, U+0387 and U+03F3).
 * Additional characters for non-Slavic languages (total 38 characters) were added to Cyrillic. (U+04D0-U+04EB, U+04EE-U+04F5 and U+04F8-U+04F9)
 * A ligature of Ech and Yiwn (և) (total 1 character) was added to Armenian. (U+0587)
 * One deprecated compatibility character and several characters for biblical texts (total 25 characters) were added to Arabic. (U+066D and U+06D6-U+06ED)
 * A sign Virama (total 1 character) was added to Gurmukhi (U+0A4D).
 * Letters Candra O and E (total 3 characters) were added to Gujarati. (U+0A8D, U+0A91 and U+0AC9)
 * An Ai Length mark (total 1 character) was added to Oriya. (U+0B56)
 * An undertie, a pair of brackets and six formatting characters now deprecated (total 9 characters) were added to General Punctuation. (U+203F, U+2045-U+2046 and U+206A-U+206F)
 * Some additional symbols and the complete set of APL functional symbols (total 79 characters) were added to Miscellaneous Technical. (U+2300 and U+232D-U+237A)
 * A large circle (◯) (total 1 character) was added to Geometric Shapes. (U+25EF)
 * The ideographic telegraph line feed separator symbol (〷) (total 1 character) was added to CJK Symbols and Punctuation. (U+3037)
 * Four Katakana letters not in use since 1945 (total 4 characters) were added to Katakana. (U+30F7-U+30FA)
 * Ideographic telegraph symbols for the twelve months (total 12 characters) were added to Enclosed CJK Letters and Months. (U+32C0-U+32CB)
 * Ideographic telegraph symbols for hours and days and six additional measure units (total 62 characters) were added to CJK Compatibility. (U+3358-U+3376 and U+33E0-U+33FE)
 * Some more space (total 2,304 characters) was added to the Private Use Area.
 * Seven halfwidth geometric shapes (total 7 characters) were added to Halfwidth and Fullwidth Forms. (U+FFE8-U+FFEE)

Unicode 2.0
Unicode 2.0 was released July 1996. It encoded 38,950 characters, adding 4,717 new characters, and was the first Unicode version to reserve blocks outside of the Basic Multilingual Plane.

New blocks

 * Hangul Syllables (U+AC00-U+D7AF), containing 11,172 precomposed syllables for the Hangul script, was added.
 * High Surrogates (U+D800-U+DB7F), containing 896 characters, was added.
 * High Private Use Surrogates (U+DB80-U+DBFF), containing 128 characters, was added.
 * Low Surrogates (U+DC00-U+DFFF), containing 1,024 characters, was added.
 * Supplementary Private Use Area-A (U+F0000-U+FFFFF), reserving 65,534 characters for private use, was added.
 * Supplementary Private Use Area-B (U+100000-U+10FFFF), reserving 65,534 characters for private use, was added.

Reinstated blocks

 * Tibetan (U+0F00-U+0FFF), now containing 168 characters for the Tibetan script including religious signs, was readded.

Removed blocks

 * Hangul, containing 2,350 precomposed syllables for the Hangul script, was removed from the Unicode standard.
 * Hangul Supplementary-A, containing 1,930 precomposed syllables for the Hangul script, was removed from the Unicode standard.
 * Hangul Supplementary-B, containing 2,376 precomposed syllables for the Hangul script, was removed from the Unicode standard.

Extended blocks

 * Cantillation marks for use in religious texts (total 31 characters) were added to Hebrew. (U+0591-U+05A1, U+05A3-U+05AF and U+05C4)
 * A long S with Dot Above (total 1 character) was added to Latin Extended Additional. (U+1E9B)
 * A Vietnamese Dong sign (total 1 character) was added to Currency Symbols. (U+20AB)

Unicode 2.1
Unicode 2.1 was released May 1998. It encoded 38,952 characters, adding only 2 new characters.

Extended blocks

 * A Euro sign (total 1 character) was added to Currency Symbols. (U+20AC)
 * An Object Replacement Character (total 1 character) was added to Specials. (U+FFFC)

Unicode 3.0
Unicode 3.0 was released September 1999. It was a big update and encoded 49,259 characters, adding 10,307 new characters.

New blocks

 * Syriac (U+0700-U+074F), containing 71 characters used for writing in Syriac script, was added.
 * Thaana (U+0780-U+07BF), containing 49 characters used for writing in Thaana script, was added.
 * Sinhala (U+0D80-U+0DFF), containing 80 characters for the Sinhala script, was added.
 * Myanmar (U+1000-U+109F), containing 78 characters for the Burmese script, was added.
 * Ethiopic (U+1200-U+137F), containing 345 syllables and punctuation marks for the Ethiopic script, was added.
 * Cherokee (U+13A0-U+13FF), containing 85 syllables for the Cherokee script, was added.
 * Unified Canadian Aboriginal Syllabics (U+1400-U+167F), containing 630 syllables and punctuation marks for writing in aboriginal languages of Canada, was added.
 * Ogham (U+1680-U+169F), containing 29 characters for the ancient Ogham script, was added.
 * Runic (U+16A0-U+16FF), containing 81 characters for the Germanic runes, was added.
 * Khmer (U+1780-U+17FF), containing 103 characters for the Khmer script, was added.
 * Mongolian (U+1800-U+18AF), containing 155 characters for the classical Mongolian script, was added.
 * Braille Patterns (U+2800-U+28FF), containing 256 Braille letters, was added.
 * CJK Radicals Supplement (U+2E80-U+2EFF), containing 115 non-Kangxi radicals, was added.
 * Kangxi Radicals (U+2F00-U+2FDF), containing 214 radicals from the Kangxi dictionary, was added.
 * Ideographic Description Characters (U+2FF0-U+2FFF), used to describe a Han ideograph not available in the font, was added.
 * Bopomofo Extended (U+31A0-U+31BF), containing 24 characters used for phonetic transcription of minority languages of Taiwan, was added.
 * CJK Unified Ideographs Extension A (U+3400-U+4DBF), containing 6,582 additional Han Ideographs, was added.
 * Yi Syllables (U+A000-U+A48F), containing 1,165 syllables of the modern Yi script, was added.
 * Yi Radicals (U+A490-U+A4CF), containing 50 radicals of Yi Syllables, was added.

Extended blocks

 * Additional precomposed characters, letters and capital letters of lowercase-only letters (total 30 characters) were added to Latin Extended-B. (U+01F6-U+01F9, U+0218-U+021F and U+0222-U+0233)
 * Extensions for disordered speech (total 5 characters) were added to IPA Extensions. (U+02A9-U+02AD)
 * Some additional modifier letters (total 6 characters) were added to Spacing Modifier Letters. (U+02DF and U+02EA-U+02EE)
 * Additional combining diacritics for IPA (total 10 characters) were added to Combining Diacritical Marks. (U+0346-U+034E and U+0362)
 * Lowercase versions of archaic letters and the Kai symbol (total 5 characters) were added to Greek and Coptic. (U+03D7, U+03DB, U+03DD, U+03DF and U+03E1)
 * Nonstandard letters for Macedonian, combining numeral signs and three letters for Kildin Sami (total 12 characters) were added to Cyrillic. (U+0400, U+040D, U+0450, U+045D, U+0488-U+0489, U+048C-U+048F and U+04EC-U+04ED)
 * A Hyphen (total 1 character) was added to Armenian. (U+058A)
 * Combining hamza and maddah and nine additional Arabic characters (total 12 characters) were added to Arabic. (U+0653-U+0655, U+06B8-U+06B9, U+06BF, U+06CF and U+06FA-U+06FE)
 * Additional letters and religious symbols (total 25 characters) were added to Tibetan. (U+0F6A, U+0F96, U+0FAE-U+0FB0, U+0FB8, U+0FBA-U+0FBC, U+0FBE-U+0FCC and U+0FCF)
 * A narrow no-break space and 6 additional punctuation marks (total 7 characters) were added to General Punctuation. (U+202F and U+2048-U+204D)
 * The Kip, Tugrik and Drachma sign (total 3 characters) were added to Currency Symbols. (U+20AD-U+20AF)
 * An enclosing screen and an enclosing key (total 2 characters) were added to Combining Diacritical Marks for Symbols. (U+20E2-U+20E3)
 * The information symbol and a rotated Q (total 2 characters) were added to Letterlike Symbols. (U+2139-U+213A)
 * A mirrored Roman capital numeral hundred (Ↄ) (total 1 character) was added to Number Forms. (U+2183)
 * Some additional arrows (total 9 characters) were added to Arrows. (U+21EB-U+21F3)
 * Some additional technical symbols, including common keys on a 101 keyboard (total 33 characters) were added to Miscellaneous Technical. (U+2301, U+237B and U+237D-U+239A)
 * Two additional control pictures (total 2 characters) were added to Control Pictures. (U+2425-U+2426)
 * Squares and circles with quadrants (total 8 characters) were added to Geometric Shapes. (U+25F0-U+25F7)
 * Two Syriac crosses and a signature mark (total 3 characters) were added to Miscellaneous Symbols. (U+2619 and U+2670-U+2671)
 * Three Hangzhou numerals and a variation indicator (total 4 characters) were added to CJK Symbols and Punctuation. (U+3038-U+303A and U+303E)
 * A ligature Yod with Hiriq (יִ) (total 1 character) was added to Alphabetic Presentation Forms. (U+FB1D)
 * Three additional control characters for ruby markup (total 3 characters) were added to Specials. (U+FFF9-U+FFFB)

Unicode 3.1
Unicode 3.1 was released March 2001. It encoded 94,205 characters, adding 44,946 new characters, and mainly focused on blocks outside of the Basic Multilingual Plane.

New blocks

 * Old Italic (U+10300-U+1032F), containing 35 letters for the Etruscan script, was added.
 * Gothic (U+10330-U+1034F), containing 27 letters for the Gothic script, was added.
 * Deseret (U+10400-U+1044F), containing 76 letters for the constructed Deseret script, was added.
 * Byzantine Musical Symbols (U+1D000-U+1D0FF), containing 246 symbols for musical notation in Byzantine, was added.
 * Musical Symbols (U+1D100-U+1D1FF), containing 219 characters for current musical notation, was added.
 * Mathematical Alphanumeric Symbols (U+1D400-U+1D7FF), containing 991 Latin and Greek letters in serif, sans-serif, bold, italic, double-struck, script and Fraktur/Blackletter, was added.
 * CJK Unified Ideographs Extension B (U+20000-U+2A6DF), containing 42,711 additional Chinese Ideographs, was added.
 * CJK Compatibility Ideographs Supplement (U+2F800-U+2FA1F), containing 542 additional Chinese Ideographs for compatibility purposes, was added.
 * Tags, containing 97 language tags, was added. (U+E0000-U+E007F)

Extended noncharacters

 * The Noncharacters range: U+FDD0..U+FDEF were added to Arabic Presentation Forms-A.

Extended blocks

 * The capital Theta symbol and the Lunate Epsilon symbol (total 2 characters) were added to Greek and Coptic. (U+03F4-U+03F5)

Characters and Scripts Under Investigation or Rejected

 * Khmer Sign Laak Was Rejected. (U+17DD) From Khmer.
 * Georgian Letter U-Brjuu Was Rejected. From Georgian.

Unicode 3.2
Unicode 3.2 was released March 2002. It encoded 95,221 characters, adding 1,016 new characters.

New blocks

 * Cyrillic Supplement (U+0500-U+052F), containing 16 characters used for the Komi language, was added.
 * Tagalog (U+1700-U+171F), containing 20 characters for the Baybayin script, was added.
 * Hanunoo (U+1720-U+173F), containing 23 characters and punctuation for the Hanunoo script, was added.
 * Buhid (U+1740-U+175F), containing 20 characters for the Buhid script, was added.
 * Tagbanwa (U+1760-U+177F), containing 18 characters for the Tagbanwa script, was added.
 * Miscellaneous Mathematical Symbols-A (U+27C0-U+27EF), containing 28 symbols used in math notation, was added.
 * Supplemental Arrows-A (U+27F0-U+27FF), containing 16 additional arrows, was added.
 * Supplemental Arrows-B (U+2900-U+297F), containing 128 special arrows, was added.
 * Miscellaneous Mathematical Symbols-B (U+2980-U+29FF), containing 128 additional mathematical symbols, was added.
 * Supplemental Mathematical Operators (U+2A00-U+2AFF), containing 256 additional mathematical operators, was added.
 * Katakana Phonetic Extensions (U+31F0-U+31FF), containing 16 Katakana letters used for Ainu, was added.
 * Variation Selectors (U+FE00-U+FE0F), containing 16 symbols used for indicating variations, was added.

Extended blocks

 * A capital letter N with Long Right Leg (total 1 character) was added to Latin Extended-B. (U+0220)
 * The combining grapheme joiner and combining Latin letters used in medieval texts (total 14 characters) were added to Combining Diacritical Marks. (U+034F and U+0363-U+036F)
 * The Qoppa and a reversed lunate epsilon symbol (total 3 characters) were added to Greek and Coptic. (U+03D8-U+03D9 and U+03F6)
 * Four additional letters used for the Kildin Sami language (total 8 characters) were added to Cyrillic. (U+048A-U+048B, U+04C5-U+04C6, U+04C9-U+04CA and U+04CD-U+04CE)
 * A dotless Beh and a dotless Qaf (total 2 characters) were added to Arabic. (U+066E-U+066F)
 * A Letter for Addu dialect (total 1 character) was added to Thaana. (U+07B1)
 * The letters Yn and Elifi (total 2 characters) were added to Georgian. (U+10F7-U+10F8)
 * Some additional punctuation marks and control characters (total 12 characters) were added to General Punctuation. (U+2047, U+204E-U+2052, U+2057 and U+205F-U+2063)
 * A superscript letter I (total 1 character) was added to Superscripts and Subscripts. (U+2071)
 * German Penny and Peso sign (total 2 characters) were added to Currency Symbols. (U+20B0-U+20B1)
 * Some additional combining characters (total 7 characters) were added to Combining Diacritical Marks for Symbols. (U+20E4-U+20EA)
 * Some double-struck and reversed/turned letters (total 15 characters) were added to Letterlike Symbols. (U+213D-U+214B)
 * Some additional arrows (total 12 characters) were added to Arrows. (U+21F4-U+21FF)
 * Some additional mathematical operators (total 14 characters) were added to Mathematical Operators. (U+22F2-U+22FF)
 * Variable-width and additional symbols (total 53 characters) were added to Miscellaneous Technical. (U+237C and U+239B-U+23CE)
 * Black and double circled numerals (total 20 characters) were added to Enclosed Alphanumerics. U+24EB-U+24FE)
 * Quadrant elements (total 10 characters) were added to Block Elements. (U+2596-U+259F)
 * Some additional triangles and squares (total 8 characters) were added to Geometric Shapes. (U+25F8-U+25FF)
 * Shogi pieces ,recycling symbols, dices and dotted circles (total 24 characters) were added to Miscellaneous Symbols. (U+2616-U+2617, U+2672-U+267D and U+2680-U+2689)
 * Additional parenthesis (total 14 characters) were added to Dingbats. (U+2768-U+2775)
 * Three additional marks (total 3 characters) were added to CJK Symbols and Punctuation. (U+303B-U+303D)
 * A digraph and two additional characters (total 3 characters) were added to Hiragana. (U+3095-U+3096 and U+309F)
 * A digraph and a double hyphen (total 2 characters) were added to Katakana. (U+30A0 and U+30FF)
 * Additional circled numerals (total 30 characters) were added to Enclosed CJK Letters and Months. (U+3251-U+325F and U+32B1-U+32BF
 * Five missing radicals (total 5 characters) were added to Yi Radicals. (U+A4A2-U+A4A3, U+A4B4, U+A4C1, U+A4C5)
 * Additional compatibility characters (total 59 characters) were added to CJK Compatibility Ideographs. (U+FA30-U+FA6A)
 * A Rial sign (total 1 character) was added to Arabic Presentation Forms-A. (U+FDFC)
 * Two sesame dots (total 2 characters) were added to CJK Compatibility Forms. (U+FE45-U+FE46)
 * A tail fragment (total 1 character) was added to Arabic Presentation Forms-B. (U+FE73)
 * A pair of double parenthesis (total 2 characters) was added to Halfwidth and Fullwidth Forms. (U+FF5F-U+FF60)

Unicode 4.0
Unicode 4.0 was released April 2003. It encoded 96,447 characters, adding 1,226 new characters.

New blocks

 * Limbu, containing 66 characters for the Limbu abugida, was added.
 * Tai Le, containing 35 letters for the Tai Le script, was added.
 * Khmer Symbols, containing 32 symbols for the lunar calendar, was added.
 * Phonetic Extensions, containing 108 letters used in phonetic transcription, was added.
 * Miscellaneous Symbols and Arrows, containing 14 additional arrows, was added.
 * Yijing Hexagram Symbols, containing 64 hexagrams, was added.
 * Linear B Syllabary, containing 88 syllables of the ancient Linear B script, was added.
 * Linear B Ideograms, containing 123 ideograms of the ancient Linear B script, was added.
 * Aegean Numbers, containing 57 numerals used in the Aegean area, was added.
 * Ugaritic, containing 31 characters used in Ugaritic cuneiform, was added.
 * Shavian, containing 48 letters used for the artificial Shavian script, was added.
 * Osmanya, containing 40 characters used in the artificial Osmanya script, was added.
 * Cypriot Syllabary, containing 55 characters formerly used on Cyprus, was added.
 * Tai Xuan Jing Symbols, containing 87 symbols of Tai Xuan Jing, was added.
 * Variation Selectors Supplement, containing 240 additional variation selectors, was added.

Extended blocks

 * Letters with curl used in Sinology (total 4 characters) were added to Latin Extended-B.
 * Former IPA letters (total 2 characters) were added to IPA Extensions.
 * Some additional characters (total 17 characters) were added to Spacing Modifier Letters.
 * Additional combining double-width diacritics and diacritics corresponding to their spacing equivalent (total 11 characters) were added to Combining Diacritical Marks.
 * The archaic letters Sho and San and the capital Lunate Sigma (total 5 characters) were added to Greek and Coptic.
 * Some additional markers, biblical signs, and letters with inverted V (total 19 characters) were added to Arabic.
 * Letters used for foreign words from Persian and Sogdian (total 6 characters) were added to Syriac.
 * The short A (ऄ) (total 1 character) was added to Devanagari.
 * The Avagraha sign (ঽ) (total 1 character) was added to Bengali.
 * The Adak Bindi and Visarga signs (total 2 characters) were added to Gurmukhi.
 * The vocalic l and ll and the Rupee sign (total 5 characters) were added to Gujarati.
 * The letters Va and Wa (total 2 characters) were added to Oriya.
 * Additional signs for date and finance environments (total 8 characters) were added to Tamil.
 * The Nukta and Avagraha signs (total 2 characters) were added to Kannada.
 * Some symbols and signs (total 11 characters) were added to Khmer.
 * An inverted undertie and a swung dash (total 2 characters) were added to General Punctuation.
 * The facsimile sign (℻) (total 1 character) was added to Letterlike Symbols.
 * The eject symbol and a vertical line (total 2 characters) were added to Miscellaneous Technical.
 * A black circled digit zero (⓿) (total 1 character) was added to Enclosed Alphanumerics.
 * Monograms and diagrams, flags, warning and weather symbols and a cup of tea (total 12 characters) were added to Miscellaneous Symbols.
 * Additional parenthesized and circled Korean characters and supplemental signs (total 9 characters) were added to Enclosed CJK Letters and Months.
 * Additional measure units (total 7 characters) were added to CJK Compatibility.
 * An additional Arabic sign (﷽) (total 1 character) was added to Arabic Presentation Forms-A.
 * A pair of vertical parenthesis (total 2 characters) was added to CJK Compatibility Forms.
 * The letters Oi and Ew (total 4 characters) were added to Deseret.
 * A small script l (ℓ) (total 1 character) was added to Mathematical Alphanumeric Symbols.

Unicode 4.1
Unicode 4.1 was released March 31, 2005. It encoded 97,720 characters, adding 1,273 new characters.

New blocks

 * Arabic Supplement, containing 30 characters for various languages written with the Arabic script, was added.
 * Ethiopic Supplement, containing 26 characters and signs for Sebatbeit, was added.
 * New Tai Lue, containing 80 characters for the New Tai Lue script, was added.
 * Buginese, containing 30 characters for the Lontara script, was added.
 * Phonetic Extensions Supplement, containing 64 additional letters for phonetic transcription, was added.
 * Combining Diacritical Marks Supplement, containing 4 additional diacritics, was added.
 * Glagolitic, containing 94 characters for the Glagolitic script, was added.
 * Coptic, containing 114 characters for the Coptic script, was added.
 * Georgian Supplement, containing 38 Nuskhuri letters, was added.
 * Tifinagh, containing 55 characters for the Tifinagh script, was added.
 * Ethiopic Extended, containing 79 additional Ethiopic syllables, was added.
 * Supplemental Punctuation, containing 26 additional punctuation marks, was added.
 * CJK Strokes, containing 16 strokes for Han Ideographs, was added.
 * Modifier Tone Letters, containing 23 letters for Chinese tones, was added.
 * Syloti Nagri, containing 44 characters for the Syloti Nagri abugida, was added.
 * Vertical Forms, containing 10 punctuation marks suited for vertical text, was added.
 * Ancient Greek Numbers, containing 75 numerals and signs used in Ancient Greek, was added.
 * Old Persian, containing 50 characters for Old Persian cuneiform, was added.
 * Kharoshthi, containing 65 characters for the Kharoshthi abugida, was added.
 * Ancient Greek Musical Notation, containing 70 musical signs used in Ancient Greek, was added.

Extended blocks

 * Letters for Sencoten, digraphs, letters with swash tail and other additions (total 11 characters) were added to Latin Extended-B.
 * Additional diacritics for transliteration (total 5 characters) were added to Combining Diacritical Marks.
 * Rho with stroke, reversed and dotted Lunate Sigma (total 4 characters) were added to Greek and Coptic.
 * Ghe with descender (Ӷ) (total 2 characters) was added to Cyrillic.
 * An additional biblical mark and some punctuation marks (total 4 characters) were added to Hebrew.
 * Additional biblical marks, punctuation marks and the Afghani sign (total 8 characters) were added to Arabic.
 * A glottal stop (ॽ) (total 1 character) was added to Devanagari.
 * The Khanda Ta letter (ৎ) (total 1 character) was added to Bengali.
 * The letter Sha and the digit zero (total 2 characters) were added to Tamil.
 * Two marks used in Bhutan (total 2 characters) were added to Tibetan.
 * Two letters and a modifier letter (total 3 characters) were added to Georgian.
 * Some additional syllables (total 11 characters) were added to Ethiopic.
 * Additional phonetic symbols (total 20 characters) were added to Phonetic Extensions.
 * A flower and dot punctuation marks (total 9 characters) were added to General Punctuation.
 * Additional subscript letters (total 5 characters) were added to Superscripts and Subscripts.
 * The Guarani, Austral, Hryvnia and Cedi signs (total 4 characters) were added to Currency Symbols.
 * A combining long double solidus (⃫) (total 1 character) was added to Combining Diacritical Marks for Symbols.
 * The per sign and a double-struck letter Pi (total 2 characters) were added to Letterlike Symbols.
 * Metrical and electrical signs (total 11 characters) were added to Miscellaneous Technical.
 * Additional gender and map symbols (total 30 characters) were added to Miscellaneous Symbols.
 * Some additional mathematical symbols (total 7 characters) were added to Miscellaneous Mathematical Symbols-A.
 * Additional arrows and squares (total 6 characters) were added to Miscellaneous Symbols and Arrows.
 * A circled Hangul character (㉾) (total 1 character) was added to Enclosed CJK Letters and Months.
 * Additional Han Ideographs (total 22 characters) were added to CJK Unified Ideographs.
 * Additional Compatibility Ideographs (total 106 characters) were added to CJK Compatibility Ideographs.
 * Italic dotless small i and j (total 2 characters) were added to Mathematical Alphanumeric Symbols.

Unicode 5.0
Unicode 5.0 was released July 14, 2006. It encoded 99,089 characters, adding 1,369 new characters.

New blocks

 * N'Ko, containing 59 characters for the N'Ko script, was added.
 * Balinese, containing 121 characters and musical signs for the Balinese abugida, was added.
 * Latin Extended-C, containing 17 letters for various languages, was added.
 * Latin Extended-D, containing 2 characters for UPA, was added.
 * Phags-pa, containing 56 characters for the Phags-pa script, was added.
 * Phoenician, containing 27 letters and numerals for the Phoenician script, was added.
 * Cuneiform, containing 879 signs for Sumero-Akkadian Cuneiform, was added.
 * Cuneiform Numbers and Punctuation, containing 103 numerals and punctuation signs for Sumero-Akkadian Cuneiform, was added.
 * Counting Rod Numerals, containing 18 numerals used with counting rods, was added.

Extended blocks

 * Various letters used mainly for aboriginal languages (total 14 characters) were added to Latin Extended-B.
 * Lowercase lunate sigma symbols (total 3 characters) were added to Greek and Coptic.
 * Lowercase palochka and 3 letters used in Nivkh (total 7 characters) were added to Cyrillic.
 * Two letters used in Khanty and other languages (total 4 characters) were added to Cyrillic Supplement.
 * A specific point meant for Vav (ֺ) (total 1 character) was added to Hebrew.
 * Four letters used in Sindhi (total 4 characters) were added to Devanagari.
 * Four letters used in Sanskrit (total 4 characters) were added to Kannada.
 * Additional IPA diacritics (total 9 characters) were added to Combining Diacritical Marks Supplement.
 * Four combining arrows (total 4 characters) were added to Combining Diacritical Marks for Symbols.
 * A danish symbol and a lowercase turned F (total 2 characters) were added to Letterlike Symbols.
 * A lowercase reversed C (ↄ) (total 1 character) was added to Number Forms.
 * Vertical parenthesis, geometric forms and electrical symbols (total 12 characters) were added to Miscellaneous Technical.
 * A neuter symbol (⚲) (total 1 character) was added to Miscellaneous Symbols.
 * Four additional mathematical symbols (total 4 characters) were added to Miscellaneous Mathematical Symbols-A.
 * Additional squares, pentagons and hexagons (total 11 characters) were added to Miscellaneous Symbols and Arrows.
 * Four additional tone letters used in Chinantec (total 4 characters) were added to Modifier Tone Letters.
 * Bold Digamma (𝟊/Ϝ) (total 2 characters) was added to Mathematical Alphanumeric Symbols.

Unicode 5.1
Unicode 5.1 was released April 4, 2008. It encoded 100,713 characters, adding 1,624 new characters.

New blocks

 * Sundanese, containing 55 letters for Sundanese script, was added.
 * Lepcha, containing 74 letters for Lepcha script, was added.
 * Ol Chiki, containing 48 letters for Ol Chiki script, was added.
 * Cyrillic Extended-A, containing 32 letters for combining Cyrillic letters, was added.
 * Vai, containing 300 letters for Vai script, was added.
 * Cyrillic Extended-B, containing 78 letters for additional Cyrillic characters, was added.
 * Saurashtra, containing 81 letters for Saurashtra script, was added.
 * Kayah Li, containing 48 letters for Kayah languages, was added.
 * Rejang, containing 37 letters for Rejang script, was added.
 * Cham, containing 83 letters for Cham script, was added.
 * Ancient Symbols, containing 12 characters for weights and measures and other Ancient symbols, was added.
 * Phaistos Disc, containing 46 hieroglyphs for Phaistos, was added.
 * Lycian, containing 29 letters for Lycian script, was added.
 * Carian, containing 49 letters for Carian script, was added.
 * Lydian, containing 27 letters for Lydian script, was added.
 * Mahjong Tiles, containing 44 mahjong tiles, was added.
 * Domino Tiles, containing 100 domino tiles, was added.

Extended blocks

 * Archaic letters and capital kai symbol (total 7 characters) were added to Greek and Coptic.
 * Combining Pokrytie (total 1 character) was added to Cyrillic.
 * Mordvin, Kurdish, Aleut and Chuvash letters (total 16 characters) were added to Cyrillic Supplement.
 * Radix symbols, Letterlike, punctuation, Koranic annotation signs and additions for early Persian and Azerbaijani (total 15 characters) were added to Arabic.
 * Additional letters in Torwali, Burushaski and early Persian (total 18 characters) were added to Arabic Supplement.
 * High spacing dot and candra a (total 2 characters) were added to Devanagari.
 * Udaat and yakash signs (total 2 characters) were added to Gurmukhi.
 * Vocalic rr, l and ll (total 3 characters) were added to Oriya.
 * Om symbol (ௐ) (total 1 character) was added to Tamil.
 * Avagraha, additional phonetic letters, vocalic l and ll, fractional signs and tuumu (total 13 characters) were added to Telugu.
 * Avagraha, vocalic rr, l and ll, Malayalam numerics and fractions and chillu letters (total 17 characters) were added to Malayalam.
 * Letters for Balti and various symbols (total 6 characters) were added to Tibetan.
 * Characters for various languages (total 78 characters) were added to Myanmar.
 * Manchu Ali Gali lha (ᢪ) (total 1 character) was added to Mongolian.
 * Miscellaneous combining marks (total 28 characters) were added to Combining Diacritical Marks Supplement.
 * Medievalist latin letters and miscellaneous letters (total 10 characters) were added to Latin Extended Additional.
 * Invisible plus (+) (total 1 character) was added to General Punctuation.
 * Combining asterisk above ( ⃰)(total 1 character) was added to Combining Diacritical Marks for Symbols.
 * Symbol for Samaritan Source (⅏) (total 1 character) was added to Letterlike Symbols.
 * Archaic Roman Numerals (total 4 characters) were added to Number Forms.
 * Outlined white star and other signs (total 15 characters) were added to Miscellaneous Symbols.
 * Long division and additional mathematical brackets (total 5 characters) were added to Miscellaneous Mathematical Symbols-A.
 * Miscellaneous signs (total 51 characters) were added to Miscellaneous Symbols and Arrows.
 * Additional latin letters (total 12 characters) were added to Latin Extended-C.
 * Additional punctuation (total 23 characters) were added to Supplemental Punctuation.
 * Letter ih (ㄭ) (total 1 character) was added to Bopomofo.
 * Other strokes (total 20 characters) were added to CJK Strokes.
 * Miscellaneous additions (total 8 characters) were added to CJK Unified Ideographs.
 * Africanist tone letters (total 5 characters) were added to Modifier Tone Letters.
 * Miscellaneous letters and symbols (total 112 characters) were added to Latin Extended-D.
 * Continuous macrons for Coptic (total 3 characters) were added to Combining Half Marks.
 * Musical symbol multiple measure rest (𝄩) (total 1 character) was added to Musical Symbols.

Unicode 5.2
Unicode 5.2 was released in October 1, 2009. It encoded 107,361 characters, adding 6,648 new characters.

New blocks

 * Samaritan, containing 61 letters for Samaritan script, was added.
 * Unified Canadian Aboriginal Syllabics Extended, containing 70 syllables for various cree languages, was added.
 * Tai Tham, containing 127 letters for Tai Tham script, was added.
 * Vedic Extensions, containing 35 characters for tone marks and signs, was added.
 * Lisu, containing 48 letters for Lisu script, was added.
 * Bamum, containing 88 letters for Bamum script, was added.
 * Common Indic Number Forms, containing 10 fractions and marks, was added.
 * Devanagari Extended, containing 28 additional marks, was added.
 * Hangul Jamo Extended-A, containing 29 characters for additional old initial consonants in hangul jamo, was added.
 * Javanese, containing 91 letters for Javanese script, was added.
 * Myanmar Extended-A, containing 28 letters for Khamti Shan in Myanmar, was added.
 * Tai Viet, containing 72 letters for Tai Viet script, was added.
 * Meetei Mayek, containing 56 letters for Meetei Mayek script, was added.
 * Hangul Jamo Extended-B, containing 72 characters for additional old medieval vowels and final consonants in hangul jamo, was added.
 * Imperial Aramaic, containing 31 characters for Old Aramaic, was added.
 * Old South Arabian, containing 32 letters and numbers for South Arabian, was added.
 * Avestan, containing 61 characters for Avestan script, was added.
 * Inscriptional Parthian, containing 30 characters for Inscriptional Parthian script, was added.
 * Inscriptional Pahlavi, containing 27 characters for Inscriptional Pahlavi script, was added.
 * Old Turkic, containing 73 characters for Orkhon script, was added.
 * Rumi Numeral Symbols, containing 31 numeric characters used in Fez, Morocco, and elsewhere in North Africa and the Iberian peninsula, between the tenth and seventeenth centuries, was added.
 * Kaithi, containing 66 letters for Khaiti script, was added.
 * Egyptian Hieroglyphs, containing 1,071 hieroglyphs for Egyptian, was added.
 * Enclosed Alphanumeric Supplement, containing 63 additional circled, parenthesized and squared alphanumerics, was added.
 * Enclosed Ideographic Supplement, containing 44 squared and tortoised shell bracketed ideographs, was added.
 * CJK Unified Ideographs Extension C, containing 4,149 additional Chinese Ideographs, was added.

Extended blocks

 * Abhaz letters (total 2 characters) were added to Cyrillic Supplement.
 * Inverted Candrabinbu and additional signs and letters (total 5 characters) were added to Devanagari.
 * Ganda Mark (৻) (total 1 character) was added to Bengali.
 * Religious svasti signs (total 4 characters) were added to Tibetan.
 * Extensions for Khamti Shan and Alton and Phake (total 4 characters) were added to Myanmar.
 * Additional old initial consonants, medival vowels, and old final consonants (total 16 characters) were added to Hangul Jamo.
 * Hyphen and additional syllables (total 10 characters) were added to Unified Canadian Aboriginal Syllabics.
 * Letter Sua and Tham Digit One (total 3 characters) were added to New Tai Lue.
 * Combing Almost Equal to Below ( ᷽) (total 1 character) was added to Combining Diacritical Marks Supplement.
 * The Live Tournosis, Spesmillo and Tenge signs (total 3 characters) were added to Currency Symbols.
 * Additional vulgar fractions (total 4 characters) were added to Number Forms.
 * Decimal exponent symbol (⏨) (total 1 characters) was added to Miscellaneous Technical.
 * Additional weather, game and map symbols, traffic signs, sport symbols, closed captioning and draught and checkers (total 59 characters) were added to Miscellaneous Symbols.
 * Heavy exclamation mark symbol (❗) (total 1 character) was added to Dingbats.
 * Traffic sign, dictionary and map symbols (total 5 characters) were added to Miscellaneous Symbols and Arrows.
 * Capital letter turned alpha and additions for shona (total 3 characters) were added to Latin Extended-C.
 * Cryptogrammic letters and combining marks (total 7 characters) were added to Coptic.
 * Word separator middle dot used in Avestan (⸱) (total 1 character) was added to Supplemental Punctuation.
 * Circled ideographs and numbers on black squares (total 12 characters) were added to Enclosed CJK Letters and Months.
 * Miscellaneous additions (total 8 characters) were added to CJK Unified Ideographs.
 * Miscellaneous additions for compatibility (total 3 characters) were added to CJK Compatibility Ideographs.
 * Number two and three (total 2 characters) were added to Phoenician.

Unicode 6.0
Unicode 6.0 was released in October 11, 2010. It encoded 109,449 characters, adding 2,088 new characters.

New blocks

 * Mandaic, containing 29 letters for Mandaic script, was added.
 * Batak, containing 56 letters for Batak script, was added.
 * Ethiopic Extended-A, containing 32 letters for Gamo-Gofa-Dawro, Basketo and Gumuz Ethiophic syllables, was added.
 * Brahmi, containing 108 characters for ancient Brahmi abugida, was added.
 * Bamum Supplement, containing 761 letters for additional Bamum script, was added.
 * Kana Supplement, containing 2 characters for archaic katakana, was added.
 * Playing Cards, containing 59 playing cards, was added.
 * Miscellaneous Symbols and Pictographs, containing 529 additional symbols, was added.
 * Emoticons, containing 63 faces, cat faces and gesture symbols, was added.
 * Transport and Map Symbols, containing 70 transportation, traffic signs and other symbols, was added.
 * Alchemical Symbols, containing 116 symbols for elements, was added.
 * CJK Unified Ideographs Extension D, containing 222 miscellaneous Han ideographs, was added.

Extended blocks

 * Azerbaijani letters (total 2 characters) were added to Cyrillic Supplement.
 * Kashmiri Yeh and Wavy hamza below (total 2 characters) were added to Arabic.
 * Dependent vowel signs and letters used in Kashmiri and Bihari (total 10 characters) were added to Devanagari.
 * Fraction signs (total 6 characters) were added to Oriya.
 * Letters used in scholarly only and letter dot reph (total 3 characters) were added to Malayalam.
 * Leading and Trailing Mchan Rtags (total 6 characters) were added to Tibetan.
 * Additional combining marks (total 2 characters) were added to Ethiopic.
 * Combining Double Inverted Breve Below (᷼) (total 1 character) was added to Combining Diacritical Marks Supplement.
 * Miscellaneous subscript letters (total 8 characters) were added to Superscripts and Subscripts.
 * Indian Rupee Sign (₹) (total 1 character) was added to Currency Symbols.
 * Pointing double triangle and additional mechanical symbols (total 11 characters) were added to Miscellaneous Technical.
 * Ophiucisus, astronomical symbol for uranus and pentagrams (total 6 characters) were added to Miscellaneous Symbols.
 * Additional heavy punctation marks, raised fist, raised hand, sparkles, heavy arithmetic symbols and curly loops (total 16 characters) were added to Dingbats.
 * Squared logicals (total 2 characters) were added to Miscellaneous Mathematical Symbols-A.
 * Separator mark and consonant joiner (total 2 characters) were added to Tifinagh.
 * Bopomofo for Hmu and Ge (total 3 characters) were added to Bopomofo Extended.
 * Reversed Tse (total 2 characters) were added to Cyrillic Extended-B.
 * Additional letters (total 15 characters) were added to Latin Extended-D.
 * Pedagogical symbols (total 16 characters) were added to Arabic Presentation Forms-A.
 * Additional squared, black circled and squared letters and regional indicator letters (total 107 characters) were added to Enclosed Alphanumeric Supplement.
 * Squared katakana, squared ideographs and circled advantage and accept (total 13 characters) were added to Enclosed Ideographic Supplement.

Unicode 6.1
Unicode 6.1 was released in January 31, 2012. It encoded 110,181 characters, adding 732 new characters.

New blocks

 * Arabic Extended-A (U+08A0-U+08FF), containing 39 characters, was added.
 * Sundanese Supplement (U+1CC0-U+1CCF), containing 8 characters, was added.
 * Meetei Mayek Extensions (U+AAE0-U+AAFF), containing 23 characters, was added.
 * Meroitic Hieroglyphs (U+10980-U+1099F), containing 32 characters, was added.
 * Meroitic Cursive (U+109A0-U+109FF), containing 26 characters, was added.
 * Sora Sompeng (U+110D0-U+110FF), containing 35 characters, was added.
 * Chakma (U+11100-U+1114F), containing 67 characters, was added.
 * Sharada (U+11180-U+111DF), containing 83 characters, was added.
 * Takri (U+11680-U+116CF), containing 66 characters, was added.
 * Miao (U+16F00-U+16F9F), containing 133 characters, was added.
 * Arabic Mathematical Alphabetic Symbols (U+1EE00-U+1EEFF), containing 143 characters, was added.

Extended blocks

 * An Armenian Dram sign (total 1 character) was added to Armenian. (U+058F)
 * A sign Samvat (total 1 character) was added to Arabic. (U+0604)
 * An Abbreviation mark (total 1 character) was added to Gujarati. (U+0AF0)
 * Letters for Khmu (total 2 characters) were added to Lao. (U+0EDE-U+0EDF)
 * Capital letter Yn, letter Aen, Hard and Labial sign (total 5 characters) were added to Georgian. (U+10C7, U+10CD and U+10FD-U+10FF)
 * Letters and signs for Old Sundanese (total 9 characters) were added to Sundanese. (U+1BAB-U+1BAD and U+1BBA-U+1BBF)
 * Sign Rotated Ardhavisarga, Candra Above, Jihvamuliya and Uphadhmaniya (total 4 characters) were added to Vedic Extensions. (U+1CF3-U+1CF6)
 * Mathematical diagonals (total 2 characters) were added to Miscellaneous Mathematical Symbols-A. (U+27CB and U+27CD)
 * A letter Bohairic Khei (total 2 characters) were added to Coptic. (U+2CF2-U+2CF3)
 * Small letters Yn and Aen (total 2 characters) were added to Georgian Supplement. (U+2D27 and U+2D2D)
 * Letters Ye and Yo (total 2 characters) were added to Tifinagh. (U+2D66-U+2D67)
 * (total 10 characters) were added to Supplemental Punctuation. (U+2E32-U+2E3B)
 * An additional ideograph for Kanji (total 1 character) was added to CJK Unified Ideographs. (U+9FCC)
 * Combining letter for Slavonic (total 9 characters) were added to Cyrillic Extended-B. (U+A674-U+A67B and U+A69F)
 * Letter C with Bar, capital letter H with Hook and modifier letters for extended IPA (total 5 characters) were added to Latin Extended-D. (U+A792-U+A793, U+A7AA and U+A7F8-U+A7F9)
 * Some additional ideographs for Korea (total 2 characters) were added to CJK Compatibility Ideographs. (U+FA2E-U+FA2F)
 * Symbols for Canadian legal use (total 2 characters) were added to Enclosed Alphanumeric Supplement. (U+1F16A-U+1F16B)
 * Typikon symbols (total 4 characters) were added to Miscellaneous Symbols and Pictographs. (U+1F540-U+1F543)
 * (total 13 characters) were added to Emoticons. (U+1F600, U+1F611, U+1F615, U+1F617, U+1F619, U+1F61B, U+1F61F, U+1F626-U+1F627, U+1F62C, U+1F62E-U+1F62F and U+1F634)

Unicode 6.2
Unicode 6.2 was released in September 26, 2012. It encoded 110,182 characters, adding only 1 new character.

Extended blocks

 * A Turkish Lira sign (total 1 character) was added to Currency Symbols. (U+20BA)

Unicode 6.3
Unicode 6.3 was released in September 30, 2013. It encoded 110,187 characters, adding only 5 new characters.

Extended blocks

 * A Letter mark (total 1 character) was added to Arabic. (U+061C)
 * Isolate directional format characters (total 4 characters) were added to General Punctuation. (U+2066-U+2069)

Unicode 7.0
Unicode 7.0 was released in June 16, 2014. It encoded 113,021 characters, adding 2,834 new characters.

New blocks

 * Combining Diacritical Marks Extended (U+1AB0-U+1AFF), containing 15 marks, was added.
 * Myanmar Extended-B (U+A9E0-U+A9FF), containing 31 letters, was added.
 * Latin Extended-E (U+AB30-U+AB6F), containing 50 letters, was added.
 * Coptic Epact Numbers (U+102E0-U+102FF), containing 28 numbers, was added.
 * Old Permic (U+10350-U+1037F), containing 43 letters, was added.
 * Elbasan (U+10500-U+1052F), containing 50 letters, was added.
 * Caucasian Albanian (U+10530-U+1056F), containing 53 letters and marks, was added.
 * Linear A (U+10600-U+1077F), containing 341 signs, was added.
 * Palmyrene (U+10860-U+1087F), containing 32 letters, was added.
 * Nabataean (U+10880-U+108AF), containing 40 letters and numbers, was added.
 * Old North Arabian (U+10A80-U+10A9F), containing 32 letters and numbers, was added.
 * Manichaean (U+10AC0-U+10AFF), containing 51 characters, was added.
 * Psalter Pahlavi (U+10B80-U+10BAF), containing 29 characters, was added.
 * Mahajani (U+11150-U+1117F), containing 39 letters and signs, was added.
 * Sinhala Archaic Numbers (U+111E0-U+111FF), containing 20 numbers, was added.
 * Khojki (U+11200-U+1124F), containing 61 characters, was added.
 * Khudawadi (U+112B0-U+112FF), containing 69 characters, was added.
 * Grantha (U+11300-U+1137F), containing 83 characters, was added.
 * Tirhuta (U+11480-U+114DF), containing 82 characters, was added.
 * Siddham (U+11580-U+115FF), containing 72 characters, was added.
 * Modi (U+11600-U+1165F), containing 79 characters, was added.
 * Warang Citi (U+118A0-U+118FF), containing 84 letters and numbers, was added.
 * Pau Cin Hau (U+11AC0-U+11AFF), containing 57 characters, was added.
 * Mro (U+16A40-U+16A6F), containing 43 characters, was added.
 * Bassa Vah (U+16AD0-U+16AFF), containing 36 characters, was added.
 * Pahawh Hmong (U+16B00-U+16B8F), containing 127 letters and signs, was added.
 * Duployan (U+1BC00-U+1BC9F), containing 143 characters, was added.
 * Shorthand Format Controls (U+1BCA0-U+1BCAF), containing 4 format characters, was added.
 * Mende Kikakui (U+1E800-U+1E8DF), containing 213 syllables and numbers, was added.
 * Ornamental Dingbats (U+1F650-U+1F67F), containing 48 pictographic characters, was added.
 * Geometric Shapes Extended (U+1F780-U+1F7FF), containing 85 pictographic characters, was added.
 * Supplemental Arrows-C (U+1F800-U+1F8FF), containing 148 pictographic characters, was added.

Extended blocks

 * A capital letter Yot (total 1 character) was added to Greek and Coptic. (U+037F)
 * Letters for Orok, Komi and Khanty (total 8 characters) were added to Cyrillic Supplement. (U+0528-U+052F)
 * An Eternity sign (total 2 characters) were added to Armenian. (U+058D-U+058E)
 * A Number Mark Above (total 1 character) was added to Arabic. (U+0605)
 * Letters for African, Philippine, Turkic, Berber, Belarusian, Palula and Shina languages (total 8 characters) were added to Arabic Extended-A. (U+08A1, U+08AD-U+08B2 and U+08FF)
 * A letter for Marwari (total 1 character) was added to Devanagari. (U+0978)
 * A sign Anji (total 1 character) was added to Bengali. (U+0980)
 * Sign Candrabindu and letter Llla (total 2 characters) were added to Telugu. (U+0C00 and U+0C34)
 * A Sign Candrabindu (total 1 character) was added to Kannada. (U+0C81)
 * A Sign Candrabindu (total 1 character) was added to Malayalam. (U+0D01)
 * Lith Numerals (total 10 characters) were added to Sinhala. (U+0DE6-U+0DEF)
 * Additional Old English runes (total 8 characters) were added to Runic. (U+16F1-U+16F8)
 * Letters Gyan and Tra (total 2 characters) were added to Limbu. (U+191D-U+191E)
 * Signs for Jaiminiya Sama Veda (total 2 characters) were added to Vedic Extensions. (U+1CF8-U+1CF9)
 * Marks for Germanic and American lexicology (total 15 characters) were added to Combining Diacritical Marks Supplement. (U+1DE7-U+1DF5)
 * Nordic Mark, Manat and Ruble sign (total 3 characters) were added to Currency Symbols. (U+20BB-U+20BD)
 * Playback symbols from Webdings font (total 7 characters) were added to Miscellaneous Technical. (U+23F4-U+23FA)
 * A Scissors symbol from Wingdings 2 font (total 1 character) was added to Dingbats. (U+2700)
 * Arrows for Lithuanian dialectology and symbols from Wingdings 3 font (total 115 characters) were added to Miscellaneous Symbols and Arrows. (U+2B4D-U+2B4F, U+2B5A-U+2B5F, U+2B60-U+2B73, U+2B76-U+2B95, U+2B98-U+2BB9, U+2BBD-U+2BC8 and U+2BCA-U+2BD1)
 * (total 7 characters) were added to Supplemental Punctuation. (U+2E3C-U+2E42)
 * Early Cyrillic letters and letters for Lithuanian dialectology (total 6 characters) were added to Cyrillic Extended-B. (U+A698-U+A69D)
 * Letters for European, American and African orthography (total 18 characters) were added to Latin Extended-D. (U+A794-U+A79F, U+A7AB-U+A7AD, U+A7B0-U+A7B1 and U+A7F7)
 * Tone marks for Tai Laing and letters for Shwe Palaung (total 4 characters) were added to Myanmar Extended-A. (U+AA7C-U+AA7F)
 * Combining phonetic marks (total 7 characters) were added to Combining Half Marks. (U+FE27-U+FE2D)
 * Additional mathematical symbols (total 2 characters) were added to Ancient Greek Numbers. (U+1018B-U+1018C)
 * A Greek Tau Rho symbol (total 1 character) was added to Ancient Symbols. (U+101A0)
 * A letter Ess (total 1 character) was added to Old Italic. (U+1031F)
 * A Number Joiner (total 1 character) was added to Brahmi. (U+1107F)
 * Sutra mark and sign Ekam (total 2 characters) were added to Sharada. (U+111CD and U+111DA)
 * Additional cuneiform signs (total 42 characters) were added to Cuneiform. (U+1236F-U+12398)
 * Additional numbers, vulgar fractions and a punctuation mark (total 13 characters) were added to Cuneiform Numbers and Punctuation. (U+12463-U+1246E and U+12474)
 * Red Joker, Fool and trumps (total 23 characters) were added to Playing Cards. (U+1F0BF and U+1F0E0-U+1F0F5)
 * Dingbat normal and negative sans-serif digit zero (total 2 characters) were added to Enclosed Alphanumeric Supplement. (U+1F10B-U+1F10C)
 * Symbols from Webdings, Wingdings 1 and 2 font (total 209 characters) were added to Miscellaneous Symbols and Pictographs. (U+1F321-U+1F32C, U+1F336, U+1F37D, U+1F394-U+1F39F, U+1F3C5, U+1F3CB-U+1F3CE, U+1F3D4-U+1F3DF, U+1F3F1-U+1F3F7, U+1F43F, U+1F441, U+1F4F8, U+1F4FD-U+1F4FE, U+1F53E-U+1F53F, U+1F544-U+1F54A, U+1F568-U+1F579, U+1F57B-U+1F5A3 and U+1F5A5-U+1F5FA)
 * Slightly frowning and smiling faces emoji (total 2 characters) were added to Emoticons. (U+1F641-U+1F642)
 * Symbols from Webdings and Wingdings 2 font (total 27 characters) were added to Transport and Map Symbols. (U+1F6C6-U+1F6CF, U+1F6E0-U+1F6EC and U+1F6F0-U+1F6F3)

Unicode 8.0
Unicode 8.0 was released in June 17, 2015. It encoded 120,737 characters, adding 7,716 new characters.

New blocks

 * Cherokee Supplement (U+AB70-U+ABBF), containing 80 lowercase letters, was added.
 * Hatran (U+108E0-U+108FF), containing 26 letters, was added.
 * Old Hungarian (U+10C80-U+10CFF), containing 108 letters, was added.
 * Multani (U+11280-U+112AF), containing 38 letters, was added.
 * Ahom (U+11700-U+1173F), containing 57 letters, was added.
 * Early Dynastic Cuneiform (U+12480-U+1254F), containing 196 characters, was added.
 * Anatolian Hieroglyphs (U+14400-U+1467F), containing 583 characters, was added.
 * Sutton SignWriting (U+1D800-U+1DAAF), containing 672 signs, was added.
 * Supplemental Symbols and Pictographs (U+1F900-U+1F9FF), containing 15 pictographic characters, was added.
 * CJK Unified Ideographs Extension E (U+2B820-U+2CEAF), containing 5762 characters, was added.

Extended blocks

 * Letters for Arwi (total 3 characters) were added to Arabic Extended-A. (U+08B3-U+08B4 and U+08E3)
 * A letter for Avestan transliteration (total 1 character) was added to Gujarati. (U+0AF9)
 * A letter for Andhra Pradesh (total 1 character) was added to Telugu. (U+0C5A)
 * An archaic letter II (total 1 character) was added to Malayalam. (U+0D5F)
 * A letter Mv and small letters (total 7 characters) were added to Cherokee. (U+13F5 and U+13F8-U+13FD)
 * A Georgian Lari sign (total 1 character) was added to Currency Symbols. (U+20BE)
 * Turned digits (total 2 characters) were added to Number Forms. (U+218A-U+218B)
 * Two headed arrows with triangle arrowheads (total 4 characters) were added to Miscellaneous Symbols and Arrows. (U+2BEC-U+2BEF)
 * Some additional ideographs (total 9 characters) were added to CJK Unified Ideographs. (U+9FCD-U+9FD5)
 * A combining letter Ef (total 1 character) was added to Cyrillic Extended-B. (U+A69E)
 * Sinological dot, phonetic extension for African languages, letters for American and Gabonese orthography (total 7 characters) were added to Latin Extended-D. (U+A78F and U+A7B2-U+A7B7)
 * Sign Siddham and letter Jain Om (total 2 characters) were added to Devanagari Extended. (U+A8FC-U+A8FD)
 * Letters for Yakut transliteration (total 4 characters) were added to Latin Extended-E. (U+AB60-U+AB63)
 * A combining mark for Church Slavonic (total 2 characters) were added to Combining Half Marks. (U+FE2E-U+FE2F)
 * Numerals and vulgar fractions (total 64 characters) were added to Meroitic Cursive. (U+109BC-U+109BD, U+109C0-U+109CF and U+109D2-U+109FF)
 * Sandhi mark, diacritical marks for Kashmiri, sign Siddham and punctuation marks (total 9 characters) were added to Sharada. (U+111C9-U+111CC and U+111DB-U+111DF)
 * Combining Anusvara Above and letter Om (total 2 characters) were added to Grantha. (U+11300 and U+11350)
 * Section marks and alternate letters (total 20 characters) were added to Siddham. (U+115CA-U+115DD)
 * An additional sign (total 1 character) was added to Cuneiform. (U+12399)
 * East-Slavic musical symbols (total 11 characters) were added to Musical Symbols. (U+1D1DE-U+1D1E8)
 * (total 24 characters) were added to Miscellaneous Symbols and Pictographs. (U+1F32D-U+1F32F, U+1F37E-U+1F37F, U+1F3CF-U+1F3D3, U+1F3F8-U+1F3FF, U+1F4FF and U+1F54B-U+1F54F)
 * Upside Down Face and Face With Rolling Eyes emoji (total 2 characters) were added to Emoticons. (U+1F643-U+1F644)
 * A Place of Worship emoji (total 1 character) was added to Transport and Map Symbols. (U+1F6D0)

Unicode 9.0
Unicode 9.0, was released in June 21, 2016. It encoded 128,237 characters, adding 7,500 new characters.

New blocks

 * Cyrillic Extended-C (U+1C80-U+1C8F), containing 9 letters, was added.
 * Osage (U+104B0-U+104FF), containing 72 letters, was added.
 * Newa (U+11400-U+1147F), containing 92 letters, was added.
 * Mongolian Supplement (U+11660-U+1167F), containing 13 letters, was added.
 * Bhaiksuki (U+11C00-U+11C6F), containing 97 letters, was added.
 * Marchen (U+11C70-U+11CBF), containing 68 letters, was added.
 * Ideographic Symbols and Punctuation (U+16FE0-U+16FFF), containing 1 letter, was added.
 * Tangut (U+17000-U+187FF), containing 6125 letters, was added.
 * Tangut Components (U+18800-U+18AFF), containing 755 letters, was added.
 * Glagolitic Supplement (U+1E000-U+1E02F), containing 38 letters, was added.
 * Adlam (U+1E900-U+1E95F), containing 87 letters, was added.

Extended blocks

 * Letters for Bravanese, Warsh and Quranic marks used in Pakistan (total 23 characters) were added to Arabic Extended-A. (U+08B6-U+08BD and U+08D4-U+08E2)
 * A sign Spacing Candrabindu (total 1 character) were added to Kannada. (U+0C80)
 * Sign Para, Chillu letters and vulgar fractions (total 14 characters) were added to Malayalam. (U+0D4F, U+0D54-U+0D56, U+0D58-U+0D5E and U+0D76-U+0D78)
 * A diacritical mark for Newa (total 1 character) was added to Combining Diacritical Marks Supplement. (U+1DFB)
 * Power symbols (total 4 characters) were added to Miscellaneous Technical. (U+23FB-U+23FE)
 * Punctuation marks for Church Slavonic (total 2 characters) were added to Supplemental Punctuation. (U+2E43-U+2E44)
 * A letter for Unifon (total 1 character) was added to Latin Extended-D. (U+A7AE)
 * A sign Candrabindu (total 1 character) was added to Saurashtra. (U+A8C5)
 * Indiction sign and a currency symbol (total 2 characters) were added to Ancient Greek Numbers. (U+1018D-U+1018E)
 * A sign Sukun (total 1 character) was added to Khojki. (U+1123E)
 * Japanese TV symbols (total 18 characters) were added to Enclosed Alphanumeric Supplement. (U+1F19B-U+1F1AC)
 * A Japanese TV symbol (total 1 character) was added to Enclosed Ideographic Supplement. (U+1F23B)
 * A dancing man and Black Heart emoji (total 2 characters) were added to Miscellaneous Symbols and Pictographs. (U+1F57A and U+1F5A4)
 * Octagonal Sign, Shopping Trolley, scooters and a Canoe emoji (total 5 characters) were added to Transport and Map Symbols. (U+1F6D1-U+1F6D2 and U+1F6F4-U+1F6F6)
 * (total 67 characters) were added to Supplemental Symbols and Pictographs. (U+1F919-U+1F91E, U+1F920-U+1F927, U+1F930, U+1F933-U+1F93E, U+1F940-U+1F94B, U+1F950-U+1F95E and U+1F985-U+1F991)

Variation Sequences
Here is a table with new standardized variation sequences:

Unicode 10.0
Unicode 10.0, was released in June 20, 2017. It encoded 136,690 characters, adding 8,453 new characters.

New blocks

 * Syriac Supplement (U+0860-U+086F), containing 11 characters, was added.
 * Zanabazar Square (U+11A00-U+11A4F), containing 72 characters, was added.
 * Soyombo (U+11A50-U+11AAF), containing 80 characters, was added.
 * Masaram Gondi (U+11D00-U+11D5F), containing 75 characters, was added.
 * Kana Extended-A (U+1B100-U+1B12F), containing 31 characters, was added.
 * Nushu (U+1B170-U+1B2FF), containing 396 characters, was added.
 * CJK Unified Ideographs Extension F (U+2CEB0-U+2EBEF), containing 7,473 characters, was added.

Extended blocks

 * A Vedic Anusvara and Abbreviation mark (total 2 characters) were added to Bengali. (U+09FC-U+09FD)
 * Letters for Arabic transliteration (total 6 characters) were added to Gujarati. (U+0AFA-U+0AFF)
 * A combining Anusvara Above and Viramas (total 3 characters) were added to Malayalam. (U+0D00 and U+0D3B-U+0D3C)
 * A sign Atikrama (total 1 character) was added to Vedic Extensions. (U+1CF7)
 * Combining diacritical marks for Church Slavonic (total 4 characters) were added to Combining Diacritical Marks Supplement. (U+1DF6-U+1DF9)
 * A Bitcoin sign (total 1 character) was added to Currency Symbols. (U+20BF)
 * An Observe Eye symbol (total 1 character) was added to Miscellaneous Technical. (U+23FF)
 * A Group mark (total 1 character) was added to Miscellaneous Symbols and Arrows. (U+2BD2)
 * Medieval punctuation marks (total 5 characters) were added to Supplemental Punctuation. (U+2E45-U+2E49)
 * A letter O with Dot Above (total 1 character) was added to Bopomofo. (U+312E)
 * Ideographs for Slavonic transliteration (total 21 characters) were added to CJK Unified Ideographs. (U+9FD6-U+9FEA)
 * Letters for North Italic (total 3 characters) were added to Old Italic. (U+1032D-U+1032F)
 * An Iteration mark for Nushu (total 1 character) was added to Ideographic Symbols and Punctuation. (U+16FE1)
 * Letters for Hentaigana (total 254 characters) were added to Kana Supplement. (U+1B002-U+1B0FF)
 * Symbols for Chinese Folk religion (total 6 characters) were added to Enclosed Ideographic Supplement. (U+1F260-U+1F265)
 * Stupa, Pagoda, Sled and Flying Saucer emoji (total 4 characters) were added to Transport and Map Symbols. (U+1F6D3-U+1F6D4 and U+1F6F7-U+1F6F8)
 * (total 66 characters) were added to Supplemental Symbols and Pictographs. (U+1F900-U+1F90B, U+1F91F, U+1F928-U+1F92F, U+1F931-U+1F932, U+1F94C, U+1F95F-U+1F96B, U+1F992-U+1F997 and U+1F9D0-U+1F9E6)

Unicode 11.0
Unicode 11.0, was released in June 5, 2018. It encoded 137,374 characters, adding 684 new characters.

New blocks

 * Georgian Extended (U+1C90-U+1CBF), containing 46 characters, was added.
 * Hanifi Rohingya (U+10D00-U+10D3F), containing 50 characters, was added.
 * Old Sogdian (U+10F00-U+10F2F), containing 40 characters, was added.
 * Sogdian (U+10F30-U+10F6F), containing 42 characters, was added.
 * Dogra (U+11800-U+1184F), containing 60 characters, was added.
 * Gunjala Gondi (U+11D60-U+11DAF), containing 63 characters, was added.
 * Makasar (U+11EE0-U+11EFF), containing 25 characters, was added.
 * Medefaidrin (U+16E40-U+16E9F), containing 91 characters, was added.
 * Mayan Numerals (U+1D2E0-U+1D2FF), containing 20 characters, was added.
 * Indic Siyaq Numbers (U+1EC70-U+1ECBF), containing 68 characters, was added.
 * Chess Symbols (U+1FA00-U+1FA6F), containing 14 characters, was added.

Extended blocks

 * Small letters Turned Ayb and Yi with Stroke (total 2 characters) were added to Armenian. (U+0560 and U+0588)
 * A triangle Yod (total 1 character) were added to Hebrew. (U+05EF)
 * A Dantayalan and currency symbols (total 3 characters) were added to N'Ko. (U+07FD-U+07FF)
 * A Small Low Waw (total 1 character) was added to Arabic Extended-A. (U+08D3)
 * A Sandhi mark (total 1 character) was added to Bengali. (U+09FE)
 * An Abbreviation mark (total 1 character) was added to Gurmukhi. (U+0A76)
 * A combining Anusvara Above (total 1 character) was added to Telugu. (U+0C04)
 * A sign Siddham (total 1 character) was added to Kannada. (U+0C84)
 * A letter for Buryat (total 1 character) was added to Mongolian. (U+1878)
 * Symbols for chess notation, astrological and half star symbols (total 43 characters) were added to Miscellaneous Symbols and Arrows. (U+2BBA-U+2BBC, U+2BD3-U+2BEB and 2BF0-U+2BFE)
 * Medieval punctuation marks (total 5 characters) were added to Supplemental Punctuation. (U+2E4A-U+2E4E)
 * A letter NN (total 1 character) was added to Bopomofo. (U+312F)
 * Some ideographs for Kanji (total 5 characters) were added to CJK Unified Ideographs. (U+9FEB-U+9FEF)
 * A small capital Q and a letter for Mazahua (total 3 characters) were added to Latin Extended-D. (U+A7AF and U+A7B8-U+A7B9)
 * Letter and vowel sign Ay (total 2 characters) were added to Devanagari Extended. (U+A8FE-U+A8FF)
 * Letters Ttta, Vha and a vulgar fraction (total 3 characters) were added to Kharoshthi. (U+10A34-U+10A35 and U+10A48)
 * A Number Sign Above (total 1 character) was added to Kaithi. (U+110CD)
 * Letter Lhaa, vowel sign Aa and Ei (total 3 characters) were added to Chakma. (U+11144-U+11146)
 * A combining Bindu Below (total 1 character) was added to Grantha. (U+1133B)
 * A Sandhi mark (total 1 character) was added to Newa. (U+1145E)
 * An alternate letter Ba (total 1 character) was added to Ahom. (U+1171A)
 * A mark Pluta (total 1 character) was added to Soyombo. (U+11A9D)
 * Additional ideographs (total 5 characters) were added to Tangut. (U+187ED-U+187F1)
 * Tally marks (total 7 characters) were added to Counting Rod Numerals. (U+1D372-U+1D378)
 * A Copyleft symbol (total 1 character) was added to Enclosed Alphanumeric Supplement. (U+1F12F)
 * A Skateboard emoji (total 1 character) was added to Transport and Map Symbols. (U+1F6F9)
 * Normal and negative circled shapes (total 4 characters) were added to Geometric Shapes Extended. (U+1F7D5-U+1F7D8)
 * (total 65 characters) were added to Supplemental Symbols and Pictographs. (U+1F94D-U+1F94F, U+1F96C-U+1F970, U+1F973-U+1F976, U+1F97A, U+1F97C-U+1F97F, U+1F998-U+1F99F, U+1F9A0-U+1F9A2, U+1F9B0-U+1F9B9, U+1F9C1-U+1F9C2 and U+1F9E7-U+1F9FF)

Variation Sequences
Here is a table with new standardized variation sequences:

Unicode 12.0
Unicode 12.0 was released on March 5, 2019. It encoded 137,928 characters, adding 555 new characters.

New blocks

 * Elymaic (U+10FE0-U+10FFF), containing 23 characters, was added.
 * Nandinagari (U+119A0-U+119FF), containing 65 characters, was added.
 * Tamil Supplement (U+11FC0-U+11FFF), containing 51 characters, was added.
 * Egyptian Hieroglyph Format Controls (U+13430-U+1343F), containing 9 characters, was added.
 * Small Kana Extension (U+1B130-U+1B16F), containing 7 characters, was added.
 * Nyiakeng Puachue Hmong (U+1E100-U+1E14F), containing 71 characters, was added.
 * Wancho (U+1E2C0-U+1E2FF), containing 59 characters, was added.
 * Ottoman Siyaq Numbers (U+1ED00-U+1ED4F), containing 61 characters, was added.
 * Symbols and Pictographs Extended-A (U+1FA70-U+1FAFF), containing 16 characters, was added.

Extended blocks

 * A sign Siddham (total 1 character) was added to Telugu. (U+0C77)
 * Letters for Pail and Sanskrit (total 15 characters) were added to Lao. (U+0E86, U+0E89, U+0E8C, U+0E8E-U+0E93, U+0E98, U+0EA0, U+0EA8-U+0EA9, U+0EAC and U+0EBA)
 * A sign Double Anusvara Antargomukha (total 1 character) was added to Vedic Extensions. (U+1CFA)
 * An astrological symbol and Hellschreiber Pause symbol (total 2 characters) were added to Miscellaneous Symbols and Arrows. (U+2BC9 and U+2BFF)
 * A Cornish Verse Divider (total 1 character) was added to Supplemental Punctuation. (U+2E4F)
 * Egyptological letters, Anglicana W and letters for early Pinyin (total 11 characters) were added to Latin Extended-D. (U+A7BA-U+A7BF and U+A7C2-U+A7C6)
 * Sinological phonetic letters (total 2 characters) were added to Latin Extended-E. (U+AB66-U+AB67)
 * A Vedic Anusvara (total 1 character) was added to Newa. (U+1145F)
 * An archaic letter Kha (total 1 character) was added to Takri. (U+116B8)
 * Sign Jihvamuliya and Uphadhmaniya (total 2 characters) were added to Soyombo. (U+11A84-U+11A85)
 * Letters for various Yi and Miao languages (total 16 characters) were added to Miao. (U+16F45-U+16F4A, U+16F4F and U+16F7F-U+16F87)
 * Marks for Ancient Chinese texts (total 2 characters) were added to Ideographic Symbols and Punctuation. (U+16FE2-U+16FE3)
 * Some additional ideographs (total 6 characters) were added to Tangut. (U+187F2-U+187F7)
 * A Nasalization mark (total 1 character) was added to Adlam. (U+1E94B)
 * A Spanish and Portuguese register mark (total 1 character) was added to Enclosed Alphanumeric Supplement. (U+1F16C)
 * Hindu Temple and Auto Rickshaw emoji (total 2 characters) were added to Transport and Map Symbols. (U+1F6D5 and U+1F6FA)
 * Large colored circles and boxes (total 12 characters) were added to Geometric Shapes Extended. (U+1F7E0-U+1F7EB)
 * (total 31 characters) were added to Supplemental Symbols and Pictographs. (U+1F90D-U+1F90F, U+1F93F, U+1F971, U+1F97B, U+1F9A5-U+1F9AA, U+1F9AE-U+1F9AF, U+1F9BA-U+1F9BF, U+1F9C3-U+1F9CA and U+1F9CD-U+1F9CF)
 * Heterodox chess symbols (total 84 characters) were added to Chess Symbols. (U+1FA00-U+1FA53)

Glyph Changes
Here is a table with glyph changes:

Variation Sequences
Here is a table with new standardized variation sequences:

Unicode 12.1
Unicode 12.1 was released on May 7, 2019. It encoded 137,929 characters, adding only 1 new character.

Extended blocks

 * A square era name Reiwa (total 1 character) was added to Enclosed CJK Letters and Months. (U+32FF)

Unicode 13.0
Unicode 13.0 was released on March 10, 2020. It encoded 143,859 characters, adding 5,930 new characters.

New blocks

 * Yezidi (U+10E80-U+10EBF), containing 47 characters, was added.
 * Chorasmian (U+10FB0-U+10FDF), containing 28 characters, was added.
 * Dives Akuru (U+11900-U+1195F), containing 72 characters, was added.
 * Lisu Supplement (U+11FB0-U+11FBF), containing 1 character, was added.
 * Khitan Small Script (U+18B00-U+18CFF), containing 470 characters, was added.
 * Tangut Supplement (U+18D00-U+18D08), containing 9 characters, was added.
 * Symbols for Legacy Computing (U+1FB00-U+1FBFF), containing 212 characters, was added.
 * CJK Unified Ideographs Extension G (U+30000-U+3134F), containing 4939 characters, was added.

Extended blocks

 * Letters for African languages and Punjabi (total 10 characters) were added to Arabic Extended-A. (U+08BE-U+08C7)
 * A sign Overline (total 1 character) was added to Oriya. (U+0B55)
 * A Vedic Anusvara (total 1 character) was added to Malayalam. (U+0D04)
 * A sign Candrabindu (total 1 character) was added to Sinhala. (U+0D81)
 * Combining diacritical marks for Scottish phonology (total 2 characters) were added to Combining Diacritical Marks Extended. (U+1ABF-U+1AC0)
 * A Japanese symbol for Type A Electronics (total 1 character) was added to Miscellaneous Symbols and Arrows. (U+2B97)
 * Cross patties and a Tironian sign Capita Et (total 3 characters) were added to Supplemental Punctuation. (U+2E50-U+2E52)
 * Letters for Taiwan and Cantonese language (total 5 characters) were added to Bopomofo Extended. (U+31BB-U+31BF)
 * Some disunified ideographs (total 10 characters) were added to CJK Unified Ideographs Extension A. (U+4DB6-4DBF)
 * Some ideographs for China (total 13 characters) were added to CJK Unified Ideographs. (U+9FF0-U+9FFC)
 * Letters for Gaulish (total 6 characters) were added to Latin Extended-D. (U+A7C7-U+A7CA and U+A7F5-U+A7F6)
 * An alternate sign Nasanta (total 1 character) was added to Syloti Nagri. (U+A82C)
 * Letter R With Midle Tilde and modifier letters for Scottish phonology (total 4 characters) were added to Latin Extended-E. (U+AB68-U+AB6B)
 * A symbol Ascia (total 1 character) was added to Ancient Symbols. (U+1019C)
 * A letter for Pali (total 1 character) was added to Chakma. (U+11147)
 * A vowel sign Prishthamatra E and Inverted Candrabindu (total 2 characters) were added to Sharada. (U+111CE and U+111CF)
 * Double comma, sign Jihvamuliya and Uphadhmaniya (total 3 characters) were added to Newa. (U+1145A and U+11460-U+11461)
 * Khitan Small Script Filler and reading marks for Vietnamese (total 3 characters) were added to Ideographic Symbols and Punctuation. (U+16FE4 and U+16FF0-U+16FF1)
 * Some additional components (total 13 characters) were added to Tangut Components. (U+18AF3-U+18AFF)
 * Creative Commons license symbols and Mask Work symbol (total 7 characters) were added to Enclosed Alphanumeric Supplement. (U+1F10D-U+1F10F, U+1F16D-1F16F and U+1F1AD)
 * Hut, Elevator, Pickup Truck and Roller Skate emoji (total 4 characters) were added to Transportation and Map Symbols. (U+1F6D6-U+1F6D7 and U+1F6FB-U+1F6FC)
 * Arrows for legacy computing (total 2 characters) were added to Supplemental Arrows-C. (U+1F8B0-U+1F8B1)
 * (total 10 characters) were added to Supplemental Symbols and Pictographs. (U+1F90C, U+1F972, U+1F977-U+1F978, U+1F9A3-U+1F9A4, U+1F9AB-U+1F9AD and U+1F9CB)
 * (total 41 characters) were added to Symbols and Pictographs Extended-A. (U+1FA74, U+1FA83-U+1FA86, U+1FA96-U+1FAA8, U+1FAB0-U+1FAB6, U+1FAC0-U+1FAC2 and U+1FAD0-U+1FAD6)
 * Gongche charaters for Kunqu Opera (total 7 characters) were added to CJK Unified Ideographs Extension B. (U+2A6D7-U+2A6DD)

Glyph Changes
Here is a table with glyph changes:

Unicode 14.0
Unicode 14.0 was released on September 14, 2021. It encoded 144,697 characters, adding 838 new characters.

New blocks

 * Arabic Extended-B (U+0870-U+089F), containing 41 characters, was added.
 * Vithkuqi (U+10570-U+105BF), containing 70 characters, was added.
 * Latin Extended-F (U+10780-U+107BF), containing 57 characters, was added.
 * Old Uyghur (U+10F70-U+10FAF), containing 26 characters, was added.
 * Unified Canadian Aboriginal Syllabics Extended-A (U+11AB0-U+11ABF), containing 16 characters, was added.
 * Cypro-Minoan (U+12F90-U+12FFF), containing 99 characters, was added.
 * Tangsa (U+16A70-U+16ACF), containing 89 characters, was added.
 * Kana Extended-B (U+1AFF0-U+1AFFF), containing 13 characters, was added.
 * Znamenny Musical Symbols (U+1CF00-U+1CFFF), containing 185 characters, was added.
 * Latin Extended-G (U+1DF00-U+1DFFF), containing 31 characters, was added.
 * Toto (U+1E290-U+1E2BF), containing 31 characters, was added.
 * Ethiopic Extended-B (U+1E7E0-U+1E7FF), containing 28 characters, was added.

Extended blocks

 * An End of Text punctuation mark (total 1 character) was added to Arabic. (U+061D)
 * Letters for Balti and Quranic orthography (total 12 characters) were added to Arabic Extended-A. (U+08B5 and U+08C8-U+08D2)
 * A sign Nukta and letter Nakaara Pollu (total 2 characters) were added to Telugu. (U+0C3C and U+0C5D)
 * A letter Nakaara Pollu (total 1 character) was added to Kannada. (U+0CDD)
 * A letter Ra, sign Pamudpod and archaic letter Ra (total 3 characters) were added to Tagalog. (U+170D, U+1715 and U+171F)
 * A fourth Free variation selector (total 1 character) was added to Mongolian. (U+180F)
 * Combining diacritical marks for extended IPA (total 14 characters) were added to Combining Diacritical Marks Extended. (U+1AC1-U+1ACE)
 * An archaic ligature Jnya and punctuation marks (total 3 characters) were added to Balinese. (U+1B4C and U+1B7D-U+1B7E)
 * A combining Dot Below Right (total 1 character) was added to Combining Diacritical Marks Supplement. (U+1DFA)
 * A Kyrgyz Som sign (total 1 character) was added to Currency Symbols. (U+20C0)
 * A letter Caudate Chrivi (total 2 characters) were added to Glagolitic. (U+2C2F and U+2C5F)
 * Medieval and phonetic punctuation marks (total 11 characters) were added to Supplemental Punctuation. (U+2E53-U+2E5D)
 * Some ideographs for Macao (total 3 characters) were added to CJK Unified Ideographs. (U+9FFD-U+9FFF)
 * Archaic European letters, modifier letters for Sokuon and Chatino orthography (total 13 characters) were added to Latin Extended-D. (U+A7C0-U+A7C1, U+A7D0-U+A7D1, U+A7D3, U+A7D5, U+A7D6-U+A7D9 and U+A7F2-U+A7F4)
 * A modifier letter Wasla Above and honorifics (total 20 characters) were added to Arabic Presentation Forms-A. (U+FBC2, U+FD40-U+FD4F, U+FDCF and U+FDFE-U+FDFF)
 * Letters for Old Tamil (total 6 characters) were added to Brahmi. (U+11070-U+11075)
 * A vowel sign Vocalic R (total 1 character) was added to Khaiti. (U+110C2)
 * An Abbreviation sign (total 1 character) was added to Takri. (U+116B9)
 * Letters for Tai Ahom (total 7 characters) were added to Ahom. (U+11740-U+11746) The block was expanded from (U+11700-U+1173F) to (U+11700-U+1174F)
 * Kana archaic letters (total 4 characters) were added to Kana Extended-A. (U+1B11F-U+1B122)
 * Accidental symbols for Iranian classical music (total 2 characters) were added to Musical Symbols. (U+1D1E9-U+1D1EA)
 * Playground Slide, Wheel and Ring Buoy emoji (total 3 characters) were added to Transportation and Map Symbols. (U+1F6DD-U+1F6DF)
 * A Heavy Equals Sign emoji (total 1 character) was added to Geometric Shapes Extended. (U+1F7F0)
 * A Troll and Face Holding Back Tears emoji (total 2 characters) were added to Supplemental Symbols and Pictographs. (U+1F979 and U+1F9CC)
 * (total 31 characters) were added to Symbols and Pictographs Extended-A. (U+1FA7B-U+1FA7C, U+1FAA9-U+1FAAC, U+1FAB7-U+1FABA, U+1FAC3-U+1FAC5, U+1FAD7-U+1FAD9, U+1FAE0-U+1FAE7 and U+1FAF0-U+1FAF6)
 * Some ideographs for Macao (total 2 characters) were added to CJK Unified Ideographs Extension B. (U+2A6DE-U+2A6DF)
 * Disunified ideographs and a G source ideograph for China, Hong Kong and Vietnam (total 4 characters) were added to CJK Unified Ideographs Extension C. (U+2B735-U+2B738)

Glyph Changes
Here is a table with glyph changes:

Variation Sequences
Here is a table with new standardized variation sequences:

Named Sequences
Here is a table with new named character sequences:

Unicode 15.0
Unicode 15.0 was released on September 13, 2022. It encoded 149,186 characters, adding 4,489 new characters.

New blocks

 * Arabic Extended-C (U+10EC0-U+10EFF), containing 3 characters, was added.
 * Devanagari Extended-A (U+11B00-U+11B5F), containing 10 characters, was added.
 * Kawi (U+11F00-U+11F5F), containing 86 characters, was added.
 * Kaktovik Numerals (U+1D2C0-U+1D2DF), containing 20 characters, was added.
 * Cyrillic Extended-D (U+1E030-U+1E08F), containing 63 characters, was added.
 * Nag Mundari (U+1E4D0-U+1E4FF), containing 42 characters, was added.
 * CJK Unified Ideographs Extension H (U+31350-U+323AF), containing 4192 characters, was added.

Removed blocks under ConScript Unicode Registry

 * Kaktovik Numerals (U+EBE0-U+EBFF), containing 20 characters, was removed.

Extended blocks

 * A Yamakkan (total 1 character) was added to Lao. (U+0ECE)
 * A combining Anusvara Above Right (total 1 character) was added to Kannada. (U+0CF3)
 * Letters Qa, Short I and Vocalic R (total 3 characters) were added to Khojki. (U+1123F-U+11241)
 * An additional hieroglyph to Group V (total 1 character) was added to Egyptian Hieroglyphs
 * Extended format controls (total 29 characters) were added to Egyptian Hieroglyph Format Controls. (U+13439-U+13455). The block was expanded from (U+13430-U+1343F) to (U+13430-U+1345F)
 * Hiragana and Katakana Small Ko (total 2 characters) were added to Small Kana Extension. (U+1B132 and U+1B155)
 * Letters for Malayalam transliteration (total 6 characters) were added to Latin Extended-G. (U+1DF25-U+1DF2A)
 * A Wireless emoji (total 1 character) was added to Transport and Map Symbols. (U+1F6DC)
 * A Nine Pointed White Star (total 1 character) was be added to Geometric Shapes Extended. (U+1F7D9)
 * A Lot of Fortune, eclipse symbols and symbols for dwarf planets (total 6 characters) were added to Alchemical symbols. (U+1F774-U+1F776 and U+1F77B-U+1F77F)
 * (total 20 characters) were added to Symbols and Pictographs Extended-A. (U+1FA75-U+1FA77, U+1FA87-U+1FA88, U+1FAAD-U+1FAAF, U+1FABB-U+1FABF, U+1FACE-U+1FACF, U+1FADA-U+1FADB, U+1FAE8 and U+1FAF7-U+1FAF8)
 * A disunified ideograph for Macao (total 1 character) was added to CJK Unified Ideographs Extension C. (U+2B739)

Glyph Changes
Here is a table with glyph changes:

Variation Sequences
Here is a table with new standardized variation sequences:

Unicode 15.1
Unicode 15.1 was released on September 12th, 2023. It encoded 149,794 characters, adding 627 new characters.

New blocks

 * CJK Unified Ideographs Extension I (U+2EBF0-U+2EE5F), containing 622 characters, was added.

Extended blocks

 * 4 Ideographic characters will be added to Ideographic Description Characters. (U+2FFC-U+2FFF)
 * An Ideographic subraction (total 1 character) will be added to CJK Strokes. (U+31EF)

Glyph Changes

 * Capital F with Stroke will get a new glyph (U+A798)
 * Y with Short Leg will get a new glyph (U+AB5A)

Unicode 15.2
Unicode 15.2 will be released on December 2023.

New Blocks

 * Cirth (U+16000-U+1607F), containing 104 characters will be added.
 * Tengwar (U+16080-U+160FF), containing 93 characters will be added.

Removed Blocks Under ConScript Unicode Registry

 * Tengwar (U+E000-U+E07F), containing 93 characters will be removed.
 * Cirth (U+E080-U+E0FF), containing 104 characters will be removed.

Extended Blocks

 * Letters for Cirth (total 7 characters) will be added to Runic. (U+16F9-U+16FF) (Note: The Rest of the Letters are in the block Cirth.)
 * Latin Capital Letter Double Thorn (total 1 character) will be added to Latin Extended-D. (U+A7D2)

Glyph Changes

 * Katakana Letter Archaic E will get a new glyph (U+1B000)
 * Hiragana Letter Archaic Ye will get a new glyph (U+1B001)
 * Hentaigana Letter Me-Ma will get a new glyph (U+1B0D6)
 * Hentaigana Letter Wi-1 will get a new glyph (U+1B10D)

Unicode 16.0
Unicode 16.0 will be released on September 2024.

New Blocks

 * Todhri (U+105C0-U+105FF), containing 52 characters will be added.
 * Garay (U+10D40-U+10D8F), containing 69 characters will be added.
 * Tulu-Tigalari (U+11380-U+113FF), containing 80 characters will be added.
 * Myanmar Extended-C (U+116D0-U+116FF), containing 20 characters will be added.
 * Sunuwar (U+11BC0-U+11BFF), containing 44 characters will be added.
 * Gurung Khema (U+16100-U+1613F), containing 58 characters will be added.
 * Kirat Rai (U+16D40-U+16D7F), containing 58 characters will be added.
 * Symbols for Legacy Computing Supplement (U+1CC00-U+1CEBF), containing 686 characters will be added.
 * Ol Onal (U+1E5D0-U+1E5FF), containing 44 characters will be added.

Extended Blocks

 * A combining diacritical mark for Jawi (total 1 character) will be added to Arabic Extended-B. (U+0897)
 * An archaic ligature Shrii (total 1 character) will be added to Telugu. (U+0C5C)
 * An archaic ligature Shrii (total 1 character) will be added to Kannada. (U+0CDC)
 * Inverted letters and a punctuation mark (total 3 characters) will be added to Balinese. (U+1B4E-U+1B4F and U+1B7F)
 * A letter Tje (total 2 characters) will be added to Cyrillic Extended-C. (U+1C89-U+1C8A)
 * Legacy computing symbols for Delete (total 3 characters) will be added to Control Pictures. (U+2427-U+2429)
 * A capital Rams Horn and an S with Diagonal Stroke (total 3 characters) will be added to Latin Extended-D. (U+A7CB-U+A7CD)
 * A combining Alef overlay and letters with two dots vertically below (total 4 characters) will be added to Arabic Extended-C. (U+10EC2-U+10EC4 and U+10EFC)
 * A sign Nukta (total 1 character) will be added to Kawi. (U+11F5A)
 * A rightwards arrow with hook and arrows for legacy computing (total 10 characters) will be added to Supplemental Arrows-C. (U+1F8B2-U+1F8BB)
 * Graphic shapes for legacy computing (total 37 characters) will be added to Symbols for Legacy Computing. (U+1FBCB-U+1FBEF)

Code Points Provisionally Assigned
This is a section where you can add any upcoming Unicode characters that have been provisionally assigned for mature proposals (but not yet accepted) for a future update of The Unicode Standard.

New Blocks

 * Sidetic (U+10940-U+1095F), containing 29 characters will be added.
 * Sharada Supplement (U+11B60-U+11B7F), containing 8 characters will be added.
 * Tolong Siki (U+11DB0-U+11DEF), containing 54 characters will be added.
 * Egyptian Hieroglyphs Extended-A (U+13460-U+143FF), containing 3994 characters will be added.
 * Chisoi (U+16D80-U+16DAF), containing 40 characters will be added.
 * Kana Extended-C (U+1AFD0-U+1AFEF), containing 32 characters will be added.
 * Shuishu Logograms (U+1B300-U+1B5FF), containing 471 characters will be added.
 * Tai Yo (U+1E6C0-U+1E6FF), containing 55 characters will be added.

Extended Blocks

 * An alternate letter Ba (total 1 character) will be added to Bengali. (U+09FF)
 * Compound tone diacritics (total 6 characters) were added to Combining Diacritical Marks Extended. (U+1AD0-U+1AD5)
 * 2 capital letters for Middle English, and letters for Wakashan and Salishan Languages (total 5 characters) will be added to Latin Extended-D. (U+A7D2, U+A7D4, U+A7DA-U+A7DC)
 * A Small Yeh Barree with Two Dots Below, Thin Noon, Biblical End of Verse, and Small Low Noon (total 3 characters) will be added to Arabic Extended-C. (U+10EC5-U+10EC6, U+10ED0 and U+10EFB)
 * A blank character (total 1 character) will be added to Khitan Small Script. (U+18CFF)
 * Katakana Letter Minnan Tone-6 and Katakana Letter Minnan Nasalized Tone-6 (total 2 characters) will be added to Kana Extended-B. (U+1AFF4, U+1AFFC)
 * Archaic Letters for Katakana (total 13 characters) will be added to Kana Extended-A. (U+1B123-U+1B12F)
 * A arrows for Egyptology (total 2 characters) will be added to Supplemental Arrows-C. (U+1F8C0-U+1F8C1)

Future Versions
This is a section where you can add any upcoming Unicode characters that have been confirmed to be in a future update of The Unicode Standard. Do not add any false information as this will confuse people into thinking that they are official.

New Blocks

 * Proto-Sinaitic (U+108B0-U+108DF), containing 19 characters will be added.
 * Book Pahlavi (U+10BB0-U+10BDF), containing 39 characters will be added.
 * Vatteluttu (U+11950-U+1199F), containing 57 characters will be added.
 * Indus (U+12E00-U+12F8F), containing 386 characters will be added.
 * Kpelle (U+16C00-U+16C7F), containing 116 characters will be added.
 * Khitan Ideographs (U+18D00-U+195FF), containing 2218 characters will be added.
 * Western Cham (U+1E200-U+1E26F), containing 105 characters will be added.
 * Persian Siyaq Numbers (U+1EC00-U+1EC6F), containing 105 characters will be added.

Extended Blocks

 * Extra Symbols for Mongolian (total 5 characters) will be added to Mongolian. (U+181A-U+181E)
 * Tai Don Letters (total 24 characters) will be added to Tai Viet. (U+AAC3-U+AADA)
 * Additional Vowel Signs for Kawi, Vocalic RR, Vocalic L, and Vocalic LL (total 3 characters) will be added to Kawi. (U+11F3B-U+11F3D)
 * Historic Ideographs for Tangut (total 108 characters) will be added to Tangut Supplement. (U+18D09-U+18D75)
 * Xiangqi Red and Black Bird (total 2 characters) will be added to Chess Symbols. (U+1FA6E-U+1FA6F)
 * Disunified Ideographs (total 2 characters) will be added to CJK Unified Ideographs Extension D. (U+2B81E-U+2B81F)

Roadmap Blocks
This is a section where present proportional maps of a proposed allocations to Unicode and ISO/IEC 10646. Italic indicates scripts for which detailed proposals have not yet been written.

Blocks

 * Kangxi Radical Format Controls (U+2EE0-U+2EEF)
 * Northern Palaeohispanic (U+10200-U+1023F)
 * Southern Palaeohispanic (U+10240-U+1027F)
 * Shavian Quikscript (U+103E0-U+103FF)
 * Rejang Extended (U+107C0-U+107FF)
 * Numidian (U+10960-U+1097F)
 * Balti-A (U+10AA0-U+10ABF)
 * Baburi (U+10BE0-U+10BFF)
 * Arabic Extended-D (U+10D90-U+10E5F)
 * Landa (U+11250-U+1127F)
 * Tani Lipi (U+114E0-U+114FF)
 * Ranjana (U+11500-U+1157F)
 * Zou (U+11750-U+117AF)
 * Pyu (U+117B0-U+117FF)
 * Sirmauri (U+11850-U+1188F)
 * Leke (U+11B80-U+11BBF)
 * Balti-B (U+11CC0-U+11CFF)
 * Tocharian (U+11E00-U+11E6F)
 * Khotanese (U+11E70-U+11ECF)
 * Pallava (U+11F60-U+11FAF)
 * Proto-Cuneiform (U+12580-U+12DFF)
 * Egyptian Hieroglyphs Extended-B (U+14680-U+151FF)
 * Bete Syllabary (U+15200-U+154FF)
 * Mayan Hieroglyphs (U+15500-U+15AFF)
 * Lampung (U+15B00-U+15B3F)
 * Kerinci (U+15B40-U+15B6F)
 * Mandombe (U+15B80-U+15FFF)
 * Moon (U+161A0-U+161FF)
 * Blissymbols (U+16200-U+167FF)
 * Woleai (U+16B90-U+16BFF)
 * Afaka (U+16C80-U+16CCF)
 * Khimhun Tangsa (U+16CD0-U+16CFF)
 * Tikamuli (U+16D00-U+16D3F)
 * Lontara Bilang-Bilang (U+16DB0-U+16DCF)
 * Kulitan (U+16DD0-U+16DFF)
 * Mwangwego (U+16E00-U+16E3F)
 * Buginese Supplement (U+16EA0-U+16EFF)
 * Bopomofo Extended-A (U+16FA0-U+16FAF)
 * Kanbun Extended-A (U+16FB0-U+16FDF)
 * Jurchen (U+19600-U+19B9F)
 * Pau Cin Hau Syllabary (U+19E00-U+1A2FF)
 * Eskaya (U+1A300-U+1A75F)
 * Kaida (U+1A780-U+1A7FF)
 * Naxi Dongba (U+1A800-U+1ACFF)
 * Naxi Geba (U+1AD00-U+1AFCF)
 * Lisu Syllabic Script (U+1B600-U+1B9FF)
 * Pitman Shorthands (U+1BCB0-U+1BCFF)
 * Proto-Elamite (U+1BD00-U+1C37F)
 * Linear-Elamite (U+1C380-U+1C4FF)
 * Tartaria Ideographs (U+1C500-U+1CBFF)
 * Chinese Musical Symbols (U+1D250-U+1D2AF)
 * Cistercian Numeral Format Controls (U+1D2B0-U+1D2BF)
 * Mathematical Alphanumeric Symbols Supplement (U+1D380-U+1D3FF)
 * Kodo Incense Linear Patterns (U+1DAD0-U+1DADF)
 * Jianzi Format Controls (U+1DAE0-U+1DAFF)
 * Jianzi Musical Symbols (U+1DB00-U+1DC8F)
 * Eebee Hmong (U+1E150-U+1E1FF)
 * Loma (U+1E300-U+1E41F)
 * Bagam (U+1E420-U+1E4CF)
 * Pungchen (U+1E500-U+1E52F)
 * Pungchung (U+1E530-U+1E55F)
 * Marchung (U+1E560-U+1E59F)
 * Brusha (U+1E5A0-U+1E5CF)
 * Chola (U+1E600-U+1E65F)
 * Chalukya (Box-Headed) (U+1E660-U+1E6BF)
 * Beria (U+1E700-U+1E72F)
 * Ditema Components (U+1E730-U+1E77F)
 * Byblos (U+1EB90-U+1EBFF)
 * Diwani Siyaq Numbers (U+1ECC0-U+1ECFF)
 * Extended Pictographic Characters (U+1FC00-U+1FFFF)
 * Seal Script (U+32400-U+34FFF)
 * Oracle Bone Script (U+35000-U+36FFF)
 * Bronze Script (U+37000-U+37FFF)
 * Seal Script Extension A (U+38000-U+3B3FF)
 * Yi Ideographs (U+3B400-U+3C9FF)
 * Seal Script Extension B (U+3CA00-U+3D3FF)
 * Seal Script Components (U+3D400-U+3DFFF)

Planes

 * Plane 0: Basic Multilingual Plane (BMP)
 * Plane 1: Supplementary Multilingual Plane (SMP)
 * Plane 2: Supplementary Ideographic Plane (SIP)
 * Plane 3: Tertiary Ideographic Plane (TIP)
 * Plane 4: Extended Multilingual Plane-A (EMP-A)
 * Plane 5: Extended Multilingual Plane-B (EMP-B)
 * Plane 6: Home Multilingual Plane (HMP)
 * Plane 7: Alternative Multilingual Plane (AMP)
 * Plane 8: Tertiary Multilingual Plane (TMP)
 * Plane 9: Complementary Multilingual Plane (CMP)
 * Plane 10: Complementary Ideographic Plane (CIP)
 * Plane 11: Supplementary Alternative Plane (SAP)
 * Plane 12: Supplementary Pictographic Plane (SPP)
 * Plane 13: Tertiary Pictographic Plane (TPP)
 * Plane 14: Supplementary Special-Purpose Plane (SSP)
 * Plane 15: Private Use Plane-A (PUA-A)
 * Plane 16: Private Use Plane-B (PUA-B)