Cherokee syllabary

The Cherokee syllabary is a syllabary invented by Sequoyah in the late 1810s and early 1820s to write the Cherokee language. His creation of the syllabary is particularly noteworthy as he was illiterate until its creation. He first experimented with logograms, but his system later developed into the syllabary. In his system, each symbol represents a syllable rather than a single phoneme; the 85 (originally 86) characters provide a suitable method for writing Cherokee. Although some symbols may resemble Latin, Greek, Cyrillic, and Glagolitic letters, they are not used to represent the same sounds.

Description
Each of the characters represents one syllable, as in the Japanese kana and the Bronze Age Greek Linear B writing systems. The first six characters represent isolated vowel syllables. Characters for combined consonant and vowel syllables then follow.

The charts below show the syllabary in recitation order, left to right, top to bottom as arranged by Samuel Worcester, along with his commonly used transliterations. He played a key role in the development of Cherokee printing from 1828 until his death in 1859.

The transliteration working from the syllabary uses conventional consonants like 'qu', 'ts',..., and may differ from the ones used in the phonological orthographies (first column in the below chart, in the "d/t system").

The Latin letter 'v' in the transcriptions, seen in the last column, represents the nasal mid central vowel,.

The Cherokee character Ꮩ (do) has a different orientation in old documents, resembling a Greek Λ (or barless A) rather than a Latin V as in modern documents.

There is also a handwritten cursive form of the syllabary; notably, the handwritten glyphs bear little resemblance to the printed forms.

Detailed considerations
The phonetic values of these characters do not equate directly to those represented by the letters of the Latin script. Some characters represent two distinct phonetic values (actually heard as different syllables), while others may represent multiple variations of the same syllable. Not all phonemic distinctions of the spoken language are represented:
 * Voiced consonants are generally not distinguished from their non-voiced counterpart. For example, while + vowel syllables are mostly differentiated from  + vowel by use of different glyphs, syllables beginning with  are all conflated with those beginning with.
 * Long vowels are not distinguished from short vowels. However, in more recent technical literature, length of vowels can actually be indicated using a colon, and other disambiguation methods for consonants (somewhat like the Japanese dakuten) have been suggested.
 * Tones are not marked.
 * Syllables ending in vowels, h, or glottal stop are not differentiated. For example, the single symbol Ꮡ is used to represent both suú as in suúdáli, meaning "six" (ᏑᏓᎵ), and súh as in súhdi, meaning "fishhook" (ᏑᏗ).
 * There is no regular rule for representing consonant clusters. When consonants other than s, h, or glottal stop arise in clusters with other consonants, a vowel must be inserted, chosen either arbitrarily or for etymological reasons (reflecting an underlying etymological vowel, see vowel deletion for instance). For example, ᏧᎾᏍᏗ represents the word juunsdi̋, meaning "small (pl.), babies". The consonant cluster ns is broken down by insertion of the vowel a, and is spelled as ᎾᏍ . The vowel is etymological as juunsdi̋ is composed of the morphemes di-uunii-asdii̋ʔi, where a is part of the root. The vowel is included in the transliteration, but is not pronounced.

As with some other underspecified writing systems (like Arabic), adult speakers can distinguish words by context.

If a labial consonant such as p or b appears in a borrowed word or name, it is written using the qu row. This ~  correspondence is a known linguistic phenomenon that exists elsewhere (cf. P-Celtic, Osco-Umbrian). The l and tl rows are similarly used for borrowings containing r or tr/dr, respectively, and s (including within ts) can represent /s/, /ʃ/, /z/, or /ʒ/, as indicated in the above word juunsdi̋.

Transliteration issues
Some Cherokee words pose a problem for transliteration software because they contain adjacent pairs of single letter symbols that (without special provisions) would be combined when doing the back-conversion from Latin script to Cherokee. Here are a few examples:

For these examples, the back conversion is likely to join s-a as sa or s-i as si, as the consonant s can be written either with its own isolated glyph, or combined with a following vowel -- but the vowel itself doesn't require being attached to a consonant. One solution is to use a middle dot to separate the two, similar to the use of an apostrophe in Japanese due to a similar problem with n as exists with Cherokee s, according to the modified Hepburn standard: itsalis·anedi. While some use an apostrophe instead, apostrophes are also used to represent a glottal stop in Cherokee.

Other Cherokee words contain character pairs that entail overlapping transliteration sequences. Examples:
 * ᏀᎾ transliterates as, yet so does ᎾᎿ. The former is nah-na, the latter is na-hna.
 * ᎤᏡᏀᎠ transliterates as, yet so does ᎤᏡᎾᎭ. The former is u-tlu-nah-a, the latter is u-tlu-na-ha.

If the Latin script is parsed from left to right, longest match first, then without special provisions, the back conversion would be wrong for the latter. There are several similar examples involving these character combinations: nahe nahi naho nahu nahv.

A further problem encountered in transliterating Cherokee is that there are some pairs of different Cherokee words that transliterate to the same word in the Latin script. Here are some examples: Without special provision, a round trip conversion may change ᎠᏍᎡᏃ to ᎠᏎᏃ and change ᎨᏍᎥᎢ to ᎨᏒᎢ.
 * ᎠᏍᎡᏃ and ᎠᏎᏃ both transliterate to
 * ᎨᏍᎥᎢ and ᎨᏒᎢ both transliterate to

Character orders

 * The usual alphabetical order for Cherokee runs across the rows of the syllabary chart from left to right, top to bottom—this is the one used in the Unicode block:

Ꭰ (a), Ꭱ (e), Ꭲ (i), Ꭳ (o), Ꭴ (u), Ꭵ (v),

Ꭶ (ga), Ꭷ (ka), Ꭸ (ge), Ꭹ (gi), Ꭺ (go), Ꭻ (gu), Ꭼ (gv),

Ꭽ (ha), Ꭾ (he), Ꭿ (hi), Ꮀ (ho), Ꮁ (hu), Ꮂ (hv),

Ꮃ (la), Ꮄ (le), Ꮅ (li), Ꮆ (lo), Ꮇ (lu), Ꮈ (lv),

Ꮉ (ma), Ꮊ (me), Ꮋ (mi), Ꮌ (mo), Ꮍ (mu),

Ꮎ (na), Ꮏ (hna), Ꮐ (nah), Ꮑ (ne), Ꮒ (ni), Ꮓ (no), Ꮔ (nu), Ꮕ (nv),

Ꮖ (qua), Ꮗ (que), Ꮘ (qui), Ꮙ (quo), Ꮚ (quu), Ꮛ (quv),

Ꮜ (sa), Ꮝ (s), Ꮞ (se), Ꮟ (si), Ꮠ (so), Ꮡ (su), Ꮢ (sv),

Ꮣ (da), Ꮤ (ta), Ꮥ (de), Ꮦ (te), Ꮧ (di), Ꮨ (ti), Ꮩ (do), Ꮪ (du), Ꮫ (dv),

Ꮬ (dla), Ꮭ (tla), Ꮮ (tle), Ꮯ (tli), Ꮰ (tlo), Ꮱ (tlu), Ꮲ (tlv),

Ꮳ (tsa), Ꮴ (tse), Ꮵ (tsi), Ꮶ (tso), Ꮷ (tsu), Ꮸ (tsv),

Ꮹ (wa), Ꮺ (we), Ꮻ (wi), Ꮼ (wo), Ꮽ (wu), Ꮾ (wv),

Ꮿ (ya), Ᏸ (ye), Ᏹ (yi), Ᏺ (yo), Ᏻ (yu), Ᏼ (yv).


 * Cherokee has also been alphabetized based on the six columns of the syllabary chart from top to bottom, left to right:

Ꭰ (a), Ꭶ (ga), Ꭷ (ka), Ꭽ (ha), Ꮃ (la), Ꮉ (ma), Ꮎ (na), Ꮏ (hna), Ꮐ (nah), Ꮖ (qua), Ꮝ (s), Ꮜ (sa), Ꮣ (da), Ꮤ (ta), Ꮬ (dla), Ꮭ (tla), Ꮳ (tsa), Ꮹ (wa), Ꮿ (ya),

Ꭱ (e), Ꭸ (ge), Ꭾ (he), Ꮄ (le), Ꮊ (me), Ꮑ (ne), Ꮗ (que), Ꮞ (se), Ꮥ (de), Ꮦ (te), Ꮮ (tle), Ꮴ (tse), Ꮺ (we), Ᏸ (ye),

Ꭲ (i), Ꭹ (gi), Ꭿ (hi), Ꮅ (li), Ꮋ (mi), Ꮒ (ni), Ꮘ (qui), Ꮟ (si), Ꮧ (di), Ꮨ (ti), Ꮯ (tli), Ꮵ (tsi), Ꮻ (wi), Ᏹ (yi),

Ꭳ (o), Ꭺ (go), Ꮀ (ho), Ꮆ (lo), Ꮌ (mo), Ꮓ (no), Ꮙ (quo), Ꮠ (so), Ꮩ (do), Ꮰ (tlo), Ꮶ (tso), Ꮼ (wo), Ᏺ (yo),

Ꭴ (u), Ꭻ (gu), Ꮁ (hu), Ꮇ (lu), Ꮍ (mu), Ꮔ (nu), Ꮚ (quu), Ꮡ (su), Ꮪ (du), Ꮱ (tlu), Ꮷ (tsu), Ꮽ (wu), Ᏻ (yu),

Ꭵ (v), Ꭼ (gv), Ꮂ (hv), Ꮈ (lv), Ꮕ (nv), Ꮛ (quv), Ꮢ (sv), Ꮫ (dv), Ꮲ (tlv), Ꮸ (tsv), Ꮾ (wv), Ᏼ (yv).


 * Sequoyah used a completely different alphabetical order:

Ꭱ (e), Ꭰ (a), Ꮃ (la), Ꮵ (tsi), Ꮐ (nah), Ꮽ (wu), Ꮺ (we), Ꮅ (li), Ꮑ (ne), Ꮌ (mo),

Ꭹ (gi), Ᏹ (yi), Ꮟ (si), Ꮲ (tlv), Ꭳ (o), Ꮇ (lu), Ꮄ (le), Ꭽ (ha), Ꮼ (wo), Ꮰ (tlo),

Ꮤ (ta), Ᏼ (yv), Ꮈ (lv), Ꭿ (hi), Ꮝ (s), Ᏺ (yo), Ᏽ (mv), Ꮁ (hu), Ꭺ (go), Ꮷ (tsu),

Ꮍ (mu), Ꮞ (se), Ꮠ (so), Ꮯ (tli), Ꮘ (qui), Ꮗ (que), Ꮜ (sa), Ꮖ (qua), Ꮓ (no), Ꭷ (ka),

Ꮸ (tsv), Ꮢ (sv), Ꮒ (ni), Ꭶ (ga), Ꮩ (do), Ꭸ (ge), Ꮣ (da), Ꭼ (gv), Ꮻ (wi), Ꭲ (i),

Ꭴ (u), Ᏸ (ye), Ꮂ (hv), Ꮫ (dv), Ꭻ (gu), Ꮶ (tso), Ꮙ (quo), Ꮔ (nu), Ꮎ (na), Ꮆ (lo),

Ᏻ (yu), Ꮴ (tse), Ꮧ (di), Ꮾ (wv), Ꮪ(du), Ꮥ (de), Ꮳ (tsa), Ꭵ (v), Ꮕ (nv), Ꮦ (te),

Ꮉ (ma), Ꮡ (su), Ꮱ (tlu), Ꭾ (he), Ꮀ (ho), Ꮋ (mi), Ꮭ (tla), Ꮿ (ya), Ꮹ (wa), Ꮨ (ti),

Ꮮ (tle), Ꮏ (hna), Ꮚ (quu), Ꮬ (dla), Ꮊ (me), Ꮛ (quv).

Numerals
Cherokee generally uses Arabic numerals (0–9). In the late 1820s, several years after the introduction and adoption of his syllabary, Sequoyah proposed a set of number signs for Cherokee; however, these were never adopted and never typeset. In 2012, the Cherokee Language Consortium agreed to begin using Sequoyah's numerals in some instances.

Sequoyah developed unique characters for 1 through 19, and then characters for the "tens" of 20 through 100. Additional symbols were used to note thousands and millions, and Sequoyah also used a final symbol to mark the end of a number. The glyphs for 1 through 20 can be grouped into groups of five that have a visual similarity to each other (1–5, 6–10, 11–15, and 16–20). The Cherokee Language Consortium has created an additional symbol for zero along with symbols for billions and trillions. As of Unicode 13.0, Cherokee numerals are not encoded within Unicode.

Sequoyah's proposed numeral system has been described as having a "ciphered-additive structure," using combinations of the characters for 1 through 9 with the characters for 20 through 100 to create larger numbers. For example, instead of writing 64, the Cherokee numerals for 60 and 4 would be written together. To write 10 through 19, unique characters for each number are employed. For numbers larger than 100, the system takes on features of a multiplicative-additive system, with the digits for 1 through being placed before the hundred, thousand, or million sign to indicate large numbers; for example, for 504, the Cherokee numerals for 5, 100, and 4 would be written together.

Early history


Around 1809, impressed by the "talking leaves" of European written languages, Sequoyah began work to create a writing system for the Cherokee language. After attempting to create a character for each word, Sequoyah realized this would be too difficult and eventually created characters to represent syllables. He worked on the syllabary for twelve years before completion and dropped or modified most of the characters he originally created.

After the syllabary was completed in the early 1820s, it achieved almost instantaneous popularity and spread rapidly throughout Cherokee society. By 1825, the majority of Cherokees could read and write in their newly developed orthography.

Some of Sequoyah's most learned contemporaries immediately understood that the syllabary was a great invention. For example, when Albert Gallatin, a politician and trained linguist, saw a copy of Sequoyah's syllabary, he believed it was superior to the English alphabet in that literacy could be easily achieved for Cherokee at a time when only one-third of English-speaking people achieved the same goal. He recognized that even though the Cherokee student must learn 85 characters instead of 26 for English, the Cherokee could read immediately after learning all the symbols. The Cherokee student could accomplish in a few weeks what students of English writing might require two years to achieve.

In 1828, the order of the characters in a chart and the shapes of the characters were modified by Cherokee author and editor Elias Boudinot to adapt the syllabary to printing presses. The 86th character was dropped entirely. Following these changes, the syllabary was adopted by the Cherokee Phoenix newspaper, later Cherokee Advocate, followed by the Cherokee Messenger, a bilingual paper printed in Indian Territory in the mid-19th century.

In 1834, Worcester made changes to several characters in order to improve the readability of Cherokee text. Most notably, he inverted the do character (Ꮩ) so that it could not be confused with the go character (Ꭺ). Otherwise, the characters remained remarkably invariant until the advent of new typesetting technologies in the 20th century.

Later developments
In the 1960s, the Cherokee Phoenix Press began publishing literature in the Cherokee syllabary, including the Cherokee Singing Book. A Cherokee syllabary typewriter ball was developed for the IBM Selectric in the late 1970s. Computer fonts greatly expanded Cherokee writers' ability to publish in Cherokee. In 2010, a Cherokee keyboard cover was developed by Roy Boney, Jr. and Joseph Erb, facilitating more rapid typing in Cherokee. The keyboard cover is now used by students in the Cherokee Nation Immersion School, where all coursework is written in syllabary.

In August 2010, the Oconaluftee Institute for Cultural Arts in Cherokee, North Carolina, acquired a letterpress and had the Cherokee syllabary recast to begin printing one-of-a-kind fine art books and prints in syllabary. Artists Jeff Marley and Frank Brannon completed a collaborative project on October 19, 2013, in which they printed using Cherokee syllabary type from Southwestern Community College in the print shop at New Echota. This was the first time syllabary type has been used at New Echota since 1835.

In 2015, the Unicode Consortium encoded a lowercase version of the script, since typists would often set Cherokee with two different point sizes, so as to mark beginnings of sentences and given names (as in the Latin alphabet). Handwritten Cherokee also shows a difference in lower- and uppercase letters, such as descenders and ascenders. Lowercase Cherokee has already been encoded in the font Everson Mono.

The syllabary is finding increasingly diverse usage today, from books, newspapers, and websites to the street signs of Tahlequah, Oklahoma, and Cherokee, North Carolina. An increasing corpus of children's literature is printed in Cherokee to meet the needs of students in Cherokee language immersion schools in Oklahoma and North Carolina.

Possible influence on Liberian Vai syllabary
In the 1960s evidence emerged suggesting that the Cherokee syllabary of North America provided a model for the design of the Vai syllabary in Liberia. The Vai syllabary emerged about 1832/33. This was at a time when American missionaries were working to use the Cherokee syllabary as a model for writing Liberian languages. Another link appears to have been Cherokee who emigrated to Liberia after the invention of the Cherokee syllabary (which in its early years spread rapidly among the Cherokee) but before the inventions of the Vai syllabary. One such man, Austin Curtis, married into a prominent Vai family and became an important Vai chief himself. It is perhaps not coincidence that the "inscription on a house" that drew the world's attention to the existence of the Vai script was in fact on the home of Curtis, a Cherokee. There also appears to be a connection between an early form of written Bassa and the earlier Cherokee syllabary.

Classes
Cherokee language classes typically begin with a transliteration of Cherokee into Roman letters, only later incorporating the syllabary. The Cherokee language classes offered through Haskell Indian Nations University, Northeastern State University, the University of Oklahoma, the University of Science and Arts of Oklahoma, Western Carolina University, the University of North Carolina at Chapel Hill, and the immersion elementary schools offered by the Cherokee Nation and the Eastern Band of Cherokee Indians, such as New Kituwah Academy, all teach the syllabary. The fine arts degree program at Southwestern Community College incorporates the syllabary in its printmaking classes.

Unicode
Cherokee was added to the Unicode Standard in September 1999, with the release of version 3.0. On June 17, 2015, with the release of version 8.0, Cherokee was redefined as a bicameral script; the character repertoire was extended to include a complete set of lowercase Cherokee letters as well as the archaic character (Ᏽ).

Blocks
The first Unicode block for Cherokee is U+13A0–U+13FF. It contains all 86 uppercase letters, together with six lowercase letters:

The Cherokee Supplement block is U+AB70–U+ABBF. It contains the remaining 80 lowercase letters.

Fonts
A single Cherokee Unicode font, Plantagenet Cherokee, is supplied with macOS, version 10.3 (Panther) and later. Windows Vista also includes a Cherokee font. Windows 10 replaced Plantagenet Cherokee with Gadugi after the Cherokee language term for "working together".

Several free Cherokee fonts are available including Digohweli, Donisiladv, and Noto Sans Cherokee. Some pan-Unicode fonts, such as Code2000, Everson Mono, and GNU FreeFont, include Cherokee characters. A commercial font, Phoreus Cherokee, published by TypeCulture, includes multiple weights and styles.