Written Hokkien

Hokkien, a variety of Chinese that forms part of the Southern Min family and is spoken in Southeastern China, Taiwan and Southeast Asia, does not have a unitary standardized writing system, in comparison with the well-developed written forms of Cantonese and Standard Chinese (Mandarin). In Taiwan, a standard for Written Hokkien has been developed by the Ministry of Education including its Dictionary of Frequently-Used Taiwan Minnan, but there are a wide variety of different methods of writing in Vernacular Hokkien. Nevertheless, vernacular works written in Hokkien are still commonly seen in literature, film, performing arts and music.

History
Prior to the modern era, the main written language of China was Classical Chinese, which has grammar and vocabulary based on Old Chinese used in ancient times. Whilst the written form of Chinese mostly remained static, the spoken varieties of Chinese diverged from Old Chinese. In the early 20th century, reformers in China saw the need for language reform and championed the development of a writing system that allowed Chinese people to write in a form more closely reflecting speech called written vernacular Chinese.

However, there are various differences between the spoken Chinese varieties, such as Hokkien, Mandarin, Cantonese, including in vocabulary, meaning that Vernacular Chinese is less suited for writing texts spoken in Hokkien. Various expressions in Hokkien, as with other Chinese varieties, do not have associated Chinese characters in Vernacular Chinese, meaning that some words originally could not be written. In the case of Cantonese, a vernacular system specifically for writing Cantonese was developed in Hong Kong, then a British colony. On the other hand, since Hokkien was never standardized, different people began to use various separate methods to solve the issue of Hokkien-specific words, where such words would eventually be written phonetically, using either a Latin-based alphabet, bopomofo, or through the use of Chinese characters as loangraphs for their phonetically with no relation to the original word via meaning.

Varieties of Hokkien are spoken in the Southern Fujian in mainland China, Taiwan, Chinese communities in Malaysia, Singapore, and other Chinese expatriate communities. Initially there was no effort by the government of the Republic of China on Taiwan, nor other governments, to create a standardized Hokkien vernacular. During the initial stages of Kuomintang rule in Taiwan, the official Kuomintang language policy was to promote the use of Mandarin Chinese in everyday speech, and to discourage the use of other dialects such as Hokkien and Hakka; this was done in an attempt to promote national linguistic unity, and to promulgate a Chinese identity over that of a Taiwanese one for political reasons. Following the Taiwan localization movement, education and everyday usage of spoken and written Hokkien by local Taiwanese became more widely used. A Chinese character online dictionary for Hokkien was released in 2008 by the ROC Ministry of Education. Nevertheless, within literature circles there is still ongoing debate over which writing system should be used to write Taiwanese Hokkien, and controversy exists between the various rival systems currently used to write Hokkien. Today usage of languages remains a politicized issue in Taiwan. In Singapore, in an effort to promote Mandarin as a lingua franca amongst ethnic Chinese through the Speak Mandarin Campaign, usage of other varieties such as Hokkien is discouraged.

Today, whilst Taiwanese Hokkien speakers speak in their variety of Hokkien, they would officially write in Vernacular Chinese for formal documents, and only use vernacular Hokkien writings during informal occasions, if at all. In Taiwan, vernacular Chinese is used for academic writings, newspaper articles and television news report headlines, whilst Hokkien writings are used in novels, songs lyrics, film subtitles, theatrical and opera scripts, and in informal communication.



(白話字) is a Latin alphabet developed by Western missionaries working in Southeast Asia in the 19th century to write Hokkien. allows Hokkien to be written phonetically in Latin script, meaning that phrases specific to Hokkien can be written without having to deal with the issue of non-existent Chinese characters. Current usage of is restricted to some Taiwanese Christians, non-native learners of Hokkien, and native-speaker enthusiasts in Taiwan. POJ remains the Taiwanese script with "the richest inventory of written work, including dictionaries, textbooks, literature [...] and other publications in many areas".

can also be used along with Chinese characters in a mixed script called, where words specific to Hokkien are written in , and words with associated characters written in Han Characters.

Sample mixed orthography text:  "翻 tńg 工，我 koh hap i tī Hotel ê 餐廳食西式 ê chái 起，我講 beh tò 去稅厝 ê 所在，i beh 送我去，我 kā 拒絕，mā 無 beh hō͘ i 知我 ê 地址、電話番，講若有緣就會 koh 再相會. I 講人海茫茫，我若無 tī hit 間跳舞、唱歌，i beh 去 toh 位 chhōe--我？「就是 án-ni m̄-chiah 講是緣」，我嘴是 án-ni 應，心肝內知影 kap i 自細漢到這時 ê 牽連、綿纏無 hiah 簡單就煞."

Bopomofo
Bopomofo is another script used in Taiwanese Hokkien writings. It is commonly used in Taiwanese literature to represent Hokkien-specific grammatical particles, along with Chinese characters, and can also be used to gloss Chinese characters with their Hokkien readings.
 * Sample text: 我像離水ㄟ魚 ('I am like the fish that has left the water', with ㄟ being used as a replacement for ㆤ ê .)

Taiwanese kana
During the period of Taiwan under Japanese rule, a kana-based system was introduced to gloss Hokkien writing in Chinese characters, as well writing as other languages of Taiwan.

Chinese characters
Writing Hokkien using Chinese characters 漢字 or 唐人字, ) is a common method of writing in Taiwanese literature. However, there are various problems relating to the use of Chinese characters to write vernacular Hokkien, and in many cases Chinese characters are used alongside other scripts, such as bopomofo or POJ. The problem with using only Chinese characters to write Min Nan is that there are many morphemes, estimated to be around 15% of running text, which are not definitively associated with a particular character. Various strategies have been developed to deal with the issue, including researching and reviving the etymologically correct Chinese character from ancient Chinese texts and classical Chinese, creating new characters, allocating Chinese characters used in written Mandarin with similar meanings, but dissimilar etymology, to represent the missing characters, or using romanization for the missing 15%.

In many cases, when writing Hokkien using Chinese characters phonetically, the use of characters is entirely unrelated to the original meaning of the phrase. While most Hokkien morphemes have standard designated characters, they are not always etymologically correct or phono-semantic. Similar-sounding, similar-meaning or rare characters are commonly borrowed or substituted to represent a particular morpheme. In addition, there may be many different ways to write a specific Hokkien phrase using Chinese characters. Wanhua District in Taipei is commonly written as ('ten thousand flowers'); however, the original meaning of the location name, pronounced Báng-kah in Taiwanese Hokkien (old character form ), is a rendering of a non-Chinese indigenous name for 'boat'. Along with location names, common words also have orthography problems due to non-standardization; chhit-thô or thit-thô ('play (around)') is commonly written as 𨑨迌; however, neither characters have anything to do with the act of "playing" (the characters mean 'near' and 'cunning', 'deceitful'). Within Robert Cheng's publication of a Han character edition of the Taiwanese Hokkien novella Khó-ài ê Sîu-jîn ('Beloved Enemy') by Lai Jinsheng, the word lô͘-môa, meaning 'gangster' and cognate with Standard Chinese ), is transcribed as ; these two phonetically used characters literally translate to 'perch' and 'eel'.

Additional examples include the word for 'beautiful' ( is the literary form), which has the vernacular morpheme suí represented by characters such as (an obsolete character),  (a vernacular reading of this character) and even  (suí, typically meaning 'water'), and 'tall'  (ko is the literary form), whose morpheme kôan is. Common grammatical particles are not exempt; the negation particle m̄ ('not') is variously represented by 毋, 呣 or 唔, among others. In other cases, characters are invented to represent a particular morpheme (a common example is the non-standard character in, which represents the personal pronoun 'they'). In addition, some characters have multiple and unrelated pronunciations, adapted to represent Hokkien words. For example, the Hokkien word bah ('meat') has been reduced to writing via, which has etymologically unrelated colloquial and literary readings (he̍k and jio̍k, respectively). In other cases, a character is borrowed to represent a morpheme with the same meaning, but a different pronunciation; the morpheme kē ('low', as in kē-thn̂g 'low sugar') has been assigned the character, whose phonetic reading is te, by the Republic of China's Ministry of Education. Another case is the word 'to eat', chia̍h, which is often transcribed in Taiwanese newspapers and media as (a Mandarin transliteration, xiā, to approximate the Hokkien term), even though its recommended character in dictionaries is.

Victor H. Mair makes an estimate that if "pure, unadulterated spoken vernacular Taiwanese" were written exclusively in Chinese characters, with minimal use of Mandarin phrases, over 25% of morphemes would have no character, about 25% would have arbitrarily selected (yet more or less conventionally accepted) characters that are homophones or near-homophones, 10% would be written using characters exclusive to Hokkien, and 40% would be written with characters that have the correct sound and meaning. However, in more colloquial styles of Taiwanese Hokkien, the proportion of morphemes written with conventionally accepted characters would drop even lower than 40%. Likewise, Carstairs Douglas, who has compiled a historical comprehensive 1873 Dictionary on Hokkien as well that later formed the basis of many other dictionaries for Hokkien in the subsequent decades, regarding Chinese characters would argue as well that:

"authorCarstairs Douglas"

Moreover, unlike Cantonese, Hokkien does not have a universally accepted standardized character set. Thus, there is some variation in the characters used to express certain words and characters and they can be ambiguous in meaning. In 2007, Taiwan's Ministry of Education formulated and released a standard character set known as the Taiwanese Southern Min Recommended Characters to overcome these difficulties. These standard Chinese characters for writing Taiwanese Hokkien are now taught in schools in Taiwan.

When writing Hokkien in Chinese characters, some writers create 'new' characters when they consider it impossible to use existing ones; this corresponds to similar practices in character usage in Cantonese, Vietnamese chữ Nôm, Korean hanja and Japanese kanji. These are usually not encoded in Unicode.

The earliest Hokkien vernacular literature written in hàn-jī is Tân Saⁿ and Gō͘-niû in the Ming dynasty, and koa-á-chheh is also an important kind of hàn-jī vernacular literature in the history of the Hokkien language.

Vocabulary
The following table lists a few examples displaying differences in vocabulary between Vernacular Chinese based on Mandarin, and Taiwanese Hokkien written in Chinese characters:

In addition, Hokkien literature can consist of phrases that are vernacular to Hokkien, as well as literary terms originating from Classical Chinese. The following list of Taiwanese Hokkien words is adapted from a list by scholar Ong Iok-tek, contrasting vernacular terms with relevant literary terms; the English translations have been added by Mair.

Literary and colloquial character readings
Hokkien separates reading pronunciations from spoken pronunciations  and explications  of Chinese characters (see Literary and colloquial readings of Chinese characters). The following examples in Pe̍h-oē-jī show differences in readings in Taiwanese Hokkien: