Pegon script

Pegon (Javanese and Sundanese:, ; also known as , , Madurese: أبجاْد ڤَيگو, Abjâd Pèghu) is a modified Arabic script used to write the Javanese, Sundanese, and Madurese languages, as an alternative to the Latin script or the Javanese script and the Old Sundanese script. It was used in a variety of applications, from religion, to diplomacy, to poetry. But today particularly, it is used for religious (Islamic) writing and poetry, particularly in writing commentaries of the Qur'an. Pegon includes letters that are not present in Modern Standard Arabic. Pegon has been studied far less than its Jawi counterpart which is used for Malay, Acehnese and Minangkabau.

In the past few decades, the Indonesian language has grown in its prominence and role as the national language of Indonesia. Thus, publishing institutions associated with religious schools have further developed new teaching material, in order to expand the use of Pegon script to Indonesian language as well. Indonesian language, being a variety of Malay, has also been written by the sister script of Pegon, Jawi.

Etymology
The word Pegon originated from the Javanese word, meaning "deviate", due to the practice of writing the Javanese language with Arabic script, which was considered unconventional by Javanese people.

History
One of the earliest dated examples of the usage of Pegon may be Masa'il al-ta'lim, a work on Islamic law written in Arabic with interlinear translation and marginal commentary in Javanese. The manuscript is dated 1623 and written on, a paper made from the bark of the mulberry tree.

Letters
Pegon uses the original letters of the Arabic script plus an additional seven letters to represent native Javanese sounds not present in Arabic: $⟨⟩$, $⟨⟩$, $⟨⟩$, $⟨⟩$, $⟨⟩$, $⟨⟩$, and $⟨⟩$. One additional letter is used in foreign loanwords $⟨⟩$. These new letters are formed by the addition of dots to base letter forms. Pegon is not standardized and variation can be seen in how these additional letters are represented, most commonly in the position of the dots (above or below) and the number of dots (one, two or three). In more recent teaching material, additional letters have been added in order to use the script for writing Indonesian language.

Vowel diacritics
Arabic script is an abjad, meaning that for the most part, only consonants are written. Arabic has three vowels, which may be short or long. There are three letters in Arabic that can also represent long vowels, but in general, short vowels diacritics are only used diacritic in religious texts and those meant for beginner learners. The phonology of Javanese, Sundanese, and Madurese is quite different. There are six vowels, and no marking of vowel length. So, the script has been adapted by using the vowel diacritics in conjuncton with, and to fully represent the vowels of Javan languages.

The prevalence of diacritic marking in Pegon varies from marking every letter, to being present only to differentiate particular vowel sounds. A version of the script which uses few diacritics, is called bare or bald. To a fluent reader, the base letters are often sufficient to recognise word, rendering the diacritics unnecessary. So, for example, the word Indonesia may be written fully vocalised, or bare. It is increasingly common in printed books to only consistently use the e-pepêt, with the other diacritics only used when disambiguation is needed.

Full marking of letters is common in most formal texts, including religious texts and historic diplomatic manuscripts.

Syllables
A vowel at the beginning of a word is indicated by the letter alif $⟨⟩$, plus diacritic, and a follow-up letter $⟨⟩$ or $⟨⟩$ if required. If present, the follow-up letter is written with a sukun to indicate that it is part of the first syllable and not the start of a new one. A vowel following a consonant (such as the letter $⟨é⟩$ in the example below), a following vowel is indicated by diacritics but without the letter alif.

Consonant clusters
In pegon, consonant clusters are written in two ways. In clusters that consist of a nasal consonant followed by a liquid consonant, such as [mr], [ml], or [ŋl], or of an obstruent consonant followed by a plosive consonant, such as [tr], [pl], or [by], the first consonant is modified by an epenthetic e-pepet $⟨i⟩$.

When a consonant cluster consists of a nasal consonant followed by a plosive consonant, like [nj], [mb], or [nd], a prothetic alif is added to the beginning of the cluster.

Vowel Sequences
Vowel sequences follow certain general conventions. Variations besides these are also commonly seen in various books and manuscripts.

Reduplication
In Pegon script, reduplication is represented with a numeral ٢ or a hyphen. If the word has a prefix, the duplicated base word is simply repeated after a hyphen. Otherwise, a ٢ indicates that the word is reduplicated. If the word has a suffix, the ٢ is placed between the base word and the suffix.

Madurese Pèghu
The Madurese language has a more complicated phonology than other Javanese languages. It includes the vowel /ɤ/ (â) and a wider range of glottal stops than Javanese or Sundanese. The Arabic script as adapted to Madurese, known as pèghu, had some differences from other versions of pegon, and is always written vocalized using diacritics.

Consonants
Madurese has more consonants than its neighboring languages, including voiceless unaspirated, voiceless aspirated, and voiced unaspirated, but with the exception of dh, aspirated consonants in Madurese Pèghu are represented by the same letters as their unaspirated counterparts.

Vowels
Vowels at the beginning of a word are indicated by the letter alif $⟨o⟩$ or ain $⟨u⟩$ with the appropriate diacritic, and a follow-up letter $⟨⟩$, $⟨⟩$, or $⟨⟩$, if required. Usually, this follow-up letter is written with a zero-vowel diacritic (sukūn), to indicate that it is part of the first syllable, and not part of a new one.

Comparison of Pegon and Jawi
The orthographic rules of Jawi and Pegon differ, with Jawi spelling being much more standardised than Pegon. Pegon tends to write all vowel sounds of native words explicitly, either with full letters or diacritics, whereas Jawi spelling sometimes omits alif in certain positions where an would be pronounced, and other vowel sounds may not be written explicitly.

For those additional letters representing sounds not present in Arabic, some letters have the same appearance in both Jawi and Pegon, while others differ. Pegon also has two additional letters for sounds native to Javanese which are not present in Malay. Also the form of kaf used differs between the two varieties with Pegon using the Arabic form, while Jawi uses the Persian form.

Transliteration
The United States Library of Congress published a romanization standard of Jawi and Pegon in 2012.

Sample text
Baginda Nabi Muhammad adalah utusan Allah kepada semua makhluk, Apa saja yang diceritakan oleh Baginda Nabi Muhammad adalah kebenaran yang nyata. Maka semua makhluk wajib membenarkan dan mengikuti Baginda Nabi Muhammad.
 * Indonesian in Pegon script
 * Indonesian in Latin Script

His Majesty the Prophet Muhammad is the messenger of God to all creatures, whatever is told by His Majesty the Prophet Muhammad is the real truth. Therefore all creatures have to corroborate and follow His Majesty the Prophet Muhammad.
 * English