User:Nameneko

Starting a list of municipalities of Japan here for a list and here as a final draft of sorts.

Japanese railway information and related stuff has been moved to Japanese rail project

Variants of the Latin alphabet
Variants of the Latin alphabet are used by the writing systems of many languages throughout the world. The basic Lain alphabet as defined by the ISO in ISO/IEC 646 since 1972 is the same as the modern English alphabet. It is as follows:
 * A B C D E F G H I J K L M N O P Q R S T U V W X Y Z

This article uses this as a base of comparison for the variations of the Latin alphabet. The alphabet as defined in the language is color-coded in a chart, with additional notes and a list variant letters that are commonly used in the language. These do not include characters used in loanwords. This article is by no means comprehensive; it merely serves to compare a number of common variations of the Latin alphabet.

Albanian

 * The circumflex is used only in the Gheg dialect, where it indicates a nasal vowel.
 * Variants
 * Circumflex: Â, Ê, Î, Ô, Û, Ŷ

Azerbaijani

 * The Azerbaijani language is also written in the Cyrillic alphabet and in Perso-Arabic script.


 * The letter X, alphabetized between H and I, represents a voiceless velar fricative.
 * The letter Q represents a voiced velar plosive (or "hard g") while the letter G represents a palatalized g. Q is collated between K and L.
 * The Azerbaijani alphabet contains both a dotted and dotless I. The minuscule form of I is ı and the majuscule form of i is İ.

Basque

 * The U with an umlaut (Ü) is used only in the Souletin dialect where it represents a close front rounded vowel.
 * Variants
 * Umlaut: Ü

Berber Languages

 * The Berber languages are also written in the Arabic alphabet and the Tifinagh alphabet.


 * In the Tuareg languages, the Latin alphabet used differs slightly, such as using Ə instead of E.

Bosnian

 * Bosnian uses Gaj's Latin alphabet, also used as the Croatian alphabet and the Serbian Latin alphabet.


 * The digraphs DŽ, LJ, and NJ are considered single letters in Bosnian and are collated separate from their first characters. Therefore, njegov appears after novine in a dictionary.

Catalan

 * Variants
 * Acute accent: Á, É, Í, Ó, Ú
 * Cedilla: Ç
 * Grave accent: À, È, Ì, Ò, Ù
 * Diaeresis: Ï, Ü

Croatian

 * Croatian uses Gaj's Latin alphabet, also used as the Bosnian alphabet and the Serbian Latin alphabet.


 * The digraphs DŽ, LJ, and NJ are considered single letters in Croatian and are collated separate from their first characters. Therefore, njegov appears after novine in a dictionary.

Czech

 * The digraph CH, alphabetized between H and I, represents a voiceless velar fricative. It is considered a single letter.

Danish

 * The Danish alphabet is identical to the Norwegian alphabet.

Dutch

 * The digraph IJ is only sometimes considered a separate letter. It usually represents the diphthong [ɛi].  Although Unicode codes for an upper and lower case character (Ĳ and ĳ), its use as a single character is not encouraged.
 * Variants
 * Diaeresis: Ä, Ë, Ï, Ö, Ü

Estonian

 * In the Estonian alphabet, the letter Z is positioned between S and T (or Š and Ž).
 * The letters S-caron (Š) and Z-caron (Ž), although variant letters, are only used in loanwords and foreign proper names.

Filipino

 * In 1976, the letters C, CH, F, J, LL, Ñ, Q, RR, V, X, and Z were added to accommodate Spanish and English loanwords. CH, LL, and RR were removed in 1987.

French

 * Variants
 * Acute accent: É
 * Cedilla: Ç
 * Circumflex: Â, Ê, Î, Ô, Û
 * Diaeresis: Ë, Ï, Ü, Ÿ
 * Grave accent: À, È, Ù

German

 * The letter C does not appear alone in native words; it is used in the sequences SCH, CH, and CK. It is only used by itself for loanwords.
 * The eszett (ß) has no capital letter and has been replaced by the sequence SS is a number of German-speaking regions.
 * Variants
 * ß (Eszett)
 * Umlaut: Ä, Ö, Ü

Guaraní

 * The letters B, C, and D are not considered standalone letters. They are only found in the digraphs MB, CH, and ND, respectively.
 * The letters L and RR are used predominantly in Spanish loanwords.
 * The puso (') is a letter indicating a glottal stop. It is ordered at the end of the alphabet.
 * Variants
 * Accute accent: Á, É, Í, Ó, Ú, Ý

Hausa

 * The Hausa language is also written in Ajami script.


 * The letter Y with a hook (Ƴ) is used only in Niger to represent a palatalized glottal stop. In Nigeria the sound is written ʼY.

Hawaiian

 * In the Hawaiian alphabet, vowels are ordered before consonants. The order of the consonants is consistent with standard, Latin alphabetical order.
 * The ʻokina (ʻ) is a letter indicating a glottal stop. It is ordered at the end of the alphabet.
 * Although most, common consonants in the basic Latin alphabet are not in the Hawaiian alphabet, when they are used in loanwords, they are ordered after the ʻokina, i.e. a dictionary would be sorted: ...P, W, ʻ, B, C, D, F...
 * Variants
 * macron: Ā, Ē, Ī, Ō, Ū

Hungarian

 * Although separate letters, when alphabetized, the letter pairs O-Ó, Ö-Ő, U-Ú, and Ü-Ű are grouped together, so the word folyó would appear before folyosó.
 * In alphabetization, the multiple-letter consonants are treated as separate letters, so that cukor (under C) would appear before csak (under CS).

Italian

 * Variants
 * Acute accent: É
 * Circumflex: Î
 * Grave accent: À, È, Ì, Ò, Ù

Karakalpak

 * The Karakalpak alphabet contains both a dotted and dotless I. The minuscule form of I is ı and the majuscule form of i is İ.

Norwegian

 * The Norwegian alphabet is identical to the Danish alphabet.

Portuguese

 * The U with diaeresis (Ü) is used only in Brazilian Portugese.
 * Variants
 * Acute accent: Á, É, Í, Ó, Ú
 * Cedilla: Ç
 * Circumflex: Â, Ê, Ô
 * Diaeresis: Ü
 * Grave accent: À
 * Tilde: Ã, Õ

Serbian

 * The Serbian language is also written in the Cyrillic alphabet.
 * Serbian uses Gaj's Latin alphabet, also used as the Bosnian alphabet and the Croatian alphabet.


 * The digraphs DŽ, LJ, and NJ are considered single letters in Serbian and are collated separate from their first characters. Therefore, njegov appears after novine in a dictionary.

Somali

 * The Somali alphabet follows an Arabic-based order.

Spanish

 * Since 1994, the letters CH and LL have not been collated as separate letters, but are still considered separate letters. So whereas CH was once alphabetized between C and D, it is now alphabetized between the sequences Cg and Ci.
 * Variants
 * Acute accent: Á, É, Í, Ó, Ú
 * Diaeresis: Ü

Turkish

 * The Turkish alphabet contains both a dotted and dotless I. The minuscule form of I is ı and the majuscule form of i is İ.

Turkmen

 * The Turkmen language is also written in the Arabic alphabet.

Uzbek

 * The Uzbek language is also written in the Cyrillic alphabet.


 * The letter C is not considered a standalone letter. It is only found in the digraph CH.
 * Collation sequence from Russian Wikipedia.

Vietnamese

 * Variants
 * Acute accent: Á, Ắ, Ấ, É, Ế, Í, Ó, Ố, Ớ, Ú, Ứ, Ý
 * Dot: Ạ, Ặ, Ậ, Ẹ, Ệ, Ị, Ọ, Ộ, Ợ, Ụ, Ự, Ỵ
 * Grave accent: À, Ằ, Ầ, È, Ề, Ì, Ò, Ồ, Ờ, Ù, Ừ, Ỳ
 * Hook: Ả, Ẳ, Ẩ, Ẻ, Ể, Ỉ, Ỏ, Ổ, Ở, Ủ, Ử, Ỷ
 * Tilde: Ã, Ẵ, Ẫ, Ẽ, Ễ, Ĩ, Õ, Ỗ, Ỡ, Ũ, Ữ, Ỹ

Welsh

 * The digraphs CH, DD, FF, NG, LL, PH, RH, and TH are single letters in Welsh. Therefore, the town of Llanelli in Wales is considered to have only six letters (LL occurs twice).  The letters are collated as separate letters, so a dictionary would be in the order: ...C, CH, D, DD, E, F, FF, G, NG, H...
 * Variants
 * Acute accent: Á, É, Í, Ó, Ú, Ẃ, Ý
 * Circumflex: Â, Ê, Î, Ô, Û, Ŵ, Ŷ
 * Diaeresis: Ä, Ë, Ï, Ö, Ü, Ẅ, Ÿ
 * Grave accent: À, È, Ì, Ò, Ù, Ẁ, Ỳ