Template:Unichar/testcases2

working

 * Unichar/sandbox/doc




 * --new, debug only
 * Module:Unicode data;
 * --new, debug only
 * Module:Unicode data;
 * --new, debug only
 * Module:Unicode data;

todo

 * Cf same as Cx? done
 * Zl, Zp (2) treat as ws? as Cx
 * replacement character for formats (not Zs lightblue)? (see ws article)
 * format in string needs word separator
 * lang i.c.w. script (now use+use2 trick)
 * Space extra when no note ?


 * err
 * error check & useful, subtempl & get GC?
 * See /checkerror


 * chek for wrong prefix (U+, 0x, )


 * major
 * option name=none? (old) -- current: 2=empty, often ntentional + effect


 * nowrap somewhere


 * future
 * table format
 * add alias names
 * auto ref Ucode
 * (brackets) around glyph, old br

base






links




format

 * frivolous

c-with

 * c-with, cwith (old)
 * --comb
 * --comb
 * --comb
 * --comb c-with=xx old; see Tie
 * --comb cwith=xx old; see Tie

RTL

 * --rtl


 * 12ab ث89yz




 * arab

U+066E, U+065C dot below

















html

 * format=html, html old habit




 * --html= (old)

sans

 * format=sans; yes (old habit)


 * --old sans=yes

br

 * br added July 2020 ,see talkpage


 * Not used, se TPU


 * might be useful, i.c.w. noname.

whitespace (Zx)

 * Zs (=whitespace), Zl (1x), Zp (1x)
 * "spacing effect but is not a space (braille-0)"
 * "space effect but not a visible space (ZWSP)"
 * some controls, nonlatin scripts
 * Uses Non-photo blue


 * see Whitespace character


 * Zl, Zp

Cc in whitespace list

 * Cc = old C0+C1

Non-space blanks (U+2800 braille)

 * The Braille Patterns Unicode block the Unicode standard explicitly states that it does not act as a space.
 * Unicode's coverage of the Korean alphabet includes several code points which represent the absence of a written letter, and thus do not display a glyph:
 * Hangul Filler character in the Hangul Compatibility Jamo block . This is classified as a letter, but displayed as an empty space, like a Hangul block containing no jamo. It is used in KS X 1001 Hangul combining sequences to introduce them or denote the absence of a letter in a position, but not in Unicode's combining jamo system.
 * Unicode's combining jamo system uses similar Hangul Choseong Filler and Hangul Jungseong Filler characters to denote the absence of a letter in initial or medial position within a syllable block, which are included in the Hangul Jamo block.
 * Additionally, a Halfwidth Hangul Filler is included in the Halfwidth and Fullwidth Forms, which is used when mapping from encodings which include characters from both Johab (or Wansung) and N-byte Hangul (or its EBCDIC counterpart), such as IBM-933, which includes both Johab and EBCDIC fillers.

Cx controls

 * Cc | Cf | Cs | Co | Cn =


 * --Cc control (=C0 C1)
 * --Cc control (=C0 C1)


 * --Cs #Surrogates
 * --Cs


 * --Co
 * --Co


 * --Cf
 * --Cf


 * --Cn nonchar = unass.
 * --Cn reserv = unass.

han

 * --han
 * --han

Incidental

 * -- invis?

error

 * correct input, formats
 * -- A9 OK
 * -- lc 0a9 OK
 * -- U+0A9 OK
 * -- 0x0A9 OK
 * -- zero
 * -- html x0055;&plus;0a9 = U+

nowiki, script erroros (2023-020100
;blank and missing param
 * -- no param
 * -- empty param
 * -- U+empty


 * unicode cp check, for GC
 * -- Cc
 * -- Cn, nonchar
 * --Cn, reserved
 * -- >> unicode

input error

 * bad prefix
 * -- lc u+
 * -- OTTOx
 * -- P+a9


 * -- P+a9


 * -- F+a923


 * -- g+a90
 * not hex
 * -- not hex
 * -- not hex
 * -- not hex


 * ws
 * -- ws
 * -- ws
 * -- ws


 * minus, plus signs
 * -- minus a9
 * -- &minus html a9
 * -- plus a9
 * -- minus a9
 * -- minus a9
 * -- plus a9