Chinese character forms

Chinese character forms are the shapes and structures of Chinese characters. They are the physical carriers of written Chinese.

Modern Chinese characters appear in the form of square blocks. There are two methods to analyze the forms of Chinese characters, source tracing analysis and current status analysis. Source tracing analysis is also called the method of character creation. It takes the form of a character when it was created as the object of analysis. Current status (or current situation) analysis takes the current regular script standard form as the object. As an academic subject, modern Chinese characters pay more attention to current status analysis.

Current status analysis studies how the writing units are combined level by level into a complete Chinese character. There are three levels of structural units of Chinese characters: strokes, components , and whole characters. For example, character is composed of two components, each of which is composed of three stokes: = 宀(㇔㇔㇇) + 子(㇇㇚㇐).

Strokes
Strokes (bǐhuà; ; ) are the smallest writing units of Chinese characters. When writing a Chinese character, the trace of a dot or a line left on the writing material (such as paper) from pen-down to pen-up is called a stroke.

Stroke number is the number of strokes of a Chinese character. It varies, for example, characters "一" and "乙" have only one stroke, while character "齉" has 36 strokes, and "龘" (three 龍s, dragons) consists of 48 strokes.

Stroke forms refer to the shapes of strokes. The stroke forms of a standard Chinese character set can be classified into a stroke table (or stroke list), for instance, the Unicode CJK strokes list has 36 types of strokes:

Stroke order has two meanings: Because the direction of strokes is relatively simple, people generally refer to the latter meaning when talking about stroke order.
 * 1) The order of a stroke, or the direction in which a stroke is written, for example, stroke "㇐" (heng, horizontal) is written from left to right, stroke "㇑" (shu, vertical)" is written from top to bottom.
 * 2) The order of strokes in a character, i.e., the order in which strokes are written to form a Chinese character, for example, the stroke order of character  is "㇓㇐㇑".

Strokes can also be used for Chinese character sorting. The important stroke-based sorting methods include stroke-count sorting, stroke-count-stroke-order sorting, GB stroke-based sorting and YES sorting.

Strokes combine with each other in a Chinese character in different ways. There are three types of stroke combinations between two strokes:
 * 1) Separation: the strokes are separated from each other. Such as:, ,.
 * 2) Connection: the strokes are connected, such as, , , , ,.
 * 3) Intersection: the strokes are intersected. Such as:, ,.

Components
Chinese character components (Pinyin: hànzì bùjiàn; Traditional Chinese: ; Simplified Chinese: ) are Chinese character building blocks composed of strokes. In most cases, a component is larger than a stroke (i.e., consists of more than one stroke) and smaller than the whole character (combines with some other components to form a character). For example, in character "件", there are two components ( and ), each with more than one strokes, (亻: ㇓㇑) and (牛: ㇓㇐㇐㇑). In the special cases of one-stroke characters, such as and, a stroke is a component and is a character.

Chinese character component analysis is to divide or separate a character into components. There are two ways for Chinese character dividing, hierarchical dividing and plane dividing. Hierarchical dividing separate layer by layer from larger to smaller components, and finally get the primitive components. Plane dividing separate out the primitive components at one time. Hierarchical dividing can display the external structure of Chinese characters, while plane splitting can be regarded as omitting the higher splitting levels, and directly writing out the final separating result of primitive components.

A component that can independently form a character is a character component, or a component of independent character formation. For example, component formed character  independently, and is a component in characters,  and. A component that can not independently form a character is a non-character component, or a component of dependent character formation. For example, is not a character in modern Chinese, but it is a component in characters,  and.

A component that cannot be (further) divided into smaller components by the rules is a primitive component, or basic component. Primitive components are the final-level components of hierarchical dividing. For example, components and  in character. A component composed of two or more primitive components is a compound component. For example, component in character,  and.

Radicals and pianpangs are components. In Chinese characters, radicals are mostly semantic pianpangs, such as the radicals of, ,.

The structure of a Chinese character is the pattern or rule in which the character is formed by its (first level) components. Chinese character structures include:
 * Single-component structure: The character is formed by a single primitive component, such as, and.
 * Left-right structure: The character is formed by a component on the left and another one on the right, such as, and.
 * Up-down structure: The character is formed by a component above another component, such as ,召 and.
 * Surrounding structure: One component is completely or partially surrounded by another component, such as, , , , , and.

Whole characters
A Chinese whole character (Pinyin: hànzì zhěngzì; Traditional Chinese: ; Simplified Chinese: ) is a complete Chinese character. It lies at the final level of the stroke-component-character Chinese character composition. According to their structures, Chinese characters can be divided into undecomposable characters and decomposable characters.

Undecomposable characters
An undecomposable character consists of one primitive component, which is directly formed by strokes and can not be decomposed into smaller components. In the "Specification of the Undecomposable Characters Commonly Used in the Modern Chinese", there is the "List of Modern Commonly Used Undecomposable Characters" (256 characters in stroke-based order): 一乙二十丁厂七卜八人入儿匕几九刁了刀力乃又三干于工土士才下寸大丈与万上小口山巾千川个歹久么凡丸及广亡门丫义之尸己已巳弓子卫也女刃飞习叉马乡丰王开井天夫无云专丐木五不犬太歹尤车巨牙屯戈互瓦止少曰日中贝内水见午牛手气毛壬升夭长片斤爪父月氏勿丹鸟六文方火为斗户心尺丑巴办予书玉未末击正甘世本术丙石戊龙平东卡凸业木且甲申电田由史央冉皿凹四生矢失乍禾丘白斥瓜乎用甩乐匆册鸟主立半头必永民弗出矛母耳亚臣吏再西百而页夹夷虫曲肉年朱臼自血卤舟亦衣产亥羊米州农严求甫更束两酉来卤里串我身囱言羌弟事雨果垂秉肃隶承革柬面重鬼禹首兼象鼠

It is estimated that the number of undecomposable characters approximately account for 4% in modern Chinese characters.

Decomposable characters
A decomposable character consists of more than one components. There are two frequently-used modes of component combination in the study of Chinese character structures: first-level component combination and primitive component combination.

According to first-level component combination, the structures of decomposable characters can be divided into 13 categories:


 * 1) Left to right (⿰, 2FF0 ), for example:, ,  and.
 * 2) Left to middle and right (⿲, 2FF2):,  and.
 * 3) Above to below (⿱, 2FF1):,  and.
 * 4) Above to middle and below (⿳, 2FF3):,  and.
 * 5) Full surround (⿴, 2FF4): ,  and
 * 6) Surround from above (⿵, 2FF5):,  and 風
 * 7) Surround from below (⿶, 2FF6):,  and 函
 * 8) Surround from left (⿷, 2FF7):,  and 匣
 * 9) Surround from upper left (⿸, 2FF8):,  and.
 * 10) Surround from upper right (⿹, 2FF9):,  and.
 * 11) Surround from lower left (⿺, 2FFA):,  and.
 * 12) Surround from lower right (N/A)：斗 and.
 * 13) Overlaid (⿻, 2FFB):,  and.

According to primitive component combination, the structures of decomposable characters can be divided into:
 * A. For characters composed of two primitive components, there are 9 different structures, as shown by the following example characters:.
 * B. For characters composed of three components, there are 21 different structures, such as:.
 * C. For characters composed of four components, there are 20 different structures, such as:.
 * D. For characters composed of five components, there are 20 different structures, such as:.
 * E. For characters composed of six components, there are 10 different structures, such as:.
 * F. For characters composed of seven components, there are 3 different structures, such as:.
 * G. For characters composed of eight components, there is 1 structure, such as:.
 * H. For characters composed of nine components, there is 1 structure, such as:.

Fonts
The popular fonts of modern Chinese characters include Song or Ming, FangSong , Kai (regular, , ), Li (clerical, , ), Hei (black, sans-serif, , ) and Wei.

In Chinese, in addition to the international "points" system, a unique "number" system is used for character sizes. For example, the Simplified Chinese version of MS Word allows setting font sizes by points or by numbers. The sizes of the number systems include (in ascending order of font sizes): Size No. 8, No. 7 , Small No. 6 , No. 6 , Small No. 5 , No. 5 , Small No. 4 , No. 4 , Small No. 3 , No. 3 , Small No. 2 , No. 2 , small No. 1 , No. 1 , Small initial number , Initial number.