Talk:Chinese characters/GA1

GA Review
The edit link for this section can be used to add comments to the review.''

Nominator: 18:14, 28 March 2024 (UTC)


 * @Kanguole, while Kusma is elsewhere, do you happen to have any critique or comments about the article in its present state? I know I've asked this question of you a lot, but you always have something insightful to say, so! Remsense  诉  10:19, 6 April 2024 (UTC)
 * I'm looking ahead, and I think the Old Chinese section that explains character evolution vis a vis Old Chinese morphology (i.e. the qusheng and transitivity prefix) is a dramatic jump in technicality from the rest of the article, and I'm not sure how I could simplify it for an article with this broad a scope. I want to keep something that's roughly this informative though, any thoughts @Kanguole? Remsense  诉  21:02, 9 April 2024 (UTC)
 * I think this is a special case of phonetic loans: the same character was used to write another word with a similar pronunciation, and subsequent sound change separated them. This can be seen with phono-semantic compounds too, e.g. 張 'stretch'/長 'long' (Baxter p324).
 * Some nits:
 * Is that a stray quote next to drjwen?
 * 去聲 (qùshēng) is the name of the MC departing tone; *-s is a proposed source of it.
 * The residual morphology of OC is usually said to be derivational rather than inflectional.
 * The usual notation for sound change (e.g. in the cited Baxter 1992) is > rather than →.
 * The last two examples of voiced initials don't show clear distinctions. If you had something in the level tone (like 張/長, you would end up with different initials and different tones in the modern forms.
 * Kanguole 23:03, 9 April 2024 (UTC)
 * In any case, there's more than enough material Kusma has deliberated on here for me to source from in future improvements to the article/the future FAC run. Remsense  诉  09:32, 21 May 2024 (UTC)

Reviewer: Kusma (talk · contribs) 07:13, 1 April 2024 (UTC)

Will take this on. I do know a few things about Chinese characters, but am looking forward to reading this in detail and learning more. —Kusma (talk) 07:13, 1 April 2024 (UTC)


 * Oh, thank you so much! I thought I may be waiting a bit for a review, it's a big vital article with a broad scope. Very excited to hear your feedback. Remsense  诉  11:28, 1 April 2024 (UTC)
 * It is probably going to take me some time, so you may need to wait a little for a complete review. —Kusma (talk) 13:11, 1 April 2024 (UTC)
 * Oh, please take your time! Remsense  诉  13:21, 1 April 2024 (UTC)
 * I will, and I will comment on everything I come across that I find worth mentioning. Not everything will need addressing in order to satisfy the GA criteria, and I may also be wrong about things, so please do let me know when you disagree with me. —Kusma (talk) 14:29, 1 April 2024 (UTC)
 * Yeah! Really, doing a few reviews first was a really good choice—I'm much more comfortable on the nominator side than I may have otherwise been. Do you mind if I reply to your bullet points below inline, or is that too much of a headache? Remsense  诉  14:39, 1 April 2024 (UTC)
 * Inline is fine unless we are editing at the same time. —Kusma (talk) 16:07, 1 April 2024 (UTC)
 * I don't know whether I'll have much editing time until Thursday (and I had another stalled GA review coming back). So I will indeed take my time. If you are trying to give names in characters for everyone (a goal I support, but general consensus might be slightly different): Chan Sin-Wai is 陳善偉. —Kusma (talk) 07:04, 2 April 2024 (UTC)
 * Thank you! I think it is generally necessary to give the characters for people who write their names with characters—my litmus test has generally been to include characters when they are printed in the book or immediately visible upon a google search for the person, but extra inclusion certainly can't hurt.
 * And no problem at all! As you may have seen, I keep finding things to polish and tweak—I hope that's alright with you, I can gladly refrain if it complicates the review process. I'm honestly just stoked that the review has been taken up so quickly by someone I'm going to learn a lot from—so please take all the time you need, I mean it. I'll be here. Face-smile.svg Remsense  诉  08:19, 2 April 2024 (UTC)
 * I am going to take some time, especially as I want to engage with the sources. My own personal library has two that you don't use, the classic Chinese Characters by Léon Wieger (most of the book is a dictionary sorted by phonetic component, something I have not seen anywhere else) and the modern Chinese Characters by Han Jiantang. Reading all this will be fun, but won't be quick. Thank you for your patience. —Kusma (talk) 10:13, 7 April 2024 (UTC)
 * I am over the moon that you're affording me so much effort. I hope in kind you don't mind that I tinker with the article some (I have a graphical addition in mind but theoretically it shouldn't be disruptive to the flow of analysis)! I will also find those books and add them to my collection, thank you! Remsense  诉  10:25, 7 April 2024 (UTC)

Lead

 * Lead: will discuss later

Development

 * Development: four independent inventions of writing in human history probably better to say "at least" as it is not always known what else was independent?
 * It is probably possible to explain the timing of the four different inventions without using BP for the Maya, which looks out of place with the BCE notation for the other three.
 * Not sure about some of the italics here (compare MOS:ITALIC). proto-writing doesn't need emphasis imo, and is not really WP:WAW.
 * What is the difference between "development" and "early history"?
 * 大鹿 could be linked to Wiktionary like character examples later are
 * 假借: better to incorporate the pinyin (with tones) into the template like you do for Chinese words later in the text

Second pass

 * I am still wondering whether this (relatively short and fairly general) section shouldn't be closer to the "neolithic" part of the history where the concrete application of the abstract proto-writing theory to Chinese is discussed. But I can shut up about this.
 * Is the word "rebus" used by Qiu? On a quick scan I couldn't see it, and the link rebus only offers "Some linguists believe that the Chinese developed their writing system according to the rebus principle". This may need an improved citation.
 * Other concerns have been addressed.

Classification
More later! —Kusma (talk) 14:28, 1 April 2024 (UTC)
 * Classification: again, over-italicised in my view. "writing systems" and "morphosyllabic" have little reason for italics.
 * You do not mention the traditional zh:六書 system, but then cite the 漢語大字典 which often goes back to the 說文 for its character information.
 * Structural analysis: this may of course be justified, but it seems a lot of space is given to Qiu's view, even if it differs only little from the "straightforward structural classification scheme" presented earlier.
 * Pictographs: While relatively few in number, most of the earliest characters originated as pictographs relatively few in number in the last 2000 years? Early on, they were apparently the majority, so relatively many in number?
 * Example lexeme A: 日 links to the 漢語大字典 page for 月.
 * Would it make sense to mention that these are 象形 in
 * Is there some deeper reason why some characters are linked and others are not?
 * Over time, this process sometimes creates excess ambiguity between different senses of a character, which is then usually resolved by adding additional components to create new characters used for specific senses. This can result in new pictographs, but usually results in other character types. this makes me itch to say that modern Chinese resolves this ambiguity not by new characters, but by multi-syllable words; not sure if that is an appropriate comment here


 * four independent inventions; BP: these are taken directly from Handel, but I can happily adjust the date format to favor internal consistency.
 * What is the difference between "development" and "early history"? I think I attempted to organize the "development" section to explain how proto-writing became a writing system while explicating what that means, while the "history" section is meant to cover the evolution of a fully mature writing system.
 * do not mention the traditional zh:六書 system [yet] This is a bit of a pickle for me, I'll have to think about it a bit more. In short, I did not think is incompatible with or requires explication vis a vis with modern analysis before it is explicitly discussed. In such cases, the  analysis is considered accurate to the best of my knowledge
 * Qiu's view: I attribute a lot of statements to Qiu in the first half that I probably don't need to, though in this specific case I do. That said, Qiu's analyses in the Development section etc. are basically the industry golden standard and reflect broad scholarly consensus; Qiu 1988 is basically the definitive monograph for Chinese palaeography (which makes it infuriating that they haven't reprinted the 2000 translation.) Will think about this a bit more.
 * Early on, they were apparently the majority? Great catch! Simply, there weren't very many characters in the proto-writing system—it's impossible to hold thousands of ideographs in your head without starting to encode spoken language. I will explicate this.
 * Is there some deeper reason why some characters are linked and others are not? Inertia! I am not sure what should and should not be linked, but I definitely lean on "don't link without a specific reason", and removed a ton of wiktionary linking from this article as I was working on it.
 * excess ambiguity: The ambiguity that creates new character forms is (or lexicographical) in nature—I do see how this can be conflated with the also-mentioned ambiguity that creates compound words, which is  in nature, and therefore went largely unreflected in the written language until the advent of the vernacular movement to supplant Literary Chinese. Good catch, will explicate this also.
 * All other bullet points I agree with and have applied the most straightforward changes for.
 * Remsense 诉  15:13, 1 April 2024 (UTC)

More later! —Kusma (talk) 10:07, 7 April 2024 (UTC)
 * Compound ideographs: you use 日 ('Sun') but ; is there some obvious reason to use two different formats? Also, seems to create a huge amount of whitespace behind it when viewed in Safari, making it look awkward (not a problem in Chrome).
 * There seems to be no citation for 休.
 * I am not certain that the 漢語大字典 is the best possible source for character etymologies, given that it often just reprints the 說文解字. I will revisit this point when talking about sources.
 * I see no information in the 漢語大字典 that 砼 is a compound ideograph, or even how it is composed; all the dictionary says is that it means 混凝土, concrete.


 * Oh wow, I'm curious why that is! Will investigate. For now, I've tried to use smallcaps with single quotes specifically for radicals, and single quotes for the glosses of lexemes per usual.
 * And yes, I've been thinking about how to cite lexemes also. Of course, there are other dictionaries that I'm prepared to swap out if we don't think the 漢語大字典 is fit for purpose here—but I would consider that it is, given that concerns the latter are not strictly about individual classifications per se. It's odd!
 * ! I accidentally did an OR here—specifically, I went to a chat room where a bunch of Taiwanese lexicographers were hanging out and begged them in my sloppy Chinese to give me a better example of a modern sessemantograph than 鉑 'platinum', which had been the previous example and simply was obviously not one 砼 tóng is at the very least also arguably a phono-semantic compound with phonetic 工 gōng. Perhaps these further examples can simply be cut if they are too nettlesome.
 * Will fix these shortly, wanted to organize my thoughts. Remsense  诉  10:18, 7 April 2024 (UTC)
 * 鉑 is an interesting case, I wonder if there are sources stating whether it is supposed to be syssemantographic gold+white or phonosemantic 金+bo (while 白 is only rarely bo, it is often bo in compounds). The choice of 白 for the sound component may have been inspired by the meaning... —Kusma (talk) 20:04, 8 April 2024 (UTC)
 * As mentioned in the article, the line between phono-semantic and ideographic compounds can be fuzzy. I recalled after looking that 砼 was specifically invented by a Chinese architect and professor named Cai Fangyin in 1953 (which explains why it's not in any Taiwanese dictionaries) but it is also easily described as a phono-semantic compound of 石 and 仝 tóng. Remsense  诉  20:32, 8 April 2024 (UTC)
 * I'm now also very curious about platinum. When did knowledge of that metal reach China? The usual informal word is 白金, but Wiktionary tells me that that was coined in Japanese by the chemist Udagawa Yōan in 1834. If knowledge of Pt reached China that late, then it seems plausible that the use of 鉑 to mean Pt (instead of its old meaning "thin sheet of gold") was a new invention from those who coined the Chinese names of the elements, with a character being chosen to make it meaningfully appropriate. That would make it something like 氫 qīng for hydrogen, where the character and reading were both coinages, and the chosen phonetic was meant to give meaning instead of to match the pronunciation in the major European languages; the difference would be that 鉑 was repurposing an old character with its old reading (not unheard of, some of the characters like 鈰 for cerium are actually in names of the House of Zhu) and 氫 was an entirely new coinage. (I'm particularly fond of 氕 piē, 氘 dāo and 氚 chuān for the hydrogen isotopes, which follow the same sort of thinking behind H itself.) Needless to say, if this speculation is correct, it's not a good example. :)
 * P.S. on reading Wright's Translating Science, I now feel it was a gloriously wasted opportunity not to translate osmium as ⾦惡. Double sharp (talk) 17:14, 14 April 2024 (UTC)

More later, as usual :) —Kusma (talk) 21:11, 8 April 2024 (UTC)
 * Phono-semantic compounds: "For a modern reader, the resulting drift is such that the phonetic component no longer provides any hint as to each character's pronunciation." this is specific to 也; maybe add some comment that clarifies that modern readers get lots of hints from most other characters?
 * Table: is the OC column in IPA? What are C and A? Can you link to some IPA help page?
 * Loangraphs: Some loangraphs may represent words that have never been written another way—this is often the case with abstract grammatical particles such as 之 and 其—but this is not always so this is a bit redundantly written. Also, what does "words that have never been written another way" really mean? 之 is often written の in Taiwanese Mandarin handwriting ...
 * 可口可樂: well, both 可口 and 可樂 are actual words on their own, so this is a bit questionable as an example in my view. Neither Coca-Cola nor Romania are mentioned in the source.
 * (random anecdote): my favourite transcription of a Western company into Chinese is actually 大众 for VW, with the 众 looking like the VW logo :)
 * Traditional Shuowen Jiezi classification: liùshū (六書; 六书; 'six writings') here you have pinyin-traditional-simplified; Romania and Coca-Cola were simplified-traditional-pinyin. Worth making consistent?
 * Are there examples for characters mis-classified by the 說文解字?
 * while this number would be reduced substantially in future dictionaries, the underlying concept would remain ubiquitous. try to avoid going too far WP:INTOTHEWOULDS.

Second pass

 * There are a few uncited sentences in the "pictographs" subsubsection. ("their forms have become regularised and simplified after centuries of iteration", "individualised components may form part of a compound pictograph", "the same 'mouth' component depicts the lip of a vessel in the modern form of the pictograph 畐"
 * Compound ideographs: do you need to say "canonical" twice?
 * Link kokuji.
 * I still do not know how to interpret the "/*lAjʔ/" given for 也 in Old Chinese. What does the A stand for?
 * After checking Baxter-Sagart 2014, I think you should cite that book explicitly for the table (也 is on p. 269). On p. 321, I found the information that "our *A is not a seventh vowel: it is an explicitly ad hoc notation to flag some unsolved problems." I understand that the table is there to strengthen the point that sounds were different in the past, so some sound components make no sense, but overall I think it is a bit of a distraction; I would drop the table, perhaps in favour of an example where we know more.
 * Generally much improved.

History
More later... —Kusma (talk) 22:48, 9 April 2024 (UTC)
 * History: the broad trend towards simplification is probably scholarly consensus and can probably be stated in wikivoice instead of attributing to Qiu? You could still mention Qiu and cite his words.
 * Neolithic writing: aren't there some people (including Qiu) who claim that the Dawenkou culture's signs may be connected to Jiaguwen, compare e.g. zh:大汶口陶文?
 * Oracle bone script: I just fell into a slight rabbit hole of trying to read up on earliest jiaguwen. Now I'm wondering whether an eclipse in 1173 BCE (see p. 4) ought to be added to Historically significant lunar eclipses... but that's probably nothing we need to discuss here.
 * However, it might be interesting to mention Wu Ding?
 * village near Anyang in Henan, which was excavated why not link to / mention Yinxu?
 * which was excavated by the Academia Sinica more precisely, by Li Ji from their (freshly founded just like the Academia) zh:中央研究院歷史語言研究所 (Institute of History and Philology).
 * I think it would be worth to mention the Chinese names of the various scripts.
 * Zhou scripts: The traditional notion of an orderly procession of scripts, with each suddenly invented and displacing the one previous, has been conclusively superseded by modern archaeological finds and scholarly research. More often, two or more scripts coexisted in a given area, and scripts evolved gradually. this seems to be rather general, not limited to the Zhou dynasty?
 * Examples of these styles were preserved as variants in the Shuowen Jiezi. are these other scripts only in the Shuowen Jiezi, or are there extant samples from the Zhou dynasty?
 * Large seal script is only in the See Also on top; shouldn't it be mentioned and discussed in this section?
 * Qin unification and small seal script: Isn't this the place to talk about Li Si?
 * The coexistence of proto-clerical and seal script runs counter to the traditional belief that only the latter was used by the Qin, with clerical script being suddenly invented during the early Han. does everybody agree with Qiu on this? Even if yes, might be better to either drop the traditional belief or state both what the traditional theory was and then what the modern theory is and why it superseded the traditional one.
 * I know that we get some samples later in the article, but I think it would be great to see pictures of oracle bone script/large seal script/small seal script/clerical script closer to where these are discussed.


 * This list is especially worthwhile, many points for refinement toward maximum encyclopedic value here, thank you so much! Remsense  诉  17:07, 10 April 2024 (UTC)
 * Do you think for the history section broadly, it might be worthwhile to have a pseudo-infobox/table where the names are listed in order? I've spent a lot of time cutting a lot of non-English text (imperfectly as you've seen) from this article but yes the native names for scripts do seem like necessary inclusions. Remsense  诉  18:01, 10 April 2024 (UTC)
 * I think that could work, definitely worth trying to create a table of scripts with their Chinese names. —Kusma (talk) 12:20, 11 April 2024 (UTC)
 * We'll cross that bridge when we get to it, but I'll keep this concept in mind for other terminology: MOS:ZH states (rightfully in the realm of style guideline) that linked terms shouldn't also include characters/romanization inline, but there do seem some common-sense exceptions here, since this is such an omnibus article that should reliably stand on its own as such. Remsense  诉  12:22, 11 April 2024 (UTC)
 * We should not write "Mao Zedong (毛泽东)" but we may wish to explain terminology, and the Chinese terminology is potentially important here. For GA, we are free to ignore MOS:ZH but of course at FA level you will need to justify any deviation from the MoS. —Kusma (talk) 14:10, 11 April 2024 (UTC)
 * it's definitely domain-specific MOS that hasn't really been worked through at the FA level specifically, but yes I agree for script styles, and possibly other terms, let me know if there are others you notice that should have characters attached. Remsense  诉  14:23, 11 April 2024 (UTC)
 * By the way, I've been working through your lists, fixing the obvious or easy ones, and copying the ones I'm less sure about to either work through or possibly discuss further once we've done a complete pass through the article, does this seem like a plan? Remsense  诉  15:17, 12 April 2024 (UTC)
 * I can work with that. I am not actually sure what the best method is, so I am happy to use yours. —Kusma (talk) 16:01, 12 April 2024 (UTC)

Structure next! —Kusma (talk) 16:38, 12 April 2024 (UTC)
 * Han clerical script: I don't understand why Libian and Liding are linked as See Also. As their titles are in Chinese, it is unclear what the reader should expect. Shouldn't the libian processes be touched upon here a little?
 * its sophistication was comparable to small seal script. what is meant by this "sophistication"?
 * I find the treatment of ancient cursive versus cursive (and later semi-cursive) a bit confusing, and considerations of the Jin dynasty make up near half of your Han dynasty section. Maybe treat all cursives in a separate section?
 * Neo-clerical: is this a name used only by Qiu?
 * Coined by Qiu. Remsense  诉  16:00, 17 April 2024 (UTC)
 * Regular script: this is missing a statement that regular script is still used today. I also find the section a bit difficult to follow. We have a "simplified, convenient" form that is changed in a way that is "neat, formal". It is unclear how the predecessors were inconvenient or informal or lacking in neatness. I generally don't like this section much.
 * Cao Wei calligrapher Zhong Yao: I didn't know that Cao Wei is a state/dynasty without clicking the link; can this be made clearer?
 * How did the Wangs develop the script?
 * calligraphers such as Wang Wang Xizhi or his son or both?
 * In the History section, is it worth mentioning what tools people used to write the different script styles and how the tool choice influenced the script? (I am pretty sure writing on bones or bronze is very different from using a brush and ink).


 * It may help to merge section History and section Development into "Historical Development" or so. Ctxz2323 (talk) 09:58, 23 May 2024 (UTC)
 * The reason I haven't done this is I don't want Classification to be buried under the entire History section, but I feel the Development section is what is needed for the Classification section to be afforded its proper context. Remsense  诉  09:59, 23 May 2024 (UTC)
 * Understand.
 * And many thanks for your great effort for the GA target! Ctxz2323 (talk) 10:10, 23 May 2024 (UTC)
 * I'm hoping the lessons I've learned will make improving all the other articles on the subject much easier for me and those around me. :)! Thank you for the work you've done as well. Remsense  诉  10:11, 23 May 2024 (UTC)

Second pass

 * "Broadest trend is simplification" is still attributed instead of wikivoice
 * Neolithic: attribute the quote to Qiu
 * Oracle bone script: the source has "several dozen" instead of "up to 40"; is there an independent source for the number or is this just an attempt to avoid WP:CLOP?
 * Zhou scripts: "traditional notion of an orderly procession of scripts" is still here, although my comment has a strikethrough above. I am wondering whether this would be best moved into the introduction of the History section.

Structure

 * rectilinear units of uniform width. nowadays they are square, and they have been nearly square (even when writing top to bottom) for a long time, although the image of the "printed Song publication" has 義 much taller than 之. In the next sentence, you say within the square.
 * Not sure whether it is worth mentioning how standardised stroke counts are (so they are used for all kinds of sorting, from dictionaries to bookshops).
 * I don't understand how the Eight Principles of Yong exemplify the categorisation of strokes.
 * Both the order in which strokes are drawn within a given component, as well as the order components are written in a character is largely fixed. well, but there are several competing standards for stroke order, so this isn't fixed across all of the Sinosphere.
 * Variants and allographs: this might benefit from some more examples (usually we don't see as much variety as for 龜). 棋/䃆 (you play Go with stone pieces on a wooden board, so both make sense) and the top-to-bottom versions 棊/碁 belong to my personal favourites, as does 裡/裏. No sources, sorry.

Second pass

 * "many character components became regularised as discrete series of strokes." I would say that nowadays, all characters (with the possible exception of 〇 if you want to count that) are written as a discrete series of strokes, and the same for their components.
 * 永 could perhaps use a footnote: these days it has six strokes
 * "The order in which the strokes of a component are drawn is also set, but may differ by region." sort of; there are different standards, and different regions (most people call them "countries", except if they believe Taiwan is not a country) nowadays subscribe to different standards. I would probably say "there are several different standards" instead of "may differ by region" if you can source it.
 * Layout: I would suggest to mention here that traditional publishing (books, conservative newspapers) in Taiwan and Japan still uses the traditional layout.
 * If there is a link to stroke order (with the weird example of 必 having four different versions) in the article, I have missed it. If there isn't, I would suggest to add one.

Methods of writing
Enough for today. —Kusma (talk) 22:07, 12 April 2024 (UTC)
 * Printing and typefaces: I would either not mention the 字号 or say what it is. Missing traditional 字號.
 * "Kai" typefaces at this point it turns out we need to know that regular script is called kaishu.
 * Input methods: Predominantly, Chinese characters are input as strings of Latin characters with some notable exceptions like Taiwan, where people usually have bopomofo keyboards and "ji32k7au4a83" is a famously terrible password (我的密碼 typed on a bopomofo keyboard in Latin mode).
 * a more unique shorthand this hurts my mathematician brain. either it is unique or not. "less ambiguous"?
 * 大学 is missing its traditional friend 大學.
 * Encoding and interchange: Text is represented digitally by a series of binary numbers called code points why "series" and not "string" or "sequence"? I don't see what is "binary" about the numbers; Unicode code points are usually written in hexadecimal. Better drop "binary"?
 * I'm not sure you need all of this technical detail, but if you have so much detail on different encoding systems, why do you omit Shift JIS?
 * As of 2024, Big5 encodes 3053 characters.[92] Citation 92 is from 2008 and it seems unlikely that the author knows what Big5 is like in 2024.
 * Thank you so much as always. Remsense  诉  23:11, 12 April 2024 (UTC)

Second pass

 * Just an aside: only now did I read the article on edomoji and finally understood better why some writing looks Japanese to me even without any kana in it
 * Printing and typefaces: should jiu zixing also have characters?
 * Input methods: link pinyin on first use
 * Encoding and interchange: you should mention that other encoding systems exist (or at least were very widely used pre Unicode).

Vocabulary and adaptation
Next stop: Japanese. —Kusma (talk) 13:25, 14 April 2024 (UTC)
 * Vocabulary and adaptation: this section combines talking about 文言文 with the adoption of Chinese characters for other languages. There is probably a good reason to do so, but it is not very clear from the current writing. It is odd that Vietnam, Korea etc. are discussed in a paragraph that begins with a mention of Literary and colloquial readings.
 * Outside China, the notion of special reading systems was extended, what is a "special reading system"?
 * maybe explain quickly what brushtalk is? what are face-to-face conversions?
 * In Korea, hanja were usually used to write Sino-Korean vocabulary I suppose the point is that they were used only for Sino-Korean vocabulary? Was there a different writing system for other Korean words?
 * It isn't really clear how this section is ordered. It is neither chronological nor by area, and some points are revisited more than once.
 * If the bit between the "Vocabulary and adaptation" header and the "Old Chinese" subheader is supposed to be an overview/introduction/"lead section" for what follows, it is exceedingly long.
 * Old Chinese: I don't quite understand why you have so much content about the "departing tone"; how is that relevant to the characters?
 * 説: why not the standard form 說 but this intermediate form closer to 说? (I have no idea why these strokes are different between traditional and simplified, but they are).
 * 説 was the form used in the source (Baxter 1994). Remsense  诉  16:00, 17 April 2024 (UTC)
 * Vernacular Chinese varieties: here we have two paragraphs, one talking about 2000 years of history leading to written vernacular Chinese, another about much more recent (I think, no dates are given) adaptations for non-Mandarin varieties.
 * Say that 孬 is Cantonese and when it was introduced?
 * This seems a variant used across different variants (originally attributed to Northeast Mandarin), so I've opted not to be specific. Remsense  诉  16:00, 17 April 2024 (UTC)
 * My favourite Cantonese character is 冇 (meaning 沒有). It isn't really a "compound" ideograph, but more a "reduction" one. Do you know how people classify this one?
 * semi-official characters I think the MoE is trying to have an official list (which has some choices that I don't like, in particular 袂 instead of Be-tw.svg, as I learned during the GA review of Taiwanese Mandarin, but it is what it is). What do you mean by "semi-official"?
 * Why not give examples of Taiwanese Southern Min Recommended Characters? 睏 is a nice phonosemantic that has 目 as in 睡 (same meaning) and the different sound component 困. (My 台语 is terrible).
 * Actually, many of these seem old, including 睏 and 袂. Not sure which are fully new designs.
 * Remsense 诉  20:17, 14 April 2024 (UTC)

More later. —Kusma (talk) 20:27, 14 April 2024 (UTC)
 * Why See also to Kanbun?
 * the word meaning "Chinese characters" is rendered as kanji isn't it 漢字 just like in Chinese?
 * The 刀 story comes in the middle of the katakana/ateji story.
 * Not sure we need so much about kana.
 * Korean: Define Hanja.
 * As early as the Gojoseon period when was that?
 * I wish I knew! I was shocked at how trying to include a parenthetical dating for this opened every single can of worms in the world, hence its conspicuous absence. Not sure what better to do. Remsense  诉  16:00, 17 April 2024 (UTC)
 * This ambiguity had historically been resolved by also including the associated hanja. While still sometimes used for Sino-Korean vocabulary, it is much rarer for native Korean words to be written using hanja. tense? when is "historically" and when is the now with the native Korean words?
 * Examples of listings why "listings"?
 * Are "native translation" and "Sino-Korean" like the kun and on reading in Japanese or does it just look like that?
 * South Korea: I am a bit confused about the extent of Hanja use and how it changed in the past 70 years.
 * Others support returning to a level of ordinary hanja use previously seen during the 1970s and 1980s so they want the use to increase a bit again?
 * Not sure we need two paragraphs on Hanja in South Korea.
 * Maybe some examples of Korean use of Hanja would be good (of 15 random pages on kowiki, only one, ko:남병길 had any Hanja content; all just names). But maybe you have too much about Korean already.
 * North Korea: the use of all so-called foreign languages, which has been interpreted as including hanja is this a North Korean interpretation or a foreign interpretation?
 * Vietnamese: Link chữ Nôm. Why are there two sentences about use of Chinese in Vietnam between the first mention of chữ Nôm and the explanation what it is?
 * Similar to Zhuang sawndip hasn't been introduced yet
 * Before 1945, the library of the French School of the Far East (EFEO) in Hanoi collected a total of around 20,000 Chinese and Vietnamese epigraphy rubbings from throughout Indochina.[133] The oldest surviving extant manuscript in Vietnamese is a late 15th-century bilingual copy of the Buddhist Sutra of Filial Piety, currently kept by the EFEO. It features Chinese text in larger characters, and an Old Vietnamese translation in smaller characters glossing the text.[134] Every Hán Nôm book in Vietnam after the Phật thuyết is dated between the 17th and the 20th centuries, with most being hand-copied works, and few printed texts. By 1987, the library of the Institute of Hán-Nôm Studies in Hanoi had collected a total of 4,808 Hán Nôm manuscripts is any of this necessary in this article?
 * Other languages: scripts based on Chinese characters, but also included many locally created characters why "but"?
 * According to surveys, traditional sawndip script has twice as many users as the official Latin script this basically repeats the earlier point still in use despite efforts to encourage the writing of Zhuang with a Latin-based alphabet, better to merge these.
 * Nüshu and bopomofo seem to be in the wrong section, they are both phonetic scripts for Chinese.
 * Other scripts within China that have adapted some characters but are otherwise distinct what languages are these for?
 * Transcription: why is this section only about Mongolian?

Second pass

 * "Sinosphere" is explained and linked well after its first use in the article.
 * Kin 2021 p. XII (ref. 118 at the moment) has no information about Chinese, Japanese, Korean and Vietnamese belonging to different language families.
 * However, for other varieties this level of completeness has been described by Victor H. Mair (b. 1943) as "almost unthinkable". hm, I think this was written in 1994, well before the Taiwanese Southern Min Recommended Characters. Systematic characters for 方言 are probably only ever going to be created by government agencies of a separatist bent, but there we are. I would suggest to drop the quote (and certainly drop "born 1943" and instead describe who Mair is so we know why we should care if you leave it).
 * Should we have Japanese before Korean, opposite of the historical order?
 * Japanese: link onyomi and kunyomi?

Special cases
A somewhat odd name for the section, which I will look at next. —Kusma (talk) 20:41, 15 April 2024 (UTC) Not convinced this is the best name or place for this content, nor that all of this needs to be here. —Kusma (talk) 21:16, 16 April 2024 (UTC)
 * The entire "Contractions and abbreviations" section appears to be sourced to a single blog post. Even if it is by Victor H. Mair, this seems less than ideal. If this one blog post is the only mention of these characters in the literature, it is undue weight to spend two paragraphs on this.
 * What does the concept of "ligature" even mean in Chinese? It is not in the source (only in a comment, and Mair seems to be opposed to its use).
 * 囍 can also be pronounced xǐ
 * There are a few more sources and examples at zh:合文, for example common Lunar New Year things like zh:招財進寶 (成語). Modern readers might even like the Grass Mud Horse :)
 * Multi-syllable morphemes: what are the sources for this section?
 * 琵琶, incorporating 珡 just the upper half I'd say...
 * 儿 for erhua: traditional is 兒.
 * Rare and complex characters: Rare or antiquated character variants appear more often in personal or place names more often than where? Any examples? (Biangbiang noodles isn't a personal or place name).
 * Being unable to remember how to write 憂鬱症 by hand did make me slightly melancholic at some point.
 * 鬱 does have a simplified form, 郁.
 * quadruplicated quadrupled? quadruple?


 * Indeed—mind that this article as submitted was following me already having cut several thousand words out of it, the general perspective that the work wasn't done is well-appreciated, as you can see I've trimmed the article still further per your suggestions/thoughts Remsense  诉  21:20, 16 April 2024 (UTC)
 * This seems all gone. I don't know whether some "complexity" subsection somewhere would be a good place to mention complicated characters like 鬱 and some of the 合文? —Kusma (talk) 20:49, 19 May 2024 (UTC)
 * I'm not convinced, honestly. It seems a bit like trivia and appears almost not at all in the RS. Remsense  诉  04:21, 20 May 2024 (UTC)
 * If there is nothing in the sources, it should not be in the article. Something like statistics of how many strokes typical characters have (and that complex characters are rare, and anything worse than 鑰 is extremely rare) doesn't look impossible to source to me, but it might not be strictly necessary to cover. —Kusma (talk) 20:25, 20 May 2024 (UTC)

Standardisation
Hope I can get through the last section tomorrow! —Kusma (talk) 22:02, 18 April 2024 (UTC)
 * Is the table supposed to be complete? (Macao? Singapore? North Korea??)
 * Going from this table into "Received forms" is quite a break in the flow
 * I am wondering whether it wouldn't make more sense to put content on "Received" woodblock forms and the history of simplification into the "History" section and to make the section here just about (current) "standards" instead of including the process of standardisation. (There is nothing about standardisation in the TW and HK sections).
 * Simplified characters: Fascinating as it is that KMT and CCP has competing romanisation projects, I don't think we need this much detail about Gwoyeu Romatzyh and Latinxua Sinwenz. Awesome as Jaw Yuanrenn is, he might be not on topic here.
 * You may be right; I had trouble writing about the simplification without mentioning its tight interwovenness with the alphabetization movements, but surely I can do a bit better.  Remsense  诉  11:11, 19 April 2024 (UTC)
 * Cursive script served as a source for many simplified character forms; others had already been used in print, though usually not in formal works this is mostly redundant with the content in the following section, drop here and mention Cursive later
 * The Republic of China, as well as Hong Kong and Macau—still under colonial rule at the time—were not affected by the reforms. and why should they?
 * The PRC initiated the first round of character simplifications with two documents published in 1956 and 1965 so did the use of simplified characters start in the 1950s or in the 1960s?
 * a total of 8,105 characters, with 3,500 categorised as primary, 3,000 as secondary, and 1,605 as tertiary. what do these categories mean? We seen to move from standardisation of the characters (what are their standard forms and how were they changed?) to the different question of which characters a student is supposed to learn,
 * The Chinese Proficiency Test (HSK) covers 2,663 characters and 5,000 words at its highest level, while the Chinese Proficiency Grading Standards for International Chinese Language Education would cover 3,000 characters and 11,092 words at its highest level that is the pre-2021 汉语水平考试, I think the new one is a but different. Also, this isn't about standardisation of characters, but standardisation of the curriculum taught to learners of Chinese as a foreign language.
 * Singapore: The first round was published in 1969, and consisted of 498 simplified and 502 traditional characters the first round of simplification consisted of simplified and traditional characters? I don't understand this.
 * Generally, is it worth discussing the writing directions and their standardisation (or lack thereof) somewhere?
 * Philippines: why do we have a special section for this?
 * bopomofo, owing to Taiwanese influence due to a shared Hokkien heritage not in the source. The source doesn't look particularly reliable either.
 * North America: why does it get its own section? (Where is Europe? Australia?)


 * I was thinking about education having its own section—what do you think about that? Remsense  诉  12:08, 19 April 2024 (UTC)
 * That might work? "Educational standards", maybe? You could have all the information about educational standards in various countries (like the fact that it is usually standardised when a child should learn a specific character; I own a Taiwanese student dictionary from ca. 1999 and it has stroke order and information like 二上 for "first half of second grade" given with each character) in such a section, and also information about how many characters are required for things like HSK/TOCFL or JLPT that foreigners care about. —Kusma (talk) 12:43, 19 April 2024 (UTC)

Second pass (now "Reform and standardisation")

 * I am wondering whether there should be an additional example for "they also sought to reduce the total number of characters in use by merging some forms together". 云/雲 is a return to an ancient form plus a merging (unlike 气/氣, which is just a return to the ancient form). 後 being merged into the (unrelated, IIRC) 后 is a true merging. (I do not like 简体字 much, does it show?) But it is certainly not something that must be done.
 * From the second round of simplification you move on to describing the present standard (The Chart of Generally Utilised Characters of Modern Chinese) without warning.
 * The Taiwanese Chart of Standard Forms of Common National Characters apparently (according to that article) has been augmented by "18,388 characters in the Chart of Rarely-Used National Characters (罕用國字標準字體表)"; if you can source this properly, it would fit well here.

Lexicography
The following are more personal rants than suggestions to improve the article, so no need to engage too much with the small print.
 * What is a "meaning-based order"? Is this about the time before modern encyclopaedias when the world was small enough that people tried to sort the encyclopaedia in some logical way instead of the meaningless alphabetical order we have nowadays? (Western encyclopaedias made this shift around the 16th century).
 * Most modern Chinese dictionaries arrange the main character entries alphabetically according to pinyin spelling, while also providing a traditional radical-based index I can't find that in the cited source. From my own experience, the last time I went dictionary shopping (I have to admit that was over 20 years ago), it was difficult to buy pinyin-based dictionaries in Taipei. I think there is also a difference between monolingual and bilingual dictionaries, with monolingual dictionaries more favouring radical/stroke and bilingual ones more favouring pinyin.
 * Most dictionaries choose one method to sort characters and actually have indices for other methods.
 * The text doesn't emphasise enough what a terrible user-unfriendly system radical+stroke was in practice, back when we had to use paper dictionaries. I remember spending entire days looking up characters. Drawing on my phone in Pleco is about a hundred times faster.
 * (That ⻌ and 阝 both have seven strokes in Kangxi does not help).
 * In fact, my best paper dictionary (a Far East Chinese English Dictionary from ca. 2000) starts with radical index, table of contents, then the coolest thing of all: a table of hard to find characters sorted by stroke count, telling me things like where to find 恭 even if I don't know that the radical is 心.
 * Active vocabulary/total number of characters: might make sense to combine this with the "education" content about number of characters. Maybe you just want a "number of characters" section somewhere?
 * Is it worth mentioning the Table of Indexing Chinese Character Components as a modern Simplified alternative to Kangxi that may or may not be a widely used standard?

No further comments here. —Kusma (talk) 20:46, 20 May 2024 (UTC)

Images
No major complaints here, just minor suggestions for captions. —Kusma (talk) 21:19, 21 April 2024 (UTC)
 * File:Comparative evolution of Cuneiform, Egyptian and Chinese characters.svg - this is cute, but I think it should be attributed in the caption. I assume the Chinese characters were drawn by Maspero or some other Egyptologist? They look rather like beginner's handwriting.
 * Structural templates used in compounds, with red marking possible positions for components: is this supposed to be complete? shouldn't there be some source for these? Is something like the 或 in 國 included, or just the 囗?
 * File:噹噹茶餐廳2021年7月初的午餐餐牌-tweaked.jpg – not sure if it is worth pointing out, but there are some typical handwriting abbreviations on the board, like the way they write 炒 and in particular writing 反 as shorthand for 飯 (technically wrong, but anyone literate enough to understand this when written correctly should understand the abbreviation).
 * File:Chineseprimer3.png – a higher quality scan would be nice, but certainly optional
 * File:ROC24 SC1.jpg – source link is dead, but this seems fine. I wonder whether it is worth linking (in caption or in the body) to zh:第一批简体字表

Second pass on images

 * New images are a great addition, and licensing seems fine.
 * Without a statistics section, I would suggest to consider removing File:Cumulative Hanzi frequency.svg. Or replacing by something showing how most characters belong to the same two handful of radicals...
 * I am not sure the choice of left-aligning File:噹噹茶餐廳2021年7月初的午餐餐牌-tweaked.jpg really works. In standard settings (incognito mode Chrome) there is enough space to move it to the right.

Infoboxes

 * Direction: Top-to-bottom is still widely used in Japan and Taiwan, so I do not understand why this is marked "historical".
 * Do the Ryukyuan languages need their own mention?
 * I don't quite see the point in the massive list of transcriptions. Maybe not all of them are needed?
 * At least in Hanyu pinyin, I think it is usually Hànzì with a capital H.
 * the "other Mandarin" heading is odd
 * "Romanization" should be "Romanisation" in the variety of English used in the body
 * Requested edits implementing an WP:ENGVAR parameter for .  Remsense  诉  11:08, 12 May 2024 (UTC)
 * Why do we care about the name in Thai or Khmer, and perhaps the one in Zhuang?

Better. —Kusma (talk) 20:57, 20 May 2024 (UTC)

Source reviewing
Numbering is from Special:PermanentLink/1220112331. Not looking at everything, but at quite a few sources. More later! —Kusma (talk) 20:54, 22 April 2024 (UTC) First impression of sources: generally excellent sources, but sometimes not everything you say is in these sources (and sometimes you cite rather long sections of the books). More later (sorry for the slow progress). —Kusma (talk) 21:06, 23 April 2024 (UTC) More later! (Sorry, busy days...) —Kusma (talk) 20:52, 25 April 2024 (UTC)
 * Notes: e,f,g,k are unsourced (even if k is WP:BLUESKY to anyone who reads Chinese). h,i seem unfinished.
 * 2: ok
 * 4: ok, but is it worth discussing the "Naxi script" that Qiu mentions?
 * 6: can't see a discussion of jiajie on p. 11; you probably need to cite a few earlier pages
 * 8: can't see "logography" defined on these pages
 * 10: ok
 * 11: Handel just mentions 六書 and does not say 說文解字. Are you sure about the page numbers for Yong & Peng?
 * 12: ok
 * 13: does Qiu call them forms? Maybe I overlooked it but didn't see it
 * 14/15 could not find full text, but maybe I didn't try hard enough
 * 16: this makes it sound as if this is Qiu's theory, but he clearly says this is Tang Lan's theory (and then says why it is not enough).
 * 17: not sure where you see that "most" early characters are 象形
 * 21: ok
 * C: I don't see how the MoE dictionary page for 明白 says how it is " touching on the derived association of 明 with 'illumination'" Could you point me to where the 漢語大字典 explains how 萌 alludes to heliotropic plants?
 * 23 "made obscure by subsequent changes in form" I read the source more as referring to subsequent changes in pronunciation.
 * D: As discussed earlier, 砼 is not described as a compound ideograph in the source.
 * 26: ok
 * 27: Qiu p. 154 seems to discuss the difficulty of deciding between 象形 and 指事 and not talk about 形聲 versus 會意
 * Footnote d: no sources for this paragraph?
 * 33: can't see anything about chemistry on p. 211, and in particular nothing about plutonium.
 * 35, 36: both seem to be about syllables and not about transcriptions / connotations; nothing about 可口可樂.


 * I have been pretty under the weather myself the past week or so and as such haven't been able to be as up-to-date with your suggestions as I've liked, so I appreciate your patience and continued diligence so much, I should be on the upswing now... Remsense  诉  19:13, 26 April 2024 (UTC)
 * Also, you may notice that my solution to many—not all!—discrepancies you mention is just to excise the passage in question. Perhaps this is a bias of mine, but I find many of them are simply unnecessary to the article as weighted, and the pressure to reduce its wordcount some is well-addressed. If I'm cutting out too much, please let me know though. Remsense  诉  01:07, 28 April 2024 (UTC)
 * I guess cutting is overall reasonable (currently the article is still at 9.4k words), but some like "new characters are still being made, for example for newly discovered chemical elements" isn't something I would remove entirely. —Kusma (talk) 05:43, 28 April 2024 (UTC)
 * Of course. I guess if I've learned anything from this process, it's "rewrite entire sections instead of writing articles backwards and sourcing claims others have made piecemeal". I would like to readd properly sourced color claims as I come across them in the literature, but if I can't do so easily it seems the best option is removal Remsense  诉  16:08, 28 April 2024 (UTC)
 * I agree; generally I prefer to use old text (or Wikipedia articles in other languages) for inspiration only and just write my own text based on the sources I can access. —Kusma (talk) 16:24, 28 April 2024 (UTC)

More later (I really need to speed up, but I keep getting distracted). —Kusma (talk) 16:21, 28 April 2024 (UTC) I am wondering if I should add tags to various information not in the citations given (dragon bones, Li Ji, "Oracle bone script is the direct ancestor of later forms of written Chinese"). —Kusma (talk) 20:52, 29 April 2024 (UTC) Side remark: somebody should research more about our hero Qiu Xigui so the article can become GA :) —Kusma (talk) 10:18, 6 May 2024 (UTC)
 * 37: fine. I enjoyed reading in Qiu pp. 170–171 that 歹 might be Tibetan in origin.
 * 38: the Norman I can access doesn't quite have the right pagination, so I'm not sure about the page range. Is "throughout the Sinosphere" really covered by Norman?
 * 41: I'm sure this is somewhere in Qiu, but this seems to be the wrong page range.
 * The Chinese second edition is cited, aka our "Qiu 2013". Remsense  诉  01:07, 30 April 2024 (UTC)
 * 44: ok
 * 45/46: I am sure there are better sources than conference proceedings and a blog for this claim.
 * 50: the mention of Jiahu and Dadiwan is on p. 30. I can't seem to find the exact quote in Qiu; if this is a paraphrase (and arguably it should be one and/or be attributed to Qiu better) it should not have quotation marks.
 * 52: fine
 * 53: give page number
 * 54: seems to be p. 108, at least that is the page linked on Google Books
 * [53][54][55][56] looks overcited
 * 57a: no mention of Yin or of dragon bones or of Li Ji in this source, which only seems to cover the final sentence.
 * 58: seems ok
 * 59: page number?
 * 60: ok
 * 61: It would probably be easier (and far less likely to break in the future) to split these up into one citation per sentence.
 * 62: that's a really long page range
 * 64: the source calls it 鳥篆 or 鳥蟲書, not 鳥蟲篆 (and my Chinese input method does not recognise that as a word). The zhwiki article is at zh:鳥蟲書. I can't find in the source where the regional forms used in non-Qin states were preserved as variants in the Shuowen Jiezi, or where 大篆 is discussed.
 * 65: couldn't find all of the previous in this source. At least Li Si and 221 BCE are missing.
 * 66: looks roughly ok, did not check every word
 * 67: Chinese name for "neo-clerical" is not in these pages
 * 69: looks ok
 * 70: can't find "Sage of Calligraphy" in Qiu; also, Qiu p. 139 has different years of birth and death. If Qiu is wrong about this, you should cite Wang Xizhi from somewhere else, which should not be too hard.
 * 71: if you find time, best to split this up by sentence
 * 72: could not find, but does it mention the Sui dynasty?
 * I am with you on Qiu. Remsense  诉  12:50, 6 May 2024 (UTC)

Source reviewing 2
Now using Special:PermanentLink/1222699083 so I don't review a weeks old version. Still only halfway through :( —Kusma (talk) 21:49, 7 May 2024 (UTC) Stopping for the night. Hopefully more time over the weekend. —Kusma (talk) 22:22, 9 May 2024 (UTC) Hoping for more time over the weekend. —Kusma (talk) 22:57, 10 May 2024 (UTC) Saving progress... —Kusma (talk) 19:18, 12 May 2024 (UTC) There seem to be several unsourced statements in every paragraph, and I haven't even checked every single citation. In a normal GA review (i.e. one that takes a couple of days to a week) I would probably have failed this about 50 citations ago. As this has turned into some kind of deep-dive peer review, I will keep going and only think about the GA criteria again after I have looked at your many improvements and corrections. Stopping here for the day. —Kusma (talk) 21:11, 13 May 2024 (UTC)
 * 79: this should be citable to something in English. For example, pp. 75–76 of the (not super great) Han Jiantang book I mentioned earlier.
 * 83: a lot of this seems more about evolution than about variant characters.
 * 85: ok
 * 86: not all of this is in the source; you say 朙 was "prescribed", the source just says it was "promoted".> "明 became the character's standard form" is true but not in the source. The identification of "Before the Qin" with "During the Eastern Zhou" is slightly OR.
 * Does that 明 is listed in the "Modern print" column not count? Whoops, forgot to comma a page. Remsense  诉  00:04, 8 May 2024 (UTC)
 * 87: looks ok
 * 91: only saw snippet view, looks ok but I couldn't find Bi Sheng
 * 96: does Li mention bopomofo in this context?
 * 97: page number? I am not sure I fully understand the sentence anyway
 * 98: fine. also mentions bopomofo and other methods, generally seems like a good source
 * 99: couldn't access, but the example is also in Zhang 2016 p. 423
 * 100: not sure I understand the citation. page number?
 * 102: ok
 * 103: looks ok
 * 105: probably fine (wrong edition)
 * 107: ok
 * 109: no search results for "sinosphere", a word already used earlier in the text, and citing a 700 page book without page numbers looks like whoever cited this just hopes it is in the book somewhere.
 * 112: looks ok
 * 114: not happy. There is nothing about Vietnam or Japan here, and "Writing also arrived in Korea during the 2nd century BCE and spread throughout the country over the following three centuries" seems too far from clearly specified in the source to be in wikivoice.
 * I've already said I'm never writing an article this way again, but there's still no good reason for that disaster last paragraph other than sloppiness to exist on my watch, ugh. Apologies, and thank you for your patience. Remsense  诉  23:42, 9 May 2024 (UTC)
 * 115: could not access, but looks like a solid source
 * 116: could not access, but looks like a solid source
 * 118: ok, but a page number would be nice
 * I am ignoring the Old Chinese; it doesn't really come across in this section why we should care about the pronunciation.
 * It's puzzled me too, do you think trying to articulate this as a way that characters acquire different meanings would suffice? Remsense  诉  23:36, 10 May 2024 (UTC)
 * Yes. One or two examples (where the sourcing is best) would suffice for that. We want to talk about the characters here. (And perhaps this is a way to stress that "multiple readings" isn't a Japanese only phenomenon). —Kusma (talk) 13:29, 12 May 2024 (UTC)
 * Does the article as presently constituted imply that? Remsense  诉  13:37, 12 May 2024 (UTC)
 * Not explicitly, but I can imagine a reader could go away thinking characters have a single meaning and reading. —Kusma (talk) 18:36, 12 May 2024 (UTC)
 * 125: I am pretty sure not all of this is in the source. 25-30%? accelerating introduction of polysyllabic words into vernacular?
 * 126: not sure my pagination is right, but I am having trouble finding the three concepts close to each other
 * 127: sort of.
 * 128: the source only claims this about Cantonese. the new characters are not "borrowed", are they? How is 冇 "borrowed"?
 * E: that dictionary is not a source for the claim "generally phono-semantic compounds, although there are examples of compound ideographs"; also, is this in all dialects?
 * F: does the dictionary really say that there are "a body of official characters used to represent" 台語/客家話? Link is broken so I can't check.
 * 129: looks ok from snippets
 * 130, 131: seems ok
 * 132: text looks ok but couldn't find examples in snippet view
 * 133: looks ok
 * 134: do you really need to cite a website in Korean? seems a fairly basic claim and it is in the following source
 * 135: ok
 * Is there a source for the Hanja examples / is it worth linking them to Wiktionary at least?
 * 138: is any of the previous six sentences covered by this source? doesn't look like it
 * 139: is this just for the half-sentence? then the previous sentence is uncited, and anyway 139 does not support it.
 * 140: "no results found in this book for veritable records". None for "etymology" either, all of the three sentences seem unsourced.
 * 142: this is some unknown reporter writing? seems reliable enough for the claim, but I can only read Korean through machines. Link The Chosun Ilbo?
 * 143: is this a copyvio via some long-defunct BBS? not a particularly good looking source, and in Korean
 * 144: I do not believe that all three of the statements preceding this citation are on one page.
 * 145: are the 13th century and the "newly coined chữ Nôm" really on the same page?
 * 146: snippet looks ok
 * 147: Coulmas looks ok, but the page range in DeFrancis is a bit long for such a simple statement
 * 148: this is a master's thesis, and there are no page numbers. if this is the "survey", it is only acceptable if cited by better WP:RS per WP:SCHOLARSHIP. If this just cites the survey, this is hopeless without a page number. The first three sentences of the paragraph are unsourced?
 * 149a: looks ok? did not check for all of the scripts
 * 151: where does it say it is a "semi-syllabary"? (also uses the official name zhuyin fuhao instead of the nice nickname bopomofo).
 * I have tried to consistently use the WP:COMMONNAME bopomofo across the article. Do you see this as less ideal? I think consistency is most important in any case. Remsense  诉  04:36, 14 May 2024 (UTC)
 * I remember being one of the people asserting it is the common name at the move discussion. Certainly among language learners, "bopomofo" was the most common way to refer to it. I think it is fine, but I still wonder whether I should have done proper research before participating in that RM. —Kusma (talk) 21:05, 14 May 2024 (UTC)
 * 149c: does not mention bopomofo using forms derived from Chinese characters; that seems unsourced.
 * 152: Persian/Arabic seem not mentioned. (Also, when was it that Mongolian was written using PErsian/Arabic? These days Cyrillic is pretty popular).
 * 153: I fail to see the relevance of this quote and would suggets to cut.


 * I am doing my absolute best at least not to make this a WP:FIXLOOP for you, as to properly respect all the time you've devoted to this. I am heartened to notice that most of the issues have been with material I haven't myself written, only attempted to cite afterward. I have learned to be much, much more active in stripping out every spurious claim in an article like this (when I started the article was 13k and I nominated thinking I had cut enough out and managed to cite the rest. It was simply not mentally navigable foe me at the time, but I've learned a lot better how to knuckle down and ensure that is the case from this.) Remsense  诉  00:14, 14 May 2024 (UTC)
 * Spurred by this discontent with my own process, I've cut essentially the last material from the article that I didn't personally come across in my own research—thinking piecemeal sourcing was adequate, but this has been shown not to be the case—basically the better chunk of what you had left to review. While the need for some "education" section is more apparent than before, I am still fairly sure that the coverage is broad enough per the GA criteria. Also, the article feels a bit empty to me, like it doesn't really adequately relate the human color of the subject (as reflected in the sources) very well—though that might just be me feeling bad about the quality of my work. There are elements throughout my time with this article that I've cut and would like to reintroduce but haven't encountered the sources to best do so. Sorry again about not having the good sense to do this before. Remsense  诉  09:41, 14 May 2024 (UTC)
 * That was a lot of cutting! Given how hit-and-miss some of the sourcing was, probably a good move. I really should have done the source reviewing and the content reviewing together to better discuss what should and should not be cut. We'll have to do a second pass through the content after this source review. —Kusma (talk) 20:58, 14 May 2024 (UTC)
 * I have a document with most of the content I've cut from this article since October so that I can grab things for potential readdition. Thank you for your patience as always. Remsense  诉  02:35, 15 May 2024 (UTC)

Source reviewing part 3
Now at Special:Permanentlink/1223839594. Lexicography tomorrow, then I'll go back to content reviewing. —Kusma (talk) 20:58, 14 May 2024 (UTC)
 * 165: I can't see the "novel readings" in the source. I found it interesting that Qiu mentions government-driven standardisation versus population-driven, fairly ad hoc simplifications.
 * 167: ok
 * 168: none of this is in Chen 1999 p. 151
 * 169: this one isn't at those pages in Chen either
 * 170: what is the relevance of Chen 182–186 here? 150–153 is ok. Is the historical stuff in the other sources?
 * 171: this seems only about pinyin, can't see Zhou Enlai and script simplification here.
 * 173: isn't this pp. 156–157?
 * 175: always dangerous to cite an entire paragraph to one ref tag, but most of this seems to be in Chen 1999.
 * 176: ok as primary source, secondary in English should also exist
 * 177: only references the previous sentence, not the first couple sentences of the paragraph. (Generally, in an article like this, I would suggest to avoid end-of-paragraph citations and use end-of-sentence citations everywhere, to make it easier to spot when an unsourced sentence is added). isn't this in one of your scholarly books?
 * 179: looks ok
 * 181/182: not the greatest sources the world has ever seen, but seem to work for the moment
 * 184: where do the 4808 characters come from? (Incidentally, my favourite counterexample to "no simplified characters in Taiwan" is 台湾啤酒).
 * Hopefully today! Yesterday's wikitime was spent unproductively arguing with people who are wrong on the internet. —Kusma (talk) 08:54, 16 May 2024 (UTC)
 * Remsense 诉  09:14, 16 May 2024 (UTC)

OK, done with this bit! Next step: Lead section (at first glance, it seems incomplete and not summarising the entire article). —Kusma (talk) 21:12, 16 May 2024 (UTC)
 * 187: I am sure that there are non-Chinese books for this (could not verify, but looks plausible). You could be a bit more precise; "visual form" does not immediately suggest the ordering schemes by stroke count or radical + stroke count that are probably included here.
 * 188: the content about 說文解字 is not in these pages.
 * 189: no content about Kangxi at this page range? (source looks pretty comprehensive but it is really annoying that they translate titles so you have to learn more words like "EDCC" instead of just "Shuowen").
 * 191: only found wrong edition, so could not fully verify.


 * Some more general source comments:
 * *Baxter 1992 is not cited.
 * *Cite 118 to "Handel" probably needs to go to "Handel 2019"
 * *What is the copyright status of the online copy of 漢語大字典 that you use?
 * Sourcing generally looks much improved! —Kusma (talk) 21:00, 20 May 2024 (UTC)

Lead

 * Yeah, I haven't been sure what to emphasize in the fourth paragraph this lead should most definitely have. Totally going to be a thing where a fresh set of eyes will be extremely useful to me. Remsense  诉  12:53, 17 May 2024 (UTC)
 * Maybe you ought to start from scratch. Certainly classification/structure (types of charcters, strokes, radicals), history (Shang dynasty, Qin unification, regular script), reform are sorely missing; readers probably also expect some comment on how many characters there are, perhaps combined with modern computing info. I am not sure the phoneme-morpheme-ideograph discussion reflects the body, and also that it needs to be there. —Kusma (talk) 21:50, 17 May 2024 (UTC)
 * I am not sure I would be able to rewrite it from scratch, if I can be honest. It's very difficult to adequately introduce this topic, and I don't think I have it in me to rebalance everything the article is about without making a bigger mess. I'll at least try the introduction of a fourth paragraph and touching on the points you mention, and we'll go from there. Remsense  诉  01:12, 18 May 2024 (UTC)
 * I think that touches nearly everything, but I'm not sure how to better balance the paragraphs. What do you think? Remsense  诉  05:36, 18 May 2024 (UTC)
 * It looks better now; still on the short side. To make it even shorter, I think you can drop the emphasis on only writing system used continuously as this is more a statement about the other inventions of writing. I'll let you know if I think of things I want mentioned. —Kusma (talk) 10:43, 19 May 2024 (UTC)

Final comments
In any case, I think this passes the GA criteria now. Great work! Sorry my review took so long. —Kusma (talk) 17:00, 24 May 2024 (UTC)
 * Who is "Yang Runlu" and why do we care what they say?
 * pre-FAC: you need more for "comprehensiveness", and I would strongly suggest to find a good peer reviewer with little to no knowledge of Chinese to read this
 * the lead section especially may need close attention before attempting FAC
 * Not sure Qiu [47] needs to be a quote


 * Thanks for the good news of GA.
 * By the way, Yang Runlu is a professor of Beijing Normal University and the author of textbook " ". Ctxz2323 (talk) 00:50, 25 May 2024 (UTC)