Wikipedia:Bots/Requests for approval/TokenzeroBot 3


 * The following discussion is an archived debate. Please do not modify it. To request review of this BRFA, please start a new section at WT:BRFA. The result of the discussion was

TokenzeroBot 3
Operator:

Time filed: 18:22, Sunday, May 6, 2018 (UTC)

Automatic, Supervised, or Manual: automatic

Programming language(s): Python

Source code available: github

Function overview: Fill 'abbreviation' parameter in 'infobox journals' in trivial cases.

Links to relevant discussions (where appropriate): Wikipedia talk:WikiProject Academic Journals

Edit period(s): twice a week

Estimated number of pages affected: roughly 80 once + 1 per week

Exclusion compliant (Yes/No): Yes

Already has a bot flag (Yes/No): Yes

Function details: The bot would handle Category:Infobox journals with missing ISO 4 abbreviations in some easy cases, as requested by. Specifically, for every infobox journal that has no or empty abbreviation, it computes the ISO 4 abbreviation using the tool (already used by TokenzeroBot) and would fill it into the infobox only if the computed abbreviation actually has nothing abbreviated (so it is equal to the title, up to removing 'a'/'the' and disambigs like '(journal)')). A common case is when the title is a single word, in which case ISO 4 specifies that the word should be left unabbreviated. (We display these trivial abbreviations to make it clear that this is ISO 4, to be consistent with existing filled infoboxes).

To be safe, I would skip pages where the infobox title and article title don't match. A single run would fix ~80 of the 900 pages in the category. Then it could run together with TokenzeroBot 2 (which runs ~twice a week), to handle newly created infoboxes. I tested the code on one edit, see example diff.

Discussion
Headbomb {t · c · p · b} 19:10, 9 May 2018 (UTC)


 * , see Special:Contributions/TokenzeroBot. I believe El Iris should get 'Iris' instead; I've already added 'el' to the list of foreign articles, but I forgot to re-run the abbreviation computations to take that into account. Tokenzero (talk) 20:10, 9 May 2018 (UTC)
 * This one too is tricky 'De' is a Dutch/French article as well. Headbomb {t · c · p · b} 20:18, 9 May 2018 (UTC)
 * The list of articles is now "a an the der die das den dem des le la les l' el il lo los de het els ses es gli", so these cases should be handled now. If we still fear too much unexpected errors I could also run the bot supervised on the ~80 cases and not run it regularly (so do the new infoboxes would have to be filled by hand, as before). Tokenzero (talk) 11:59, 12 May 2018 (UTC)
 * . Do the full run, and we'll see if there's an issue. Then we can decide on future runs. Headbomb {t · c · p · b} 12:17, 21 May 2018 (UTC)


 * any updates on this? Headbomb {t · c · p · b} 16:26, 11 June 2018 (UTC)
 * I've had a hard drive failure, which stopped my PC server. I've fixed that last weekend, so I hope to do this this week. Tokenzero (talk) 19:36, 11 June 2018 (UTC)
 * See Special:Contributions/TokenzeroBot or the following log (51 edits made). I made two more edits by hand, otherwise the bot would fill the trivial abbreviation for Teb o Tazkieh (which should instead remove 'o', the transliteration of Persian word for 'and') as well as Zhizn' Natsional'nostei (which is formally correct, but languages like Russian barely have anything in the LTWA, so the ISO-4 abbreviation for them is kind of useless). Tokenzero (talk) 18:55, 17 June 2018 (UTC)

--Filling "Iyyun" with abbrev "Iyyun" --Filling "Läkartidningen" with abbrev "Läkartidningen" --Filling "Landscope" with abbrev "Landscope" --Filling "The Laryngoscope" with abbrev "Laryngoscope" --Filling "Lethaia" with abbrev "Lethaia" --Filling "Literal Latte" with abbrev "Literal Latte" --Filling "Lituanus" with abbrev "Lituanus" --Filling "Lung India" with abbrev "Lung India" --Filling "Magallania" with abbrev "Magallania" --Filling "Manuscripta" with abbrev "Manuscripta" --Filling "Middle East Critique" with abbrev "Middle East Critique" --Filling "Middle East Policy" with abbrev "Middle East Policy" --Filling "Mikroglottika" with abbrev "Mikroglottika" --Filling "Mouseion" with abbrev "Mouseion" --Filling "MUSICultures" with abbrev "MUSICultures" --Filling "MycoKeys" with abbrev "MycoKeys" --Filling "Neurocase" with abbrev "Neurocase" --Filling "NeuroImage" with abbrev "NeuroImage" --Filling "The Neuroscientist" with abbrev "Neuroscientist" --Filling "New Blackfriars" with abbrev "New Blackfriars" --Filling "New West Indian Guide" with abbrev "New West Indian Guide" --Filling "Novi Plamen" with abbrev "Novi Plamen" --Filling "The Numismatist" with abbrev "Numismatist" --Filling "Palaeoworld" with abbrev "Palaeoworld" --Filling "Party Politics" with abbrev "Party Politics" --Filling "Pathologica" with abbrev "Pathologica" --Filling "Per Linguam" with abbrev "Per Linguam" --Filling "The Philosopher" with abbrev "Philosopher" --Filling "Phylon" with abbrev "Phylon" --Filling "The Pinch" with abbrev "Pinch" --Filling "The Plant Cell" with abbrev "Plant Cell" --Filling "Ploughshares" with abbrev "Ploughshares" --Filling "The Pomegranate" with abbrev "Pomegranate" --Filling "Praksis" with abbrev "Praksis" --Filling "Public Health Ethics" with abbrev "Public Health Ethics" --Filling "Semiotica" with abbrev "Semiotica" --Filling "Sexologies" with abbrev "Sexologies" --Filling "SNReview" with abbrev "SNReview" --Filling "Spaceflight" with abbrev "Spaceflight" --Filling "Stenopetala" with abbrev "Stenopetala" --Filling "StoryQuarterly" with abbrev "StoryQuarterly" --Filling "SubStance" with abbrev "SubStance" --Filling "Synthese" with abbrev "Synthese" --Filling "T'oung Pao" with abbrev "T'oung Pao" --Filling "Themelios" with abbrev "Themelios" --Filling "Third Text" with abbrev "Third Text" --Filling "Trajecta" with abbrev "Trajecta" --Filling "Vikublaðið" with abbrev "Vikublaðið" --Filling "Voting matters" with abbrev "Voting matters" --Filling "Water SA" with abbrev "Water SA" --Filling "Winterthur Portfolio" with abbrev "Winterthur Portfolio"

Headbomb {t · c · p · b} 00:42, 18 June 2018 (UTC)
 * The above discussion is preserved as an archive of the debate. Please do not modify it. To request review of this BRFA, please start a new section at WT:BRFA.