Wikipedia:Bots/Requests for approval/TokenzeroBot 4


 * The following discussion is an archived debate. Please do not modify it. To request review of this BRFA, please start a new section at WT:BRFA. The result of the discussion was

TokenzeroBot 4
Operator:

Time filed: 18:26, Wednesday, May 9, 2018 (UTC)

Automatic, Supervised, or Manual: automatic

Programming language(s): python, pywikibot

Source code available: GitHub

Function overview: Create redirects between 'and' and '&' variants of journal/magazine names.

Links to relevant discussions (where appropriate): Bot_requests

Edit period(s): Weekly

Estimated number of pages affected: 2000 once + about 1 per week

Exclusion compliant (Yes/No): Yes

Already has a bot flag (Yes/No): Yes

Function details: For each page containing an infobox journal/infobox magazine or contained in a journal/magazine category: If the title contains subtrings ' and ' or ' & ' (with spaces around), then create a redirect from the replacement with ' & ' or ' and ', respectively (unless a page with that title already exists, in which case do nothing). For example: → Abstract and Applied Analysis. Created redirects would be categorized as R from modification.

The bot would create a few spurious redirects for foreign-language titles, like → Afrique & Histoire; looking at Category:Academic journals by language I expect about 10 of them.

Discussion

 * In that case, the bot should easily be able to exclude all subcategories of Category:Academic journals by language (that aren't Category:English-language journals). Headbomb {t · c · p · b} 19:15, 9 May 2018 (UTC)


 * Ok, didn't think of it. So from titles with ampersands, I'll exclude all those in Category:Multilingual journals (just to be safe), and those in Category:Academic journals by language but not in Category:English-language journals. This means currently 6 titles in total. Tokenzero (talk) 19:47, 9 May 2018 (UTC)


 * 10 edits for '&' → 'and', 10 edits for 'and' → '&'. Headbomb {t · c · p · b} 02:14, 10 May 2018 (UTC)
 * See Special:Contributions/TokenzeroBot. Tokenzero (talk) 20:54, 10 May 2018 (UTC)
 * The only issue was Lebensmittel-Wissenschaft and Technologie since the proper German is Lebensmittel-Wissenschaft und Technologie. I don't feel it's a huge issue though, but I'll leave the discussion open for further opinions and let another BAG member close approve/deny since I'm involved here. Headbomb {t · c · p · b} 21:52, 10 May 2018 (UTC)
 * Ok, sure. I've added language detection with cld2, though (mostly for fun and to learn if it's usable). It's not very reliable on such short fragments, but as an extra security measure it works: it does catch Lebensmittel-Wissenschaft & Technologie as German with high confidence, it finds no other such example, and it gives 82 titles where English has too low confidence (but all are actually English). So I could exclude and check those by hand. Tokenzero (talk) 10:37, 12 May 2018 (UTC)

just to make sure the bot still works. Headbomb {t · c · p · b} 16:31, 18 May 2018 (UTC)
 * See contribs. I only did '&' → 'and' (because only those are potentially problematic). Tokenzero (talk) 11:52, 19 May 2018 (UTC)

Headbomb {t · c · p · b} 13:43, 19 May 2018 (UTC)


 * The above discussion is preserved as an archive of the debate. Please do not modify it. To request review of this BRFA, please start a new section at WT:BRFA.