Wikipedia:Bots/Requests for approval/BsoykaBot


 * The following discussion is an archived debate. Please do not modify it. To request review of this BRFA, please start a new section at Bots/Noticeboard. The result of the discussion was

BsoykaBot
Operator:

Time filed: 05:11, Sunday, February 27, 2022 (UTC)

Automatic, Supervised, or Manual: automatic

Programming language(s): Python

Source code available: https://github.com/bsoyka/wikipedia-bot/blob/main/fix_pmc_errors.py

Function overview: Fix CS1 errors due to PMC formatting

Links to relevant discussions (where appropriate):

Edit period(s): Running over the category once daily

Estimated number of pages affected: ~10-30 per day perhaps

Exclusion compliant (Yes/No): Yes

Already has a bot flag (Yes/No): No

Function details: This finds the regex  and replaces it with   for each article in Category:CS1 maint: PMC format. Some examples:

Discussion
Just commenting for curiosity, as of 13:51, 27 February 2022 (UTC) there are 3 pages in that category. "Live" count is. Primefac (talk) 13:51, 27 February 2022 (UTC)
 * The category can be cleaned out by User:Citation bot, but that bot needs to be triggered for the category manually by someone each time it's cleaned out. This new bot could take stress off of both the people manually triggering the bot and the bot itself, which is more intended for adding other information to citations.  Bsoyka  ( talk &middot;  contribs ) 14:23, 27 February 2022 (UTC)

We need to acknowledge during the BRFA that for most readers, this will be a cosmetic change, since it removes a normally hidden error message and a hidden category. I support the task, and bots are allowed to make cosmetic change with BRFA approval, but editors sometimes object to bot edits even after an approved BRFA because they don't see a change in the rendered page. – Jonesey95 (talk) 14:33, 27 February 2022 (UTC)
 * For sure, I agree. I'll also mention that I'm going through the BRFA process to get a flag and AWB access for my bot account so I'm not flooding any watchlists with this.  Bsoyka  ( talk &middot;  contribs ) 18:39, 27 February 2022 (UTC)
 * There's only one page in the category at the moment, so this might not be a high-volume task that anyone would notice. Does everyone see this message in preview though?    I find such preview warnings for "cosmetic" issues to be quite annoying, and would be happy to have bots making them go away. Human gnomes are swamped and don't have time for messing around making edits like this. Could a bot clean out Category:CS1 maint: url-status too? wbm1058 (talk) 01:44, 28 February 2022 (UTC)
 * Yeah this category seems pretty regularly cleaned-out manually, my estimate above is about 10-30 pages per day. As for the other category, it probably wouldn't be a simple find/replace like this but I might be able to make something work for it at some point in the future if no one else does.  Bsoyka  ( talk &middot;  contribs ) 02:17, 28 February 2022 (UTC)

BAG assistance needed No activity in over a week. — &#123;{u&#124; Bsoyka }&#125;  talk 17:02, 7 March 2022 (UTC)

Now searching using the regex  to catch cases with spaces — &#123;{u&#124; Bsoyka }&#125;  talk 04:07, 9 March 2022 (UTC), edited 16:20, 10 March 2022 (UTC)
 * @Bsoyka: This might be better implemented as a continuous-running bot, perhaps using Pywikibot? &#8213; Qwerfjkl  talk  20:51, 11 March 2022 (UTC)
 * Sounds good to me, done. — &#123;{u&#124; Bsoyka }&#125;  talk 23:05, 11 March 2022 (UTC)

This is assuming a single daily run. Primefac (talk) 13:12, 13 March 2022 (UTC)
 * This bot appears to have edited since this BRFA was filed. Bots may not edit outside their own or their operator's userspace unless approved or approved for trial. AnomieBOT ⚡ 22:29, 11 March 2022 (UTC)
 * was a single, carefully monitored edit that I would've otherwise made on my account simply to test this Python version of the task. — &#123;{u&#124; Bsoyka }&#125;  talk 23:05, 11 March 2022 (UTC)
 * Every 5 minutes seems excessive. Daily would perhaps be more appropriate? &#8213; Qwerfjkl  talk  23:33, 11 March 2022 (UTC)
 * Sure, works for me. — &#123;{u&#124; Bsoyka }&#125;  talk 23:39, 11 March 2022 (UTC)
 * I'm not sure this is the best way to handle this category. Having CitationBot automatically run on the category once a day would be better, because it does a lot more than cleanup pmc, it cleans up after the visual editor and other tools that inserted the less-than-ideal pmc parameter in the first place. &#32; Headbomb {t · c · p · b} 22:01, 13 March 2022 (UTC)
 * Compare e.g., with the follow up edit by Citation bot . &#32; Headbomb {t · c · p · b} 22:05, 13 March 2022 (UTC)
 * Perhaps, but on the other hand, Citation bot is manually activated every time it's run (taking at least a small amount of valuable time from human editors), and its operator doesn't appear very active at all anymore, so I'm not sure about having it run automatically. Bsoyka  ( talk ) 04:26, 14 March 2022 (UTC)
 * "at least a small amount of valuable time from human editors" It's literally a one-click activation once you're at Category:CS1 errors: DOI (and similar). The full list of one-click categories is
 * Category:Articles with missing Cite arXiv inputs
 * Category:CS1 errors: DOI
 * Category:CS1 maint: MR format
 * Category:CS1 maint: PMC embargo expired
 * Category:CS1 maint: PMC format
 * Having Citation bot automatically run would be better, sure, but manually triggering Citation bot leads to better outcomes than this bot would have. &#32; Headbomb {t · c · p · b} 21:24, 14 March 2022 (UTC)
 * I'm up for getting some more formal consensus around this; I still think it'd be a good idea to automatically get these out of the way, but, of course, I'll stick with what the community wants. I'll keep going on the trial as it's approved (for now). Bsoyka  ( talk ) 05:31, 15 March 2022 (UTC)
 * 45 edits, 4 reverted only because they were included with reverts meant mainly for other edits. Run times were a bit sporadic as I was activating the bot manually to check every edit, but would be on a schedule if approved. Bsoyka  ( talk ) 20:32, 27 March 2022 (UTC)
 * Here's what letting Citation bot do its job does instead, on the same articles. It's ongoing, but it should be done in about an hour or so. &#32; Headbomb {t · c · p · b} 21:08, 2 April 2022 (UTC)
 * It looks good to me and I was just about to approve this. Is this an objection to this task? There appears to be some overlap but not an awful lot in this case? -- The SandDoctor  Talk 17:05, 3 April 2022 (UTC)
 * It is an objection, yes. I would much prefer that Citation bot handles these because the malformed PMCs (which do not produce visible errors to readers) are symptoms of larger issues that Citation bot is much better equipped to deal with. If another bot handles them, they exit the workflow, and Citation bot will not run on them (at least not systematically). &#32; Headbomb {t · c · p · b} 17:09, 3 April 2022 (UTC)
 * Ah, so Citation bot already would deal with these cases and perform the same fix if it wasn't already done? If that is the case, then I will deny this in favour of that. -- The SandDoctor Talk 17:25, 3 April 2022 (UTC)
 * Indeed. It doesn't do so automatically, but it's a one click thing when you're in Category:CS1 maint: PMC format (the link is there for anyone to trigger the bot). &#32; Headbomb {t · c · p · b} 17:28, 3 April 2022 (UTC)
 * Thank you. -- The SandDoctor Talk 17:45, 3 April 2022 (UTC)

Given that this is a duplication of efforts already done by Citation bot, as Headbomb confirms and I can verify, I am going to go ahead and mark this as I do, however, thank  for their efforts and apologize for being the "bearer of bad news" on this task. I hope that Bsoyka isn't discouraged from filing non-duplicate tasks in the future. For the record, this bot worked as expected, just is duplicating efforts of an existing bot and thus is denied. -- The SandDoctor Talk 17:48, 3 April 2022 (UTC)
 * The above discussion is preserved as an archive of the debate. Please do not modify it. To request review of this BRFA, please start a new section at Bots/Noticeboard.