Wikipedia:Bots/Requests for approval/Qwerfjkl (bot) 12


 * The following discussion is an archived debate. Please do not modify it. To request review of this BRFA, please start a new section at Bots/Noticeboard. The result of the discussion was

Qwerfjkl (bot) 12
Operator:

Time filed: 09:13, Saturday, May 14, 2022 (UTC)

Automatic, Supervised, or Manual: automatic

Programming language(s): AutoWikiBrowser

Source code available: AWB

Function overview: Remove  from overcategorized pages.

Links to relevant discussions (where appropriate): Bot requests (and the prior discussion linked there)

Edit period(s): One time run

Estimated number of pages affected: <200,000

Exclusion compliant (Yes/No): No

Already has a bot flag (Yes/No): Yes

Function details: The bot will remove  from pages determined by running a   query on the relevant categories, via a regexp. The page count is hard to estimate because of the number of categories removed, and the large size of categories to work on, so I've estimated an upper limit.

The categories I'll run  on are:



Discussion
So the idea is to use a bot to clean out the redundant category from articles that are already properly subcategorized, so that the human editors can concentrate our efforts on the smaller number of articles that are only filed in the parent while lacking any subcategorization. Bearcat (talk) 12:51, 14 May 2022 (UTC) which might need splitting up. ―  Qwerfjkl talk  14:29, 26 May 2022 (UTC)
 * Just for a bit of context on why this is warranted, if it would help: WP:FILM formerly had a policy of deeming "(Country) films" categories to be all-inclusive, meaning that they had to directly include all films from that country even if they were already extensively subcategorized for genre or other characteristics. That wasn't necessarily unreasonable 15 to 20 years ago when that rule was first established, as we had far, far fewer articles about films at that time than we do now — but in 2022, a considerable number of the categories are now populated into the thousands or tens of thousands, and would have been deemed too large and in need of diffusion in virtually any other category tree. So the WikiProject has now established a consensus to drop the "all inclusive" rule, but due to the sheer number of articles involved nobody wants to tackle the whole job manually.
 * Ideally try to spread them out over the various categories. Primefac (talk) 08:59, 26 May 2022 (UTC)
 * @Primefac, Is there a limit to the length of a regexp? Currently mine is
 * I have no idea; try it, and if it doesn't work split it up. For what it's worth, you have a lot of unicode spaces in your copy above (which may or may not be present in your original files) so you might want to check that before you run anything. Primefac (talk) 14:31, 26 May 2022 (UTC)
 * Thanks, now removed, and the regex works. I'll have the trial done soon (I've alphabetised the list to try and spread out the categories, not sure how effective it'll be). &#8213;  Qwerfjkl talk  14:41, 26 May 2022 (UTC)
 * See . &#8213;  Qwerfjkl talk  14:48, 26 May 2022 (UTC)
 * (@) &#8213;  Qwerfjkl talk  14:49, 26 May 2022 (UTC)
 * , requesting update here as this has now been hanging for almost two weeks. Bearcat (talk) 19:44, 7 June 2022 (UTC)
 * @Bearcat, you might want to try BAG assistance needed. &#8213;  Qwerfjkl talk  19:55, 7 June 2022 (UTC)
 * BAG assistance needed &#8213;  Qwerfjkl talk  16:33, 10 June 2022 (UTC)
 * I've been on holiday the last two weeks, and BRFAs are a bit far down my "catch-up priority" list, but I'll try to get to these as soon as possible. Primefac (talk) 11:43, 15 June 2022 (UTC)
 * @Primefac, there's no rush. &#8213;  Qwerfjkl talk  22:23, 18 June 2022 (UTC)

After reviewing the edits, I don't have any concerns with this. As with all regexes, please be careful and spot check/fix any errors that may arise. As per usual, if amendments to - or clarifications regarding - this approval are needed, please start a discussion on the talk page and ping. -- The SandDoctor Talk 15:20, 19 June 2022 (UTC)
 * The above discussion is preserved as an archive of the debate. Please do not modify it. To request review of this BRFA, please start a new section at Bots/Noticeboard.