Wikipedia:Bots/Requests for approval/DoggoBot


 * The following discussion is an archived debate. Please do not modify it. To request review of this BRFA, please start a new section at Bots/Noticeboard. The result of the discussion was

DoggoBot
Operator:

Time filed: 17:57, Wednesday, June 2, 2021 (UTC)

Automatic, Supervised, or Manual: automatic

Programming language(s): AutoWikiBrowser, RegEx replaces

Source code available: AWB. The regex is available at User:EpicPupper/sandbox/DoggoBot signature replacements.js, and the custom replace for PumpkinSky's signatures is at.

Function overview: Bot to fix various Lint errors, as well as replacing PumpkinSky's signatures including font tags with span tags.

Links to relevant discussions (where appropriate): BOTREQ, Bots/Requests for approval/MalnadachBot

Edit period(s): One time run, unless more conclusive regexes are written

Estimated number of pages affected: 1,081 currently most likely much more with the new regexes, need to run a search to get the exact number

Exclusion compliant (Yes/No): Yes

Already has a bot flag (Yes/No): Yes

Function details: Requesting bot flag as to not clutter RC. The bot account will need AWB access. All edits will be marked as "minor". Tested once with my own account here (with JWB, as I was on my Mac, but the bot will probably run AWB). "Unicodify" function will be enabled.

Discussion
I support this task, and request that you add more regexes so that pages do not have to be revisited by multiple bots. On the page linked above in the sample diff, there were multiple easily fixable font tags, along with missing italics at the end of Template:RM bottom, that could be fixed with simple regexes while the bot is visiting each page. There are many such regexes listed at Bots/Requests for approval/MalnadachBot, although the operator reports that they resulted in some false positives. I have some more limited regexes at User:Jonesey95/AutoEd/doi.js, in the sections marked "font wrapping links - move inside link and convert to span tag", "replace font tag in user and user talk links with span tag", and "replace all deprecated font color tags enclosing untagged text". I have never seen a false positive with those regexes. – Jonesey95 (talk) 20:12, 2 June 2021 (UTC)


 * Thank you for mentioning these. I'll take a look at those pages (and the related discussions linked at the BRFA) and try to pick up all the regexes that seem to have no false positives. I prefer to stay on the safe side and only add the non-false positive regexes. EpicPupper (talk, contribs) 21:27, 2 June 2021 (UTC)
 * As a note, looking at WP:HTML5, there seem to be many other easy replaces (eg strike tags with s tags), which hopefully can be incorporated into this request. Compiling a full list later. EpicPupper (talk, contribs) 21:31, 2 June 2021 (UTC)
 * Please let me know when you are ready for a trial. -- The SandDoctor Talk 14:31, 3 June 2021 (UTC)
 * Thank you TheSandDoctor. could you take a look at the regexes at User:EpicPupper/sandbox/4.js? I am aware that the script is very erroneous and will most likely not work, I am just using it as a dumping ground for regexes (I'll input them raw in AWB). EpicPupper (talk, contribs) 16:27, 3 June 2021 (UTC)
 * If you are planning to run this bot without supervision, I recommend removing the sections entitled "Links in links", "Fix linter problems", "Fix linter missing or extra end tags", "div-span-flip error", "small tag or template wrapping multiple lines", and "replace deprecated tt tags" (both sections). Certain regexes in those sections can create new errors or recommend inappropriate changes. I always use my script in a supervised mode, inspecting each preview before I save it, so that I can avoid those inappropriate changes. – Jonesey95 (talk) 17:12, 3 June 2021 (UTC)
 * Removed, please tell me if you have any other concerns. EpicPupper (talk, contribs) 17:47, 3 June 2021 (UTC)
 * I think I'm ready for a trial. EpicPupper (talk, contribs) 04:08, 4 June 2021 (UTC)
 * Any objections? -- The SandDoctor Talk 04:37, 4 June 2021 (UTC)
 * Looks good to me. – Jonesey95 (talk) 05:20, 4 June 2021 (UTC)


 * -- The SandDoctor Talk 05:24, 4 June 2021 (UTC)
 * "Fix various errors" is too broad. The specific scope of this bot should be documented here.  Please note, I just denied BRFA Task#2. —  xaosflux  Talk 00:53, 5 June 2021 (UTC)
 * As mentioned above, they (regexes) are listed at User:EpicPupper/sandbox/4.js. EpicPupper (talk, contribs) 02:20, 5 June 2021 (UTC)
 * Courtesy ping. -- The SandDoctor Talk 22:44, 23 June 2021 (UTC)


 * Just wanted to note that I am, I've had some trouble with making the search query for AWB but hopefully I've got it now. 🐶 EpicPupper (he/him 04:12, 23 June 2021 (UTC)
 * , the bot needs to be added to WP:AWB/CP in order to use AWB. You can use AWB's Wiki search (all NS) option and enter insource search string to get a list of pages. ಮಲ್ನಾಡಾಚ್ ಕೊಂಕ್ಣೊ (talk) 04:56, 23 June 2021 (UTC)
 * , yes I am aware that an insource search is possible, however I am having a bit of trouble assembling the search with so many regexes. that reminds be, would my bot be able to added to the AWB CheckPage? Thanks! 🐶 EpicPupper (he/him 16:12, 23 June 2021 (UTC)
 * ✅ for trial purposes. What's the status of this? -- The SandDoctor Talk 22:44, 23 June 2021 (UTC)
 * Hi there and sorry for the extremely late reply. The regex search I put together timed out, so I'll need to split it into chunks of three. I'll finish this today. 🐶 EpicPupper (he/him 20:43, 2 July 2021 (UTC)
 * hey there, it seems like the wikitext CheckPage is not working. Please add my bot to the JSON CheckPage, thanks. I get a permission error in AWB when trying to run the bot. 🐶 EpicPupper (he/him 22:09, 2 July 2021 (UTC)
 * I did? Not sure why you are getting any errors. What JSON page? -- The SandDoctor Talk 22:20, 2 July 2021 (UTC)
 * see here. The JSON CheckPage is located here. The newest version of AWB now requires users to be on the JSON one. 🐶 EpicPupper (he/him 22:23, 2 July 2021 (UTC)
 * I didn't know about the new one, oops. Thanks for the link and the FYI. Done with Special:Diff/1031660294. -- The SandDoctor Talk 22:37, 2 July 2021 (UTC)
 * Just to follow up here: do you still intend to go through with this BRFA? ProcrastinatingReader (talk) 00:58, 8 November 2021 (UTC)
 * @ProcrastinatingReader: Sorry, this went on the backburner for me after some very busy months. I'll try to get this done ASAP. 🐶 EpicPupper (he/him &#124; talk) 02:20, 8 November 2021 (UTC)
 * Any updates or would you rather we put this one on hold? -- The SandDoctor Talk 07:21, 29 December 2021 (UTC)
 * @TheSandDoctor please put this on hold until I can get things sorted out. Still trying to get my searches to work. Thanks! 🐶 EpicPupper (he/him &#124; talk) 19:13, 29 December 2021 (UTC)

. Ping me when ready to continue. -- The SandDoctor Talk 19:42, 29 December 2021 (UTC)
 * Thanks, . 🐶 EpicPupper (he/him &#124; talk) 03:11, 1 June 2022 (UTC)
 * I was the one who posted the BOTREQ before I had a bot account more than a year ago. I have since got my own bot to do this and gone through a dozen BRFAs to fix Lint errors. The scope of User:MalnadachBot/Task 12 covers every type of Lint error, so this task will be a duplicate. For the last 5 months MalnadachBot has been clearing 35-40k errors per day. Of course there is still a large backlog so another bot can work on this too.For the record, I oppose unicodifying non-mainspace pages like this. Unescaped   and some other symbols breaks templates, so editors deliberately choose hexadecimal characters in their signatures. ಮಲ್ನಾಡಾಚ್ ಕೊಂಕ್ಣೊ (talk) 05:24, 1 June 2022 (UTC)
 * Thank you for the update, @ಮಲ್ನಾಡಾಚ್ ಕೊಂಕ್ಣೊ. I'm going to mark this request as I feel that your bot can tackle this task in the most efficent way possible :) I'd recommend taking a look at the replacements and adding any applicable to your workflow. Cheers, 🐶 EpicPupper (he/him &#124; talk) 16:48, 1 June 2022 (UTC)
 * The above discussion is preserved as an archive of the debate. Please do not modify it. To request review of this BRFA, please start a new section at Bots/Noticeboard.