User:JCW-CleanerBot

What is the bot approved for?
The bot has approval for the following 5 tasks. Tasks 1–4 are approved for automated editing, while task 5 is approved only for semi-automated editing.
 * 1) Closing all exposed disambiguators in work, journal, magazine, publisher.
 * 2) Fixing non-correct ways to refer to a publication.
 * Automated fixing will be done with 'sure shot' code.
 * (case insensitive) &rarr;
 * This would do both capitalization fixing ( →  ) and dot removal (  →  ), and be guaranteed to only touch the journal parameter.
 * Semi-automated fixing will be done with 'sloppy' code.
 * (case insensitive) &rarr;
 * This will also do both capitalization fixing, and dot removal, but would remove the final dot in a sentence like . This will be used for cases where it'll be quicker to be sloppy and perform a manual review.
 * 1) AWB general fixes. The bot will skips making only minor/whitespace genfixes, although they may be performed alongside its main task.
 * 2) Removal of Italics title from pages with Infobox journal/Infobox magazine
 * 3) AWB typo fixing, which will only be done semi-automatically per WP:CONTEXTBOT.
 * 1) AWB typo fixing, which will only be done semi-automatically per WP:CONTEXTBOT.

What are non-correct ways to refer to a publication?
Listing every type of 'non-correct' journal names would be rather long. But they have 7 common themes:
 * A) Legit typo/misspelling
 * &rarr;


 * B) Bad dots
 * &rarr;
 * &rarr;
 * This does not touch undotted abbreviation like


 * C) Bad capitalizations
 * &rarr;
 * &rarr;


 * D) Non-standard abbreviations to standard abbreviations. The standard ISO 4 abbreviation list can be found here.
 * &rarr;
 * &rarr;
 * The bot will respect all field-standard abbreviations, like those from MathSciNet/Mathematical Reviews, the Bluebook, or the National Library of Medicine/PubMed. For example, the ISO 4 abbreviation for Transactions of the American Mathematical Society is, but the MathSciNet abbreviation is  , and won't convert from one form to the other.


 * E) Subtitles
 * &rarr;
 * This does not cover subtitles like Journal of Physics: Conference Series


 * F) Remove multi-language language entries to match the language of the article cited [semi-automated only, Latin alphabet only].
 * &rarr; or
 * This does not touch entries in a journal with an official title in a different language (e.g. an English article published in Naturwissenschaften) even if the translated title (The Science of Nature) later becomes the official title.


 * G) Database 'leftovers'
 * &rarr;

Under the BRFA, the bot only touches journal for 2A, but I eventually plan on covering work / magazine / publisher later. Later means "once we get the equivalent compilation to WP:JCW for magazines and publishers". There will be a specific BRFA for those, although I'm pretty sure that would get speedily approved.

The bot screwed up! What should I do?
Although I take great care to ensure that it doesn't screw up, it's possible that I screwed up the logic somewhere, or that there's a corner case it didn't handle correctly.

If the bot screwed up, follow these few simple steps:
 * 1) Assume good faith
 * 2) Leave a message on the bot's talk page. This will stop the bot from editing. Your message should include
 * 3) A diff of the problematic edit.
 * 4) An explanation of why you think the edit is wrong.
 * 5) We'll discuss the issue. If the bot was wrong, I'll either update the bot, or find some other way to prevent the edits in the future.

If the bot affected more than a handful of pages, I would advise against mass-reverting before discussion. If the bot was right, it'll be annoying to have to re-do those edits later. And if the bot was wrong, it'll likely be easier to use the bot to fix the problematic edits.

Related projects

 * WP:JCW, after which the bot is named
 * WP:MCW