Wikipedia:Bots/Requests for approval/GreenC bot 6


 * The following discussion is an archived debate. Please do not modify it. To request review of this BRFA, please start a new section at WT:BRFA. The result of the discussion was

GreenC bot 6
Operator:

Time filed: 15:38, Friday, July 20, 2018 (UTC)

Automatic, Supervised, or Manual: Automatic

Programming language(s): BotWikiAwk

Source code available: Yes

Function overview: Convert New York Times archives from old to new format. Example.

Links to relevant discussions (where appropriate): Bot_requests

Edit period(s): one-time

Estimated number of pages affected: 29,707 links (fewer pages)

Namespace(s): Mainspace

Exclusion compliant (Yes/No): yes

Function details: The New York Times is an important citation source on Wikipedia. Keeping the archive URLs up to date will ensure there is no link-rot if/when redirects stop working in the future. The new URL is more informative with date information and file type (PDF in the example), the later affects the display output of CS1|2 templates.

The bot works by checking the page header of the old url, looking for Location: of the redirect and testing it works then replacing in the wikisource. It will leave any web archived URLs as-is eg. any NYT links archived at the WaybackMachine.

Discussion

 * please report back here when done trial, include diff range. — xaosflux  Talk 16:08, 20 July 2018 (UTC)
 * User:Xaosflux, looking more closely at the data I made a mistake. There are not 29,707 links, closer to 200. Most of the links in query.nytimes.com are for a different type of page not the timesmachine.nytimes.com. There's also special cases that make the bot more complex than I realized. And I'm now confused how the Times has its site organized, to confidently change the URLs to the redirects. Probably best to close this out for now until it's more clear what should be done. --  Green  C  01:10, 21 July 2018 (UTC)


 * per above. —  xaosflux  Talk 01:53, 21 July 2018 (UTC)


 * The above discussion is preserved as an archive of the debate. Please do not modify it. To request review of this BRFA, please start a new section at WT:BRFA.