Wikipedia:Bots/Requests for approval/DASHBot 5


 * The following discussion is an archived debate. Please do not modify it. To request review of this BRFA, please start a new section at WT:BRFA. The result of the discussion was Symbol keep vote.svg Approved.

DASHBot 5
Operator: Tim1357

Automatic or Manually assisted: Automatic

Programming language(s): Python (pywikipedia)

Source code available: If ya want it.

Function overview: Remove non-free images from non-mainspaces.

Links to relevant discussions (where appropriate):

Edit period(s): Daily

Estimated number of pages affected: 1000 to start. I have no idea how many after the backlog is done.

Exclusion compliant (Y/N): Y

Already has a bot flag (Y/N): N

Function details: Replaces BJBot, generates a list of Fair-Use images in non-mainspace from Betacommand's tool here. Then, it replaces any and all fair use images with File:NonFreeImageRemoved.svg. It replaces multiple images at the same time (if there are more than one in the page). I am considering making a new version of NonFreeImageRemoved.svg, so that when it replaces the image, it does not fill the entire page. Perhaps the new image would be 200px. That way the biggest it could be would be 200px. See what happens otherwise here.

Discussion
Per the request of SoWhy, the bot changes all images in the talk namespaces to links. Tim1357 (talk) 23:32, 27 December 2009 (UTC)
 * UPDATE: Also per the recomendation of SoWhy, the bot will leave a note on the talk page of the user that added the image. Tim1357 (talk) 01:59, 28 December 2009 (UTC)

It's an appropriate task, as non-free images should never be in non-main space. --IP69.226.103.13 (talk) 20:19, 28 December 2009 (UTC)
 * Will it pick up Non-free images from their inclusion in the non-free media category or my transclusion of a non-free image template?  MBisanz  talk 10:17, 30 December 2009 (UTC)
 * I'm not sure how betacommand generates the list. Let me ask him. Tim1357 (talk) 14:03, 30 December 2009 (UTC)
 * The list is apparently from Category:All non-free media Tim1357 (talk) 03:05, 31 December 2009 (UTC)
 * (recursive of course) Tim1357 (talk) 03:10, 31 December 2009 (UTC)
 * What do you mean "recursive of course?" -- IP69.226.103.13 19:18, 31 December 2009 (UTC)
 * Oh, im sorry. Recursive means that it finds all the files that are in subcategorys as well. That means that articles in Category:Non-free_musical_artist_logos are included in the list, as Category:Non-free_musical_artist_logos is a sub-category of Category:All non-free media. If there are sub-sub categories (i.e. categories in sub-categories) it finds the files in those categories as well. Tim1357 (talk) 19:30, 31 December 2009 (UTC)
 * Well, recursive has a different specific technical meaning, so it's more useful, imo, in a bot discussion, which is a community discussion, to say it will check subcategories too, unless you're discussing the code in particular. In this case it's about what it does. Thanks. -- IP69.226.103.13 19:36, 31 December 2009 (UTC)
 * You're right. Thanks : ) Tim1357 (talk) 00:28, 1 January 2010 (UTC)

I've scaled the default size of File:NonFreeImageRemoved.svg to 200×200px to avoid the problem you noted above. It's an SVG, so this should have no effect when it's used with an explicit size.

I'd also like to suggest that the bot should leave the name of the replaced file visible in some way, e.g. in a &lt;!-- comment --&gt; or, where possible, linked from the caption. People something include non-free images in discussions also outside the odd namespaces (e.g. on the village pumps or the refdesks, and IME quite often at the graphics lab), and it can be annoying to have to dig through the history for the name of the image being discussed. Alternatively, perhaps the bot should treat the Wikipedia: namespace as if it were a talk namespace. —Ilmari Karonen (talk) 10:02, 1 January 2010 (UTC)
 * I was thinking that too, I like the Idea of a comment. The only problem is: sometimes the non-free image is in an infobox, and Im not sure how to turn those into links, as they often-times do not have the or [[Image: markers within the box. [[User:Tim1357 (talk) 16:03, 1 January 2010 (UTC)
 * Ok, seems like a good idea to trial as soon as we can figure out the infobox issue, I think User:ST47 had a solution to that once, so you might try emailing him.  MBisanz  talk 00:53, 3 January 2010 (UTC)
 * You can also try asking one of the editors who does a lot of work with templates (User:ThaddeusB?), or check the infobox discussion pages to find someone. -- IP69.226.103.13 |  Talk about me.  05:18, 3 January 2010 (UTC)
 * For infoboxes, I think simply including the original image name in a &lt;!-- comment --&gt; (as in, say, " ") is probably the best solution. The MediaWiki parser strips comments pretty early, so they shouldn't affect the infobox syntax.  —Ilmari Karonen (talk) 14:58, 3 January 2010 (UTC)
 * Ok, I guess that works ok. Tim1357 (talk) 16:26, 3 January 2010 (UTC)

Will the bot avoid editing the same page a number of times in a row, such as happened with BJBot here? - Kingpin13 (talk) 09:04, 5 January 2010 (UTC)
 * I dont know why that page was edited, isnt it in the mainspace? Anyways, the bot will group multiple files into a single edit. Tim1357 (talk) 01:33, 7 January 2010 (UTC)
 * I can't spot it being mentioned anywhere; will the bot be ignoring pages in Category:Wikipedia non-free content criteria exemptions? - Kingpin13 (talk) 09:52, 8 January 2010 (UTC)
 * Not yet, ill find a way to code that in there. Tim1357 (talk) 22:00, 8 January 2010 (UTC)
 * Tim1357 (talk) 18:36, 10 January 2010 (UTC)
 * All ✅ with the code, the bot trys to link the image if it can, and replaces it with File:NonFreeImageRemoved.svg if it cant. In all cases, it leaves an inline comment, leaving the images name if it was replaced. Im ready for trial if nobody objects. BAGAssistanceNeeded Tim1357 (talk) 02:30, 11 January 2010 (UTC)
 * No objections from me. The sooner we clean these problems up, the better, imo. -- IP69.226.103.13 |  Talk about me.  22:47, 10 January 2010 (UTC)
 *  MBisanz  talk 05:38, 11 January 2010 (UTC)
 * ✅ see /log Tim1357 (talk) 06:07, 15 January 2010 (UTC)
 * I don't see any user warnings...? - Kingpin13 (talk) 09:53, 15 January 2010 (UTC)

If I may step in to clarify here: it appears that BQZip01 is asking for all image removals be noted on the image's talk page, and I would suggest some sort of notice placed on the image's page pointing to the talk-page. The intention here is that if images have been removed in error, then on the image's talk page is a list of all the reversions that need to be made / the affected pages. "Here's every article that used to use this image, before the bot had its way with them". Please correct me if I'm wrong.

As an aside, there might even be an easy way "Click here to revert these removals"-style to undo the bot's actions (presumably by having the bot do so). Josh Parris 00:59, 20 January 2010 (UTC)
 * Ok here is what I think is a reasonable compromise:
 * The edit summary looks like this: Robot: Removing N Non-Free files per WP:NFCC (Shutoff | Log | Error?)
 * The talk pages of Images are not edited, I think it is kind of spammy, and really serves no purpose.
 * The user-message has links to the log and the error page.
 * The log page is sorted by day. Each Day has its own table, with the actions sorted by file. The table will contain links to diffs performed by the bot.
 * Anytime more then 40 images are called to be removed, it waits for human conformation.
 * I hope that works well for everyone.

Tim1357 (talk) 02:04, 20 January 2010 (UTC)
 * Tim, I love what you are trying to do here, but I think you are missing the point that Josh and I are trying to make. Let's say an image with an improper label is used on 39 user pages and is removed by a human, with no link to the image or its talk page, no one knows what was removed unless you were watching the pages upon which it was used. I agree it is "kind of spammy", but that is kind of the point. I have no problem with appending
 * ==DASHBot image removal==
 * DASHBot removed this image from N pages on [www.google.com 21 January 2009] ~
 * N would be the number of pages (you're already tracking this) and should be easy to insert.
 * The link would be to the log entry.
 * This would make changes much easier.
 * Also, will the log feature a clickable link to undo a group of actions? or will each individual page require an individual "undo" (don't get me wrong but fixing any problems would be a gold mine for my edit count :-) ) — BQZip01 —  talk 02:19, 21 January 2010 (UTC)
 * I have multiple reasons why that would not be a good idea:
 * BJBot Once removed a fair use image from one of my userspace draft. I was confused for a second, but the edit comment, the inline comment and the userpage all explained it to me. I did not think of checking the talk page.
 * Included in the user message explanation is a link to report errors (my talk page). Note that the bot logs all removals, so the log will be useful to anyone.
 * There are hundreds of files to be removed each week. What if the same file is removed multiple times from an unwatched talk page? That could ammount to a huge ammount of posts to a talk page that nobody will ever look at anyways.
 * I think leaving messages on talk pages is overkill. User talk pages should suffice, along with the edit summaries, and the inline comments.
 * Ok thats it. And in IRC, Josh_Parris and I discussed this. He had some reservations, but seemed to sign off on this. Ill get him to comment here again if he can. Tim1357 (talk) 02:33, 21 January 2010 (UTC)
 * I did express reservations, right up to the point where Tim1357 pointed out that if images were removed in error, he'd fix the problem himself. Given the bot will only run while Tim1357 is still around, it's the ultimate fall-back; he can go trawling through the logs to figure out what needs reverting.
 * Having read BQZip01's comments, I believe his scenario is one where the user in question is not using the image on their user page, but may perhaps be the creator of the image. Suddenly she notices that the image isn't used on 38 pages, it's used on 2.  There's no where for her to go to discover what happened.  (BQZip01, correct me if this scenario is not similar to what you are envisaging)  If this is the scenario BQZip01 is contemplating, you might get around it by simply placing a single note on the talkpage "At various times this image was removed from one or more user pages as required by ; for details inquire at User Talk:DASHBot or see the bot logs at " Josh Parris 12:08, 21 January 2010 (UTC)
 * That's basically the gist of it. Somehow I am failing to get the point across to Tim (my fault Tim, not yours).
 * In your scenario, BJBot removed the image and you checked the edit history. Let's say you fixed the tag on the page and decided to add it back into your page. What you don't know is that the same image was removed from 38 other pages (a hypothetical situation here) and you wouldn't have any idea that actually happened unless you were watching one of those pages. You have no way to know what the Bot did. While you can see the log, you will have to know what date it was removed on in which articles it was used to find the appropriate diff(s) to undo. — BQZip01 —  talk 17:45, 21 January 2010 (UTC)
 * Sorry, I missed the comment that you'd fix any errors yourself. By all means press ahead! — BQZip01 —  talk 06:20, 25 January 2010 (UTC)

OK, If there is no more problems. Maybe I could get started? Tim1357 (talk) 20:23, 31 January 2010 (UTC) BAGAssistanceNeeded
 * Appears issues are resolved, approving.  MBisanz  talk 20:06, 2 February 2010 (UTC)
 * The above discussion is preserved as an archive of the debate. Please do not modify it. To request review of this BRFA, please start a new section at WT:BRFA.