Wikipedia:Bots/Requests for approval/FastilyBot 5


 * The following discussion is an archived debate. Please do not modify it. To request review of this BRFA, please start a new section at WT:BRFA. The result of the discussion was Symbol keep vote.svg Approved

FastilyBot 5
Operator:

Time filed: 07:14, Monday, March 7, 2016 (UTC)

Automatic, Supervised, or Manual: Automatic

Programming language(s): Java

Source code available: When I have written it

Function overview: Find files tagged with both a free and non-free license tag, and apply to the file description page.

Links to relevant discussions (where appropriate):

Edit period(s): Weekly

Estimated number of pages affected: 1-2k

Exclusion compliant (Yes/No): no

Already has a bot flag (Yes/No): yes

Function details: See above - F ASTILY 07:14, 7 March 2016 (UTC)

Discussion
This will probably mostly return false positives: photos of non-free sculptures both need a non-free copyright tag for the sculpture and a licence from the photographer. Unfortunately, there currently doesn't seem to be a way to tag such files so that they do not end up in both Category:All free media and Category:All non-free media. --Stefan2 (talk) 11:25, 7 March 2016 (UTC)
 * True, but it should be fairly trivial to exempt tags which might lead to false positives (e.g. ) from the result set - F ASTILY 11:54, 7 March 2016 (UTC)
 * Which tags would appear in that list? --Stefan2 (talk) 12:43, 7 March 2016 (UTC)
 * I've started an ignore list here; please feel free to add any other titles you can think of! - F ASTILY  04:43, 10 March 2016 (UTC)
 * Photos of non-free 3D artworks often (but not always) use photo of art to indicate the two copyright tags, so I added that template to the list. I don't know if the current list is complete. --Stefan2 (talk) 22:11, 12 March 2016 (UTC)
 * Should files tagged as NFUR not needed also be ignored? Jo-Jo Eumerus (talk, contributions) 18:31, 24 March 2016 (UTC)
 * I'd assume so. Those files have been identified as free, but the fair use rationale needs to be converted to information and remains on the page since it provides some information of the file. --Stefan2 (talk) 21:44, 24 March 2016 (UTC)
 * Do we feel that we can keep the false positive number low if we ran this through a small trial or two for adjusting the ignore list? If by the second one it's looking abysmal, we could just abort the idea. -- slakr  \ talk / 03:24, 24 March 2016 (UTC)
 * There are currently 278 files which appear in Category:All free media and Category:All non-free media and which do not have any of the tags in User:FastilyBot/Task5Ignore. I took 20 random files from that set and checked them.
 * These files should not appear in both categories, and the file information pages should be edited to remove them from one of the cats:
 * File:Summer Forever Movie Poster.jpeg
 * File:Masisa logo.png
 * File:ITV promotional poster for Doors Open.jpg
 * File:KQLK-FM 2015.png
 * File:CSC Group Logo.jpeg
 * File:Magic923.jpeg
 * File:Harbhajan Singh Yogi with Hopi elder.jpg
 * File:Param 2.jpg
 * File:Denver Revised Municipal Code title page.jpg
 * File:Asian-college-of-science-and-technology-logo.jpg
 * File:Perry-Mason-Returns-intertitle.jpg
 * File:LESSDockingManeuver.jpg
 * File:Qualcomm Stadium logo.jpg
 * These files appear in both categories, but should not be tagged with wrong license by FastilyBot for one reason or another:
 * File:2 euro mo series1.gif - different copyright tags refer to different parts of the image
 * File:Internet Explorer 4.png - different copyright tags refer to different parts of the image
 * File:Opera 7.02.png - different copyright tags refer to different parts of the image
 * File:Netscape9.png - different copyright tags refer to different parts of the image
 * File:Peruvian Airlines White Logo.jpg - already has wrong license, no need to add a second one
 * File:Internet Explorer 8 InPrivate.png - different copyright tags refer to different parts of the image
 * File:Nintendo - 1950.png - different copyright tags refer to different countries
 * It looks as if the number of false positives will go down a lot of Copyright by Wikimedia is added to the list of exempted templates as it seems that many false positives are screenshots which show a non-free web browser and a Wikipedia page. I'll go through the first set of files in my list above and fix them. --Stefan2 (talk) 00:33, 25 March 2016 (UTC)

While I'm not too thrilled about the prospect of false positives, it looks like there's a belief it might be mitigated with proper whitelisting. I'd strongly recommend a dry run. -- slakr \ talk / 02:21, 2 April 2016 (UTC)
 * Agreed, I'll do a few dry runs and try to refine the rule set before actually making any edits. - F ASTILY 23:45, 3 April 2016 (UTC)
 * Have the trials been performed yet? — xaosflux  Talk 03:51, 25 April 2016 (UTC)
 * Yes, I'm doing a series of dry runs to refine the ignore list. Here is the current list of files that would be tagged by the bot if it were live.  Anybody is welcome to help review the files and add templates to the ignore list -  F ASTILY   04:47, 1 May 2016 (UTC)
 * ; fixed one false positive involving, which has since been added to the ignore list. -  F ASTILY   06:42, 2 May 2016 (UTC)
 * I've checked the list. Most of the files indeed have problems. There are a lot of files like File:MarcOPoloLogo.png which have correctly been tagged as PD-textlogo, but someone needs to convert logo fur and similar templates to Information. I converted some, but there are too many of them. Also, a number of clearly non-free files have a FUR but a free licence. It's maybe useful to add free screenshot and/or GPL to the list as software screenshots may show different things with different copyright status. See for example File:Compaadblocker.png and File:Deep note on Audacity.png. It's maybe also a good idea to leave out the deletion process (that is, deletable file and ffd) as the deletion tags probably already address the problems with the copyright status. --Stefan2 (talk) 23:17, 6 May 2016 (UTC)
 * Sounds good to me; I've added the suggested templates to the ignore list - F ASTILY   04:03, 7 May 2016 (UTC)

If there are no other objections/comments, could this task please be approved? Thanks! - F ASTILY   05:13, 8 May 2016 (UTC)
 * @Pinging Slakr and @Xaosflux: any chance one of you could review the BRFA? Thanks! :) -  F ASTILY   05:16, 12 May 2016 (UTC)
 * Sorry for the delay. (1) I'm assuming you've fix the missing line break on the tags (2) What I've been looking for is any editor to take any action based on your tagging, I haven't checked every page yet, but it doesn't look like this being followed up on by anyone. I don't see a link above, did anyone ask for this task to be done or are you the requester as well?  —  xaosflux  Talk 04:08, 17 May 2016 (UTC)
 * (1) Yes, this is fixed. (2) Yes, I am the requester.  This is simply a maintenance/categorization task, and not anything controversial;  does not imply impending deletion.  Also I should note that the bot is de-facto exclusion compliant per the ignore list, as it will ignore pages transcluding  -  F ASTILY   04:53, 17 May 2016 (UTC)
 * Thank you - everything looked fine on the trials, good to go. — xaosflux  Talk 11:35, 17 May 2016 (UTC)


 * — xaosflux  Talk 11:35, 17 May 2016 (UTC)
 * The above discussion is preserved as an archive of the debate. Please do not modify it. To request review of this BRFA, please start a new section at WT:BRFA.