Wikipedia:Bots/Requests for approval/KiloBot


 * The following discussion is an archived debate. Please do not modify it. Subsequent comments should be made in a new section. The result of the discussion was Approved.

KiloBot
Operator: Wikihermit

Automatic or Manually Assisted: Automatic - unsupervised

Programming Language(s): pywikipedia

Function Summary: Search new pages for copyright violations

Edit period(s) (e.g. Continuous, daily, one time run): Most likely continuously

Edit rate requested: read request: 5 edits per minute

Already has a bot flag (Y/N): N (probably won't need one)

Function Details: KiloBot will search Special:newpages for copyright violations. If it finds one, it will save the page to a text file on my computer, along with the website where the matching text was found. I can then add the log to Wikipedia for users to deal with the copyvios. I would have the bot upload it to a page on Wikipedia, but I'm having problems with pagefromfile.py. If anyone could help (either by helping me get pagefromfile.py to work, or by rewriting copyright.py to upload to a user page), that would be great. ~ Wikihermit 04:17, 6 July 2007 (UTC)
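The logging step described above can be sketched roughly as follows. This is only an illustration: the function name `log_suspected_copyvio` and the one-line-per-match format are assumptions made for the example, not the format that copyright.py actually writes.

```python
from pathlib import Path


def log_suspected_copyvio(log_path, page_title, source_url):
    """Append one suspected copyright violation to a plain-text log.

    The wikitext bullet format used here is hypothetical; it is chosen
    so that the log could later be pasted onto a wiki page as a list.
    """
    line = f"* [[{page_title}]] may be copied from {source_url}\n"
    # Open in append mode so earlier entries are preserved between runs.
    with Path(log_path).open("a", encoding="utf-8") as log:
        log.write(line)
    return line
```

Each call adds one bullet, so a file such as output.txt grows into a list that an operator can review before posting.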

Discussion

 * How will it search for copyright violations? Google?  Daniel  05:13, 6 July 2007 (UTC)
 * If this bot will only be reading, it doesn't need much approval here, as it obviously doesn't need flagging. To be official we can still go through this though in which case:


 * 1) How often will this bot run (24/7?)
 * 2) When (if ever) will the bot edit?
 * Thanks, —  xaosflux  Talk 05:31, 6 July 2007 (UTC)


 * The bot will use Yahoo to search for copyright violations, since Yahoo allows use of its search engine with an ID. The bot will run 24/7 to check newpages. It will edit as soon as I can figure out the command to enter for pagefromfile.py. If I can't figure this out shortly, we can close the BRFA until I do. (If anyone can help with this, this is what I've been entering; I need help with the parts after -file.)

c:\pywikipedia2 c:\python25\python pagefromfile.py -file:c:\pywikipedia2\copyright\output.txt -appendtop -summary:logging
 * I need the bot to edit/add logs to User:KiloBot/CopyVioLogs... I'm not sure what to do with the -end:xxx and -start:xxx (it hasn't been working :-/)
 * So, in short, the bot can detect copyvios and report them to a text file. I can then put that into a Wikipedia page. The (optional) part that doesn't work is getting the bot to upload it automatically. Thanks! ~ Wikihermit 05:50, 6 July 2007 (UTC)
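On the -start:xxx and -end:xxx question: pagefromfile.py reads an input file in which each page sits between a start marker and an end marker, with the page title in bold on the first line, and those two options change the marker text. The sketch below assumes the default markers are {{-start-}} and {{-stop-}} and that the title is wrapped in triple apostrophes; both are worth checking against the script's own help text. The helper name `wrap_for_pagefromfile` is made up for this example.

```python
def wrap_for_pagefromfile(title, body, start="{{-start-}}", end="{{-stop-}}"):
    """Wrap log text so that it forms one page entry in an input file.

    `start` and `end` mirror the values that would be passed with
    -start:xxx and -end:xxx; the defaults here are assumed, not verified.
    """
    # The first line after the start marker carries the page title in bold.
    return f"{start}\n'''{title}'''\n{body.rstrip()}\n{end}\n"
```

A log wrapped this way for the title User:KiloBot/CopyVioLogs could then be written to the file passed with -file, without needing to set -start or -end explicitly.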


 * Yeah, as long as it's only editing one page in its own userspace and not going crazy (read: 50+ writes a minute) I think it's a speedy approve. It's going to be putting less stress on the servers than the anti-vandalism bots, and most of its actions will be reads, which are fairly light on the servers. -- Tawker 17:48, 6 July 2007 (UTC)


 * Yup. I'll keep it down to five reads/edits per minute. I'm trying to use C# now along with pywikipedia, but my laptop doesn't have internet access right now, and it has Microsoft Visual Basic on it, so I'll have to work on it in two weeks or so. And it will only edit one page (in its own userspace, unless there is demand for a bot that reports possible copyright vios in the Wikipedia space). ~ Wikihermit 02:34, 10 July 2007 (UTC)
 * I wouldn't rush to a speedy approval just yet. I'd very much like to see some sample work first. The idea sounds good, but I'd like to see how well the execution actually goes before granting a rushed blanket approval. -- Cyde Weys 03:32, 12 July 2007 (UTC)
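For reference, the limit of five reads/edits per minute mentioned above amounts to one operation every 12 seconds. A minimal sketch of such a limiter is shown here; the `Throttle` class is hypothetical and independent of any throttling the bot framework may already provide.

```python
import time


class Throttle:
    """Space operations evenly so that at most `max_per_minute` happen per minute."""

    def __init__(self, max_per_minute=5, clock=time.monotonic, sleep=time.sleep):
        # Five operations per minute means a 12-second minimum gap.
        self.min_interval = 60.0 / max_per_minute
        self.clock = clock
        self.sleep = sleep
        self.last = None  # time of the previous operation, if any

    def wait(self):
        """Block until the next operation is allowed; return the seconds slept."""
        now = self.clock()
        delay = 0.0
        if self.last is not None:
            delay = max(0.0, self.last + self.min_interval - now)
            if delay > 0:
                self.sleep(delay)
        self.last = now + delay
        return delay
```

Calling `wait()` before every read or edit keeps the bot at or below the requested rate, and the injectable `clock` and `sleep` arguments make the behaviour easy to check without real delays.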

Can we hold the speedy approval bit a moment? Please see User talk:Eagle_101, where this bot operator asked me to write some code for him. I'm presuming that this code would be used in conjunction with this bot, as it's specifically stated in this request that there is no way to upload the results. Who is writing the code for this? How much of the code is actually working? Could you go step by step and explain how the bot knows which pages to check: is it using IRC, the RSS feed, etc.? Also, how does it identify a copyright violation? What are the criteria? Also note that Google has a 2,500 search (query) limit per IP that accesses its servers (this is assuming that Google is being used here to find the copyright violations). Any and all replies to these questions would be welcome. Thanks. ——  Eagle 101 Need help? 04:45, 15 July 2007 (UTC)
 * Almost all of your questions are answered above. They have the main code, just not the uploading code. They are not using Google; they are using Yahoo, with their API. Matt/TheFearow (Talk) (Contribs) (Bot) 04:49, 15 July 2007 (UTC)
 * OK, that's fine, I missed the Yahoo bit above. In that case, may I ask what exactly the criteria are that pywiki is using? ——  Eagle 101 Need help? 04:59, 15 July 2007 (UTC)
 * Yeah, I went ahead and wrote a script (for upload) and am going to try it out. The bot uses pywikipedia's copyright.py. As for what to check, it's an argument: -new:50. I hope this answers your questions. ~ Wikihermit 01:04, 16 July 2007 (UTC)

 May I point out that a similar bot already exists? It is Wherebot, and it has been operating on Suspected copyright violations for quite a while now. Perhaps some communication with Where would be in order so as not to duplicate functionality? Миша 13 09:03, 16 July 2007 (UTC)


 * Duplicate bots are great; they stop things from not being done if one is offline. Also, this scans newpages, not a prepared list. That means it will find violations a lot more easily. Matt/TheFearow (Talk) (Contribs) (Bot) 00:43, 17 July 2007 (UTC)


 * Just to clarify, Wherebot scans the new pages feed on IRC; it does not operate on a prepared list. It creates a list.  Please see the source here.  — Madman bum and angel (talk – desk) 02:48, 19 July 2007 (UTC)


 * ✅ I've written a quick C# script to upload the .txt file to Wikipedia. I published the source: User:KiloBot/source. The bot will still edit in its userspace only, and I can use   to put it on Suspected copyright violations. ~ Wikihermit 17:45, 26 July 2007 (UTC)
 * Example. ~ Wikihermit 17:50, 26 July 2007 (UTC)

Make sure the edit rate isn't excessive, and you are free to continue. --ST47 Talk·Desk 18:36, 26 July 2007 (UTC)


 * The above discussion is preserved as an archive of the debate. Please do not modify it. Subsequent comments should be made in a new section.