Wikipedia:Reference desk/Archives/Computing/Early/Dmcdevit 20050718

I wonder if you could update the List of Wikipedians by number of edits. There's been a lot of hubbub recently after the page was put on VfD for being out of date. It looks like no one there knows how to update it anymore, and to be honest, I don't care for the pag much,so I don't know much about how it works. Help is appreciated. --Dmcdevit·t 22:11, July 18, 2005 (UTC)
 * Who ever put it up for deletion is maniacal. If it's out of date - update it!--Knucmo2 17:55, 16 August 2005 (UTC)
 * I'll take a look and see what I can do. Triddle 20:48, July 19, 2005 (UTC)
 * It looks like the SQL query will have to be rewritten. I would do it myself but I don't have the time at the moment. I'll be able to get to this in the future but not for a while. In the mean time I hope someone else will notice this and write the query. Triddle 18:00, July 20, 2005 (UTC)

See Wikipedia_talk:List of Wikipedians by number of edits. I have a program running (on Unix, but it's in C and uses Stdio, so it'll run on anything) to grovel over a database dump and produce the StatisticsUsers.csv file (or something that looks something like it, if another format would be easier). Now I need to track down what the next stage was (in terms of turning that into the stuff on List of Wikipedians by number of edits. Can you help with that? Noel (talk) 04:25, 22 July 2005 (UTC)


 * Good work; that should be a quick read, store, and sort operation in Perl to get the sorted list. If you can give me a short sample of the dataset (or even a link to the entire dataset), I'll try my hand at a Perl program that'll generate the output. Triddle 15:44, July 22, 2005 (UTC)

Edit count (all): SELECT rev_user_text User, COUNT(*) Edits FROM mw_revision revision WHERE rev_user != 0 GROUP BY rev_user ORDER BY Edits DESC LIMIT 2000

Edit count (number of articles): change COUNT(*) to COUNT(DISTINCT rev_page)

Edit count (non-minor): add  AND rev_minor_edit = 0 to the WHERE clause

Edit count (main namespace only): add  LEFT JOIN mw_page page ON rev_page = page_id to the FROM clause and  AND page_namespace = 0 to the WHERE clause

I'm going to ask a dev to run <tt>EXPLAIN</tt> on some of these and decide if it's worth doing. r3m0t talk 21:40, July 24, 2005 (UTC)

Suggestions

 * 1) The SQL that is created to generate this report should be attached to this ticket so this problem can be avoided in the future. Triddle 18:00, July 20, 2005 (UTC)

Status

 * Case needs help Triddle 18:00, July 20, 2005 (UTC)
 * Case opened Triddle 20:48, July 19, 2005 (UTC)