User talk:Henrik/Archive 18

Percentage question
Hi Henrik. I read your FAQ about the page statistics! I use it frequently for my GLAM-WIKI work. I'm writing a case study for the Walters Art Museum who has partnered with Wikipedia. They asked a question: what percentage do we think the page views are from bots? I didn't know if there was some type of guessed/blanked percentage (i.e. approximately 5% of page views are from bots/crawlers). Any idea would be great. And of course - thank you for the great tool and the great things you do for the movement. SarahStierch (talk) 06:21, 15 January 2013 (UTC)

The Signpost: 14 January 2013

 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 15:52, 16 January 2013 (UTC)

Undeletion of Artist Tim Alek Mulley
Hi Henrik,

I'm wondering if you can assist me in the undeletion of artist/drummer Tim Alek Mulley. Also, im wondering how we can get some editors to help build that page correctly. This is a notable artist, one i worked with years ago as a manager. I tried building a wikipedia page for his accolades but it never turned out how i intended. Any assistance would be appreciated.

cheers, Niles — Preceding unsigned comment added by Nilest (talk • contribs) 13:01, 17 January 2013 (UTC)

You've got mail
Mungo Kitsch 21:25, 19 January 2013 (UTC)

pageviews statistics tool
Hello Henrik, the "pageviews statistics tool" stops at December 2010. Only single page statistics can be viewed for the current month. Best regards. 87.171.80.89 (talk) 13:35, 21 January 2013 (UTC)

stats.grok.de
I noticed that when you click on "Top" (at least for it.wikipedia.org) the statistics are for December 2010. Is there anything newer? TIA --.mau. &#x2709; 10:12, 22 January 2013 (UTC)  — Preceding unsigned comment added by .mau. (talk • contribs)
 * See FAQ, User:Killiondude/stats. --Nemo 15:09, 23 January 2013 (UTC)

Article traffic statistics
Hello, could you please add Wikivoyage to your pageview statistics tool? Thanks. – sumone10154 ( talk ) 00:43, 22 January 2013 (UTC)
 * No, he can't he already does, see bug. --Nemo 15:09, 23 January 2013 (UTC)

The Signpost: 21 January 2013

 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 00:25, 24 January 2013 (UTC)

Missing stats
Yesterday: Stats appeared to go only up to part way through January 1st. January 2nd - 0. December stats show OK on 90 day view.

Today: Stats for January 1st still showing. January 2nd & 3rd - both 0. The WHOLE of December now shows as 0 too on the 90 day view.

What's gone wrong? - 212.139.103.10 (talk) 01:00, 4 January 2013 (UTC)


 * It fixed itself a few days later. I guess some part of the process was running slowly. - 212.139.105.251 (talk) 22:56, 27 January 2013 (UTC)

90 days is not always 90 days
At 23h00 UTC each day, the 90 day graph drops back to showing just 89 days worth of visitor figures. Some time after 00h00 UTC (sometimes mere minutes, often several hours, and occasionally well into the next afternoon or evening) the graph returns to showing 90 days data again, with "yesterday"s figures finally added.

Is there any way that the "90 day" caption could be amended to say "89 days" during the period that is the case? Dividing the visitor total (which is always the correct summation of all the numbers visible in the bargraph, whether 89 or 90 days are shown) by 90 gives an incorrect lower average for at least one hour, and often several hours, every day.

I assume the same holds true for the 60 and 30 day versions, too.

Additionally, how easy (or difficult!) would it be to make the leading edge of, say, all of the Monday bars a different colour, or to add a very thin coloured line between the Sunday and Monday bars, or shade the Saturday and Sunday bars differently? For pages with a regular peak and trough of visitor numbers, it would be useful to see which days of the week those are on and whether that changes over time. - 81.157.177.5 (talk) 23:13, 26 January 2013 (UTC)

The Signpost: 28 January 2013

 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 19:27, 30 January 2013 (UTC)

The Signpost: 04 February 2013

 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 03:07, 6 February 2013 (UTC)

Wikipedia article traffic statistics
Hallo Henrik, what the matter that the views to articles in Wikipedia on July 12 and 13 are not countet? Kindest regards (Lothar Spurzem) -- 80.144.249.207 (talk) 19:48, 14 July 2011 (UTC)

Statistics
Hi! I'm Nicolai from the Faroese Wikipedia. I just found the website http://stats.grok.se/ with statistics over visited Wikipedia-articles on several Wikis. How come the Faroese Wikipedia isn't included, and what can I do to include it? Niceley 01:08, 28 July 2011 (UTC)

Regarding using the Pageview Statistics Tool
Hey Henrik, thanks for your Pageview Statistics Tool, it's been very interesting to use.

My name is Danny Lewis and i'm the Project Manager of an analytics tool. We are planning on bringing through wikipedia page view analysis data. We have investigated using the raw data you provide but it is an impractical option for us given the sheer scale of the data and the specific information we are interested in (it's a very small percentage of the big picture) I'm wondering if it's safe to use your Pageview Statistics Tool? How likely is it that you will take the tool down? Would there be a usage quota if we were to use it on a regular basis?

Best regards,

Danny Lewis -- Dannyjlewis (talk) 10:21, 15 January 2013 (UTC)

Stats delay?
http://stats.grok.se/ is not working right now. I've mentioned this in WP:VPT. --George Ho (talk) 02:38, 10 February 2013 (UTC)

Unable to access stats.grok.se
Hi, from my IP address (82.35.252.27) I cannot access your tool at stats.grok.se. I was just wondering whether it's possible that my IP is blocked in some way, because from other places I can access your site!

Any help much appreciated,

Thanks

Bryan — Preceding unsigned comment added by Lydgate (talk • contribs) 17:52, 11 February 2013 (UTC)

What article rank means exactly
Dear Henrik,

The http://stats.grok.se webpage was very useful for me. I write a dissertation about physics in education, and this page helped me to confirm my statements about the pages I wrote. However I do not know, what is the exact mean of the statement like this "This article ranked 285 in traffic on hu.wikipedia.org." It not always accords the number of hits/90 days. I think that it tooks account a more longer period. How long is this interval? Is there anything else to know to analyze these numbers?

Thanks.

Harp (talk) 09:12, 19 January 2013 (UTC)


 * For articles in the top 5,000 it appears that the rank is quite different from that shown here, maybe the rank is not being updated? The pageview numbers also can be quite different, as mentioned here (examples: Computer virus stats, Antivirus software stats, Internet safety stats, Internet security stats, Comparison of Android devices stats, and Linux stats). LittleBen (talk) 03:12, 26 January 2013 (UTC)
 * See Wikipedia talk:5000#Differences with Henrik's tool. It is my belief that Henrik's tool has a bug of some kind that causes this discrepancy. Thanks, West.andrew.g (talk) 01:28, 28 January 2013 (UTC)
 * The alternate page view tool mentioned at the top of this page can be used as a sanity check for Henrik's tool. LittleBen (talk) 04:19, 15 February 2013 (UTC)

The Signpost: 11 February 2013

 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 09:21, 13 February 2013 (UTC)

Förbättring av bild
Hej Henrik! Jag fick syn på bilden nedan och tog bort så mycket som möjligt av blänket på målningen. Om du samtycker med förändringen föreslår jag att du uppdaterar bilden på Wikimedia.

Stormningen_av_Köpenhamn_11_feb._1659.jpg behandlad

Stormningen_av_Köpenhamn_11_feb._1659.jpg orginal

Joeghurt (talk) 16:43, 15 February 2013 (UTC)

The Signpost: 18 February 2013

 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 20:14, 20 February 2013 (UTC)

Fractional visitor numbers.
For a page with very few visitors, the Y axis is sometimes labelled with fractional visitor numbers.

Can it be re-programmed such that the top of the Y axis is never less than 6? -- 86.151.156.246 (talk) 20:51, 23 February 2013 (UTC)

stats.grok.se
Hi Henrik,

I'm from wp.min a new Wikipedia Minangkabau, can you add Minangkabau to the list of your tool "stats.grok.se"? Thanks in advance.  Ę-oиė   >>>   ™ 13:17, 26 February 2013 (UTC)

Using "?" in stats
The "?" is not computed well. Take this, for example. After typing just "?", the results omitted "?" However, if "%3f" is typed, the results... I can't find words to describe. --George Ho (talk) 05:29, 24 February 2013 (UTC)
 * Each redirect to an article has a separate pageview stats page, and often the sum total of traffic to all the redirect pages is greater than the pageview traffic to the article title page (this means that pageview traffic to the article title page does not include pageview traffic from redirects). Note that the same reasoning (that separate URLs have separate pageview scores) also applies to Kyōto Station and Kyoto Station. It seems that each variation is counted as a separate URL, with a separate pageview count, because "?" (which is stripped) and "%3f" are treated as different characters. (? is a special character used to prefix parameters -- search engines add the question mark and search engine keywords used when the pageview is a referral from a search engine, however "%3f" is an escaped question mark and doesn't have the meaning of a parameter prefix). LittleBen (talk) 07:39, 27 February 2013 (UTC)

The Signpost: 25 February 2013

 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 06:52, 28 February 2013 (UTC)

The Signpost: 04 March 2013

 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 22:29, 7 March 2013 (UTC)

Quick pageview stats tool question
Apologize if this is a stupid question (couldn't see it in the FAQs), but is the tool counting unique pageviews or just pageviews? Thanks - Brycehughes (talk) 22:03, 11 March 2013 (UTC)

page view statistics (stats.grok.se)
hi

i've found your page-view statistics tool (http://stats.grok.se/) extremely useful. i'm looking to create a local dump of the statistics so i can run a few queries on it.

i downloaded the raw data from http://dumps.wikimedia.org/other/pagecounts-raw/ but ran into a few character encoding issues while trying to parse it. specifically, in the second column (i.e., the title of the requested page) i'm getting entries such as %D0%90%D0%B6%D1%8C%D0%B0 and Cookie_\x00\x00. this is with locale set to utf-8.

seeing how your traffic statistics visualizer uses the same data, could you help me in figuring out what the right character encoding ought to be and/or share the script(s) you used to parse the data?

thanks

-- Suhail - 66.152.64.226 (talk) 16:36, 12 March 2013 (UTC)

The Signpost: 11 March 2013

 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 09:25, 13 March 2013 (UTC)

The Signpost: 18 March 2013

 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 09:08, 22 March 2013 (UTC)

Can i create this page or not?
Hello there, i have been trying to create a page for the director Mr Antony Hickling which I see has been deleted by you. Before creating a new one i wanted to check with you if there is enough new evidence of his work and notoriety before continuing. I found articles in French aswell but am not sure if that counts.

Interview with Attitude Magazine "http://www.attitude.co.uk/viewers/viewcontent.aspx?contentid=3270&catid=culture&subcatid=film&longtitle=ANTONY+HICKLING+INTERVIEW"

Interview The current for BFI London : "https://thenewcurrent.jux.com/1047736"

Article Bent Mag : http://mag.bent.com/2013/03/little-gay-boy-christ-is-dead/

Article Gay Times "http://www.gaytimes.co.uk/Interact/Blogs-articleid-9493-sectionid-705.html"

Jury at the Forum des Images cinema France " http://cheries-cheris.com/jury.html"

Film at BFI London "https://whatson.bfi.org.uk/llgff/Online/queer-provocations"

The list goes on. Can i ask your advice on whether i should proceed on not.

Many thanks

J. - 109.156.199.246 (talk) 10:20, 23 March 2013 (UTC)

http://stats.grok.se Data Before 2007
Dear Henrik,

I am a researcher at Stanford University looking at how Crusade history has become popular in the U.S. and the Middle East after 9/11, the Iraq War, and other major events between the regions. Aside from films, novels, and other websites, I am interested to see if there is time and gps-specific data for Wikipedia articles so that I could ask, for instance, if there was a spike in visits of articles about the Crusades at certain times, and to see if the numbers of those visits were higher in different countries or locations.

Do such data logs exist? If so, is it possible to access them so that I could ask these questions? I know that your page access statistics by page per day are available from 2007. Is there any data available before this date? And is there location information for site visitors and editors?

Many thanks,

Brian Johnsrud johnsrud at stanford dot edu — Preceding unsigned comment added by 128.12.208.5 (talk) 18:48, 25 March 2013 (UTC)

Table of stats
Hi Henrik,

I would like to create automatic tables that shows the access of each article. To create manual tables is complicated, and else impossible. Well, I wanna know whether be an way to get the value of access directly from the server. Eg.:


 * Manually


 * Automaticly

It's possible? There are any code to do this?

Answer me as soon as possible.

Friendly,

Imagens SM (talk) 04:45, 26 March 2013 (UTC)

Wikipedia Pageview Statistics
Dear Henrik,

I am currently retrieving pageview statistics for 2011 from stats.grok.se. It appears that a small fraction of the files, more specifically the statistics from 08/10/2011 18:00-22:00 are missing. The files linked on http://dumps.wikimedia.org/other/pagecounts-raw/2011/2011-10/ for the respective hours are not valid gzip archives.

Is there any chance to retrieve the correct data? Thank you very much in advance for your efforts.

Kind regards, Stephan Seufert — Preceding unsigned comment added by 139.19.4.201 (talk) 09:54, 28 March 2013 (UTC)

The Signpost: 25 March 2013

 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 00:54, 29 March 2013 (UTC)

Bots affecting page view numbers
Thanks for your excellent page view tool Henrik :-) I wonder if the tool registers another hit for a wp page if a wp bot runs through that page. If I see that a page got a hundred views yesterday, is there any way of knowing for sure that -- say -- ninety page loadings were from a web-browser (and therefore a human was probably reading) and ten loadings were by wp bots? jonathan riley (talk) 23:13, 30 March 2013 (UTC)

No April stats yet?
I see no updates on 1 April 2013 yet. Is there an explanation for this? --George Ho (talk) 04:23, 2 April 2013 (UTC)

http://stats.grok.se/
Hi there. Is your fantastic website, http://stats.grok.se/ still running? It wasn't working for me earlier today! — Preceding unsigned comment added by Woodlandscaley (talk • contribs) 14:27, 2 April 2013 (UTC)

The Signpost: 01 April 2013

 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 15:18, 5 April 2013 (UTC)

The Signpost: 08 April 2013

 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 09:20, 10 April 2013 (UTC)

stats.grok.se bug?
Hi Henrik, thought it might interest you: these two  report the same numbers for two different pages. Their names only differ in case L/l, and L used to redirect to l. Reg'ds Littledogboy (talk) 12:49, 14 April 2013 (UTC)

Hi Henrik: Other strange things have recently happened on stats.grok.se. For example, the daily number of reported viewings of the Parabola page suddenly dropped in late March from more than 2000 on weekdays (fewer on weekends) to about 500, and has stayed around 500 ever since. I find it hard to believe that this sudden decrease is real, especially since other related pages, such as Hyperbola, have stayed more or less constant. Any ideas? Cheers. DOwenWilliams (talk) 18:36, 14 April 2013 (UTC)

I cannot enter the statistic site from certain from my home network at any computer
i cannot enter the statistic site from certain from my home network at any computer could you suggest me what to do? any configuration need to be checked?

(it is not happening from this IP/computer where i write the messege, here i can connect grok.se)

Thanks Yuval — Preceding unsigned comment added by Yuvalshafriri (talk • contribs) 09:21, 15 April 2013 (UTC)

The Signpost: 15 April 2013

 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 22:27, 17 April 2013 (UTC)

The Signpost: 22 April 2013

 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 14:47, 25 April 2013 (UTC)

The Signpost: 29 April 2013

 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 08:20, 2 May 2013 (UTC)

stats.grok.se: vanished data?
Hi Henrik, there are some complaints about stats.grok.se in german Wikipedia. There are no statistics of April 2013, or they are now vanished (at least one user said, there was data for April during April). Seems a general problem: enWP Elephants March 2013, enWP Elephants April 2013; deWP Elefanten March 2013, deWP Elephants April 2013. Can you please check this? Regards --Schniggendiller talk  12:33, 5 May 2013 (UTC)
 * Actually it appears that ALL the stats for April are missing on en-wiki, including the Main page Ottawahitech (talk) 14:43, 5 May 2013 (UTC)


 * Sorry about that, it was a database compactation job which crashed midway through and left a corrupted copy. The data should be back now. henrik  • talk  17:22, 5 May 2013 (UTC)
 * Yes, all data seems to be back. Thank you very much! Regards --Schniggendiller talk  22:44, 5 May 2013 (UTC)

Page popularity data
Henrik, First of all thank you for providing such a simple way to access the Wikipedia statistical data. We are a web design firm in New York City and are currently working on a project a portion of which involves accumulating lists of popular wikipedia articles or getting the popularity of an article. We see that there is a way to access this information in JSON format via your website. However, before doing so we wanted to make sure that this was an acceptable thing to do, or if not, there was some other way you could provide the data for us. The reason being is that we would be making the requests directly from our servers in PHP, so there may be quite a lot being made per unit time. We also know that Domas is providing the data, but we would very much prefer to use your format. Please let us know if this is a possibility, or if you don't mind us making requests to your server at a reasonable rate for a little while.

Again, thank you very much, and we appreciate what you have done for the community.

Steve + NoFavorite team

My email is steve@nofavorite.com. Please CC dmitry@nofavorite.com as well.

Thanks again, -Steve 208.105.82.85 (talk) 21:50, 6 May 2013 (UTC)

The Signpost: 06 May 2013
<div class="hlist" style="margin-top:10px; font-size:90%; padding-left:5px; font-family:Georgia, Palatino, Palatino Linotype, Times, Times New Roman, serif;">
 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 03:37, 11 May 2013 (UTC)

stats.grok.se/az/top
Hello, Henrik! Don't you know, when this statistics is going to be refreshed? --Мурад 97 (talk) 16:29, 14 April 2013 (UTC)
 * Many projects are very interested in this! --Nemo 09:38, 11 May 2013 (UTC)


 * Updated now! henrik  • talk  19:26, 11 May 2013 (UTC)

stats.grok.se code
Hello Henrik, nice to see you active! In case you were not told, it seems the WMF may be interested in hosting a copy of stats.grok.se, waiting for a proper solution to be implemented for data reusers. Is your code hosted somewhere? I would also be interested in insights on what are the minimum hardware requirements for a DB hosting the data in a way that can be used to generate reports such as WP:5000. Thank you very much, Nemo 09:42, 11 May 2013 (UTC)


 * Hi Nemo! Yeah - the code is on github (https://github.com/abelsson/stats.grok.se). Though Diederik already knows where this code is, he helped write some of it :) HW requirements would depend a bit on how you would code it up, but a good implementation should work well on a decently modern server. For reference, stats.grok.se is running on a three year old computer with a ~2.5GHz processor and 12GB of ram.  henrik  • talk  09:56, 11 May 2013 (UTC)


 * Thanks! With how much disk space? And how much does it take to produce the "top" charts? --Nemo 19:39, 11 May 2013 (UTC)


 * 8 TB. I don't know exactly how much disk it would take to produce top charts, it depends on your implementation (the actual lists are not large, but you need to crunch a lot of data to get them). henrik  • talk  19:51, 11 May 2013 (UTC)


 * Thanks! I meant more CPU time with your code: I suppose that's the main bottleneck? Or maybe RAM depending on the implementation. --Nemo 09:31, 13 May 2013 (UTC)


 * For me, I/O (=hard disk speed) is definitely the limiting factor. I wish I could afford 8 TB of SSDs, then I could really do something fun with the stats. :) henrik  • talk  18:30, 13 May 2013 (UTC)


 * Ah! Well, how much would that cost, 6000 $? Looks far from impossible, you could try asking a grant. :) It would not be hard to find someone helping you write the application and a few hundreds users signing it. :p --Nemo 06:48, 14 May 2013 (UTC)

stas.grok
Working with Translators Without Borders to translate key medical articles in other languages. We have so far completed about 200 as listed here We have received funding to help with the work in Swahili and are wondering what impact it is having. Do you know if there is a way to get page views for articles in Swahili? Doc James (talk · contribs · email) (if I write on your page reply on mine) 20:33, 11 May 2013 (UTC)


 * Replied on your talk.  henrik  • talk  20:41, 11 May 2013 (UTC)
 * Do you know if these numbers include mobile? Doc James  (talk · contribs · email) (if I write on your page reply on mine) 22:25, 11 May 2013 (UTC)


 * I belive so, but I'm not 100% sure. Ask the WMF guys. henrik  • talk  18:31, 13 May 2013 (UTC)
 * Do you know who at the WMF would know? Doc James  (talk · contribs · email) (if I write on your page reply on mine) 18:37, 13 May 2013 (UTC)
 * Ask on analytics. --Nemo 06:49, 14 May 2013 (UTC)

Tracking links to Wiktionary
Hello - first off, thanks so much for the stats.grok page. I use it all the time, both for curiosity, and to try to improve Wikipedia. Along those lines, I was wondering if you knew how to track traffic stats for Wiktionary. Both 1) actual Wiktionary page traffic stats, and 2) links via template from a Wikipedia page to a Wiktionary entry (i.e. how many times gets clicked from the Dictionary (disambiguation) page). Appreciate any help! Dohn joe (talk) 23:25, 13 May 2013 (UTC)


 * Hi! For 1) User:Killiondude/stats (example link: http://stats.grok.se/en.d/latest/gregarious), for 2) clicks there will be tracked as a normal visit, but there's no way to distingush those from other referrers. henrik  • talk  07:00, 14 May 2013 (UTC)
 * Ok - thanks for the info! Dohn joe (talk) 15:32, 14 May 2013 (UTC)

The Signpost: 13 May 2013
<div class="hlist" style="margin-top:10px; font-size:90%; padding-left:5px; font-family:Georgia, Palatino, Palatino Linotype, Times, Times New Roman, serif;">
 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 04:11, 16 May 2013 (UTC)

Log of https pageviews resumed 14 May 2013
The pageview data logs, such as for stats.grok.se, have been fixed (at 18:44, 14 May 2013) to re-enable the https/ip6 stream to webstatscollector, where Google https-protocol links, for over 300 major articles (see stats: 201305/Email or 201305/Parabola or 201305/Shakira, and thousands of wikilinked pages), had been 55%-80% under-reported during late March, April and early May (see essay: wp:Google https links). The typical pageview counts, from March 2013, have resumed in pageviews, 2x-3.5x times higher for https-prefix pages/images, during 15 May 2013. German WP pageviews are also fixed (see stats: /de/201305/Euklidischer Raum or /de/201305/Oval). All https page requests had been omitted during 26 March 2013 to 18:44, 14 May 2013, and so there will be permanent low spots in the pageview stats of some pages during those 50 days (~7 weeks), for various articles, images, talk-pages, templates or categories which were viewed mostly via https-protocol links on some of those 50 days. Many thousands of pages/images were not affected, and those pageviews will seem relatively stable during that 50-day period. As of 15 May 2013, the http/https pageviews have been re-confirmed to log exactly "to the penny" and so, if a page/images was viewed 16x times during a day, it will show a total of exactly 16 pageviews for that day. -Wikid77 (talk) 05:17, 16 May 2013 (UTC)

"Page view statistics" for Wikiquote
Dear Henrik, do you think it would be possible to also have "stats.grok.se" article traffic statistics on Wikiquote? How should one go about implementing it? Thanks much ~ DanielTom (talk) 20:53, 15 May 2013 (UTC)


 * If you'd like it linked in the same place as enwp (under the history tab on all pages), have an admin edit http://en.wikiquote.org/wiki/MediaWiki:Histlegend and add something like Page view statistics
 * henrik • talk  06:04, 16 May 2013 (UTC)


 * Hi Henrik, another Wikiquotian here. Thank you very much for your help with this. I am delighted to learn that the stats.grok.se dataset includes sister projects like Wikiquote! I notice that, as indicated at User:Killiondude/stats, several components of the report (title, link to article, and interactive selector) are hard-coded for Wikipedia, and I wonder if there is any chance of enhancing the report for better presentation of statistics on sister projects. In particular:
 * The report title refers to "Wikipedia article" in all cases. It might be better to name the language and project to which the report pertains, e.g. "English Wikiquote article". Alternatively, the title could be shortened to something generic like "Article traffic statistics" and the specific context could be identified beneath it.
 * The link to the subject article does not work for sister projects. E.g., for the English Wikiquote, the domain "en.q.wikipedia.org" does not exist and does not resolve to the correct domain "en.wikiquote.org".
 * The interactive selector at the bottom of the report would provide better access to the data if one could select by project.
 * I have some reservations about adding the tool to Wikiquote's interface in its present state; but I imagine these enhancements would not be difficult to implement with some lookup tables. Is this something you would be interested in doing?
 * Thanks, Ningauble (talk) 12:54, 16 May 2013 (UTC)
 * Hi Ningauble! Yes, the database actually has statistics for all the sister projects for several years. but the user interface has unfortunately never really shown it.
 * good point - fixed.
 * actually it did work for most sister projects - except that I had forgotten to add wikiquote. Also fixed now.
 * I need to restructure some things to do a separate project / language selectors, hold on a bit, but I'll get that fixed too.
 * Thanks for your comments, useful (and warranted) comments for improvements. henrik  • talk  19:29, 16 May 2013 (UTC)
 * Thank you very much for these improvements. You are awesome! One tangential question: Is the data at http://stats.grok.se/en.q/top current? The FAQ indicates that it is not currently being updated, but the report displays a current as-of date (and the title says "Wikipedia"). ~ Ningauble (talk) 11:18, 17 May 2013 (UTC)


 * Ah, one more place to fix the title. Yes, the top list is updated and current - this time it's the FAQ that is outdated. :) henrik  • talk  15:12, 17 May 2013 (UTC)
 * Cool. It is a very interesting report. ~ Ningauble (talk) 15:25, 17 May 2013 (UTC)
 * Thanks from me as well! ~ DanielTom (talk) 10:31, 20 May 2013 (UTC)

page views
Hi Henrik

Great work on http://stats.grok.se/

Would you be interested in working with us to get some of these time series into www.quandl.com?

thanks Tammer tammer@quandl.com — Preceding unsigned comment added by 205.197.156.6 (talk) 10:57, 18 May 2013 (UTC)

Page view stats for foundation wiki
Hi Henrik, I read the FAQs for your page view tool but couldn't find an answer for my question there. If possible I'd like to request that stats for foundationwiki (www.wikimediafoundation.org) are also added to the tool - are these stats available? Thanks! The helpful  one  19:40, 16 May 2013 (UTC)


 * Hm, I don't know - it's not immediately obvious to me which one of the following would correspond to the foundation wiki. Ask Domas if it's included in the dumps? henrik  • talk  20:02, 16 May 2013 (UTC)

+-+ +-+ +-+
 * project |
 * en.b   |
 * en.d   |
 * en.f   |
 * en.mw  |
 * en.n   |
 * en.q   |
 * en.s   |
 * en.v   |
 * en.voy |
 * en.wd  |
 * Thanks for your prompt response! I think looking from https://gerrit.wikimedia.org/r/gitweb?p=analytics/webstatscollector.git;a=blob;f=filter.c;h=907636cbfd5acc986b4fdc34f1aa73c733ae5704;hb=HEAD the en.f one is for foundation wiki. Would you be able to add it to the drop down in the interface too? The  helpful  one  16:50, 17 May 2013 (UTC)
 * Where does the en.f come from? It doesn't make sense, the first code is always the subdomain so it must be www.f. I don't find any en.f in the raw data, while www.f works, except that counts are very low: http://stats.grok.se/www.f/latest30/Home --Nemo 07:59, 25 May 2013 (UTC)

The Signpost: 20 May 2013
<div class="hlist" style="margin-top:10px; font-size:90%; padding-left:5px; font-family:Georgia, Palatino, Palatino Linotype, Times, Times New Roman, serif;">
 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 09:38, 23 May 2013 (UTC)

Dumb question about the statistics tool
I hate to bother you with such a trivial question but I was just kind of curious about something. I was using the statistics tool and saw there was a link to the raw data, and now I'm totally confused. Does the tool really compile 24 separate ~300mb (post-extraction) files every single day? Also, I downloaded and extracted one of the files and they make no sense when taking the tool into consideration.

en Main_Page 385981 16005625905

I'm assuming the number 385,981 includes all possible server requests for this page. This is just from one file/hour, I can't imagine what the sum of the numbers from the 24 files would be, but it's fair to assume it would be under 8,717,062 – the number of views for this day according to the tool. How does the tool only count page views, and not all requests as printed in the raw data files? Do you subtract some other number from this data? Is there some other algorithm? Does the tool not use these specific files? Thanks in advance. Scarce2 (talk) 23:45, 23 May 2013 (UTC)
 * Some of your questions are answered on the FAQ, the others don't make any sense. ;-) --Nemo 07:48, 25 May 2013 (UTC)


 * Hi Scare2. Yes, roughly 7GB of raw data is added and processed every day. Each of the files represent one hour of traffic. I'm a little bit confused why you think 8.7 million views is unrealistic when the hour you sampled has a bit less than 400k views( 385,981 * 24 = 9,263,544). henrik  • talk  09:33, 25 May 2013 (UTC)
 * Duh, I'm so stupid. Thanks for the reply. Scarce2 (talk) 21:18, 25 May 2013 (UTC)

stats.grok.se and Wikisource
Hi! Probably somebody has already asked it before but... Is it possible to add support for Wikisource for the statistics tool? --DixonD (talk) 11:48, 28 May 2013 (UTC)
 * Indeed it was already asked, and there is already: User:Killiondude/stats. --Nemo 12:53, 28 May 2013 (UTC)

How many concurrent requests do you allow?
Hi Henrik,

I would like to access your traffic stats with a script. I have written it so it won't fire more than 20 requests at a time, but I will need the stats for > 200000 pages. I have just been testing with a small set of pages, and the response is very quick. Still, I don't want to mess up things on your side.

Unless there is an error during execution, I will probably need to do this only once.

Is it ok with you if I let the script run?

thanks,

Rob — Preceding unsigned comment added by Phnaargos (talk • contribs) 09:44, 27 May 2013 (UTC)


 * Hm, 20 requests at a time would consume nearly all the capacity of the server. I would be much happier if you stuck to 1-2 parallel requests and let it run over a few days instead. henrik  • talk  16:06, 28 May 2013 (UTC)


 * Ok, I'll stick to 2 parallel requests, tnx -- 77.250.75.189 (talk) 07:12, 29 May 2013 (UTC)

API for yearly page view data
Hi,

Thanks for your contributions to http://stats.grok.se/! I was wondering if you would be able to post a new API that would post data for a page for an entire year. So instead of looking at page views over a month or the latest 90 days you could gather all data for 2012 or up to today 2013. If you like please email me back at grehm87@gmail.com

Thanks,

Greg Rehm — Preceding unsigned comment added by 71.202.175.100 (talk) 06:51, 29 May 2013 (UTC)

making a batched request?
Is it possible to put multiple page titles in one request? I mean like the MediaWiki API, where you can add up to 50 page ids in one query.

cheers — Preceding unsigned comment added by Phnaargos (talk • contribs) 07:26, 29 May 2013 (UTC)


 * Nope, unfortunately not. henrik  • talk  07:49, 29 May 2013 (UTC)

No stats for 2013-05-28?
I have been editing WP:DYKSTATS for a while, and I haven't yet seen any updates lately. Is there a cause of delay? --George Ho (talk) 09:07, 29 May 2013 (UTC)


 * Should be there in an hour or so, I hope. Sorry! henrik  • talk  09:12, 29 May 2013 (UTC)

The Signpost: 27 May 2013
<div class="hlist" style="margin-top:10px; font-size:90%; padding-left:5px; font-family:Georgia, Palatino, Palatino Linotype, Times, Times New Roman, serif;">
 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 09:43, 31 May 2013 (UTC)

Server error 2013-05-28
stats.grok.se/en/latest/Xilinx gives "internal server error". Guess there is a problem with the server application. Electron9 (talk) 15:27, 28 May 2013 (UTC)


 * Crap. Ran out of disk, hold on. henrik  • talk  15:52, 28 May 2013 (UTC)


 * This has happened again today. Electron9 (talk) 18:30, 6 June 2013 (UTC)

corporate social entrepreneurship
Hello Henrik,

Once again, thank you very much indeed for the statistics facility.

What I need to do, though, is count the total number of views of the corporate social entrepreneurship page since I created it back at the beginning of 2010. I don't want to add up the monthly views by hand. I have a notion that the total views exceed 17,000 but I need to check whether or not this is correct. Can I manipulate the data on screen to do this, please? It's a query that I will keep repeating. Thank you.

Best wishes, Christine Hemingway Chemingway (talk) 09:04, 5 June 2013 (UTC)

Internal Server Errors
Hi Henrik. I hope you are well. It's been nice to see that you're more active lately. Today the stats site is throwing a server error when attempting to retrieve information. See this test. Killiondude (talk) 17:15, 6 June 2013 (UTC)


 * Same here. Electron9 (talk) 18:30, 6 June 2013 (UTC)


 * Fixed by restarting, but I need to figure out what went wrong here. henrik  • talk  19:48, 6 June 2013 (UTC)

The Signpost: 05 June 2013
<div class="hlist" style="margin-top:10px; font-size:90%; padding-left:5px; font-family:Georgia, Palatino, Palatino Linotype, Times, Times New Roman, serif;">
 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 00:42, 7 June 2013 (UTC)

2013-06-06 stats missing?
I don't see yesterday's hooks coming up yet. Is there a delay? --George Ho (talk) 08:17, 7 June 2013 (UTC)

Same problem... Is there a delay ? Thanks a lot for the answer. Best regards — IP, 7 June 2013 — Preceding unsigned comment added by 84.99.243.70 (talk) 09:27, 7 June 2013 (UTC)


 * Try it now. henrik  • talk  11:52, 7 June 2013 (UTC)

Top 100 by Year for stats.grok.se/
Henrik, thank you for providing access to this rich data source. I am interested in getting access in the Top 100 or Top 500 Most visited pages for the years 2012, 2011, 2010 and as far back as is possible for EN and other languages if possible. Please advise. infovis Infovis (talk) 18:45, 7 June 2013 (UTC)

No Views
Is there a reason why when i search for Kinky Boots (musical)‎ it comes back with 0 page views. . Blethering  Scot  23:18, 7 June 2013 (UTC)
 * Remove the %E2%80%8E from the end of the stats URL. 79.67.245.117 (talk) 07:34, 8 June 2013 (UTC)
 * Thanks. Why does it generate that when you just enter the page name.? Blethering  Scot  10:02, 8 June 2013 (UTC)


 * It doesn't for me, but perhaps your browser is doing something strange. Which browser are are you using? henrik  • talk  11:34, 8 June 2013 (UTC)
 * Safari. It doesn't happen on all pages but have had the problem a few times. Blethering   Scot  12:37, 8 June 2013 (UTC)


 * The text in the first line of this section " Kinky Boots (musical)‎ " has a U+200E Left-to-right mark after "(musical)". That character percent encodes as %E2%80%8E. Some log pages like user contributions add that character after page titles so if you copy-paste a title from a log page like then you may accidentally include a left-to-right mark. I suppose stats.grok.se could strip a trailing left-to-right mark but I don't know how common the problem is. PrimeHunter (talk) 23:42, 13 June 2013 (UTC)

The Signpost: 12 June 2013
<div class="hlist" style="margin-top:10px; font-size:90%; padding-left:5px; font-family:Georgia, Palatino, Palatino Linotype, Times, Times New Roman, serif;">
 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 08:54, 14 June 2013 (UTC)

Page Views
Hello Henrik,

while surfing wikipedia, sometimes i use your data from pageviews for views like this:



Do you think, it is possible to include the last view (see source code on commons) in your report. --LoKiLeCh (talk) 21:35, 19 June 2013 (UTC)

The Signpost: 19 June 2013
<div class="hlist" style="margin-top:10px; font-size:90%; padding-left:5px; font-family:Georgia, Palatino, Palatino Linotype, Times, Times New Roman, serif;">
 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 23:08, 20 June 2013 (UTC)

internal server error - again
Your server returns internal server error again ;-) out of disk? Electron9 (talk) 03:13, 24 June 2013 (UTC)

Seems it's working again.... Electron9 (talk) 03:19, 24 June 2013 (UTC)

I was slightly optimistic. It works most of the time.. Electron9 (talk) 06:18, 24 June 2013 (UTC)


 * No, not out of disk this time :) Someone was doing a very large volume of queries against the server very rapidly - that may have been the cause of sporadic errors. henrik  • talk  19:25, 24 June 2013 (UTC)


 * Suggestion.. if( last 10 visits == IP + coockie less than 1 second ) { print "Norty norty!!\n"; } else { print $stats; } .. ;-) Electron9 (talk) 23:19, 24 June 2013 (UTC)

Email
--Itzike (talk) 14:09, 25 June 2013 (UTC)

stats.grok.se: Page views over last 12 months ?
Hi Henrik, as you know, internet traffic is very seasonal. It changes a lot from a month to another. It would be really great if you could add a "last year" clickable option after the "last 90 days" one. It actually takes hours to do it by hand clicking on the last 12 months for each entries, especially when we want to compare datas between various languages as I would like to. This is only a tiny tweek in the SQL query after all! Thanks a lot for your work! Metropolitan (talk) 23:04, 8 May 2013 (UTC)


 * True, it's only an SQL tweak to get more data. The reason it's been limited to 90 days is performance and also that I need to change the graph to something different - I don't think a bar graph with 365 bars would look good. But I agree it would be useful. I have a few hours to kill today, so I'll do some experimenting. henrik  • talk  05:55, 9 May 2013 (UTC)


 * Well.. here's an initial test: http://stats2.grok.se/en/latest_year/Zoo you can play with. It's indeed a bit slow and the graph isn't that great. Hm. henrik  • talk  06:28, 9 May 2013 (UTC)


 * Oh many many thanks Henrik this is so great! I never imagined you would react so fast! I'll check some stats now so this may help you to see if it affects too badly performances. Metropolitan (talk) 13:02, 9 May 2013 (UTC)


 * Henrik, just so that you know, I've collected statistics about Wikipedia pages views regarding world sports teams articles in 10 world languages over the last 365 days (English, Spanish, German, French, Portuguese, Italian, Russian, Japanese, Chinese and Arabic). If you're curious of the results, here there are: http://footinter.free.fr/world-sports-teams-wikipedia-audience.gif
 * I couldn't have done that without you. Thanks again. Metropolitan (talk) 10:01, 15 May 2013 (UTC)


 * Cheers! Fun infographic! henrik  • talk  18:22, 15 May 2013 (UTC)


 * This is almost the answer to my prayers - at just the right time too! However, I just noticed that the latest_year data is not available in JSON format. I guess you already have the data to generate the graph, would it be too difficult to implement a json version so that it can be retrieved? Many thanks! Rohan 17:21, 5 June 2013  — Preceding unsigned comment added by 182.64.7.201 (talk)


 * The 12-month graph would be so much more readable and useful if the bars were only 2 or 3 pixels wide. Is that easy to change? -- 79.67.247.248 (talk) 07:53, 15 June 2013 (UTC)

Hi Henrik, it would be very helpful if you provide the new feature for the last 365 days also as jsonFormat. This doesnt work yet. — Preceding unsigned comment added by 92.78.129.18 (talk) 10:50, 26 June 2013 (UTC)

stats.grok.se
Will Wikidata be added? -- Ricordi  samoa  22:00, 22 June 2013 (UTC)
 * Any updates? -- Ricordi  samoa  13:44, 27 June 2013 (UTC)

stats.grok.se move to mw-labs?
Hey there! I'm wondering if you have considered moving your tool to mw-labs (the toolserver replacement). Perhaps this would enable the option for you to be able to get it so that people can easily report bugs for you on Bugzilla. I only mention it because it took me a while and some asking around to find you here. I wanted to report a bug that seemed to add around 300 pageviews to the count for any day when the tool was used to view the pageviews. This was a couple weeks ago, and it seems to have cleared up since then (good work). Anyways, have a nice day! Technical 13 (talk) 13:32, 27 June 2013 (UTC)

The Signpost: 26 June 2013
<div class="hlist" style="margin-top:10px; font-size:90%; padding-left:5px; font-family:Georgia, Palatino, Palatino Linotype, Times, Times New Roman, serif;">
 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 22:10, 27 June 2013 (UTC)

Unblocking statistics
Hi,

I'm using statistics API and I get : Too many requests, please limit your service to 1-2 requests per second and contact User:Henrik on wikipedia to be unblocked on every request how can I unblock this? Please feel free to contact me ...

Thanks — Preceding unsigned comment added by Idankoch (talk • contribs) 11:21, 30 June 2013 (UTC)

Receieving "Too many requests, please limit your service to 1-2 requests per second and contact User:Henrik on wikipedia to be unblocked"
Hi Henrik,

You were already contacted by Idan, a member of my development team. We're trying to access your excellent service over the past few days, but we're getting the error message in the subject. If we overloaded your system it was totally by mistake due to a bug in the system, we will fix that.

Please advise. My e-mail address is oren.shoham@gmail.com

Thanks,

Oren

62.0.6.28 (talk) 14:53, 2 July 2013 (UTC)

Countrywise breakdowns for article pageviews
Hello Henrik, wikipedia currently shows a time series of pageviews in the article statistics. Can we also get a breakdown of the views from each country ? Thanks and regards. I am invariant under co-ordinate transformations (talk) 13:02, 4 July 2013 (UTC)

The Signpost: 03 July 2013
<div class="hlist" style="margin-top:10px; font-size:90%; padding-left:5px; font-family:Georgia, Palatino, Palatino Linotype, Times, Times New Roman, serif;">
 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 00:07, 5 July 2013 (UTC)

Nightly run of 2013-07-07 stats failed?
I see no stats for 2013-07-07, is there any failure? Electron9 (talk) 07:46, 8 July 2013 (UTC)

The Signpost: 10 July 2013
<div class="hlist" style="margin-top:10px; font-size:90%; padding-left:5px; font-family:Georgia, Palatino, Palatino Linotype, Times, Times New Roman, serif;">
 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 09:33, 12 July 2013 (UTC)

The Signpost: 17 July 2013
<div class="hlist" style="margin-top:10px; font-size:90%; padding-left:5px; font-family:Georgia, Palatino, Palatino Linotype, Times, Times New Roman, serif;">
 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 18:16, 18 July 2013 (UTC)

The Signpost: 24 July 2013
<div class="hlist" style="margin-top:10px; font-size:90%; padding-left:5px; font-family:Georgia, Palatino, Palatino Linotype, Times, Times New Roman, serif;">
 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 21:59, 25 July 2013 (UTC)

Page View Statistics Question
Hello, could I ask for some clarification on the page view stats? I read on the FAQ list that page views counted are for both readers and editors, but if someone is using a bot to edit, does that still register as a page view? I apologize if this is a silly question, but I would be grateful for the answer! Thanks! KjkFromNC (talk) 17:17, 28 July 2013 (UTC)

The Signpost: 31 July 2013
<div class="hlist" style="margin-top:10px; font-size:90%; padding-left:5px; font-family:Georgia, Palatino, Palatino Linotype, Times, Times New Roman, serif;">
 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 04:15, 2 August 2013 (UTC)

Page Views Question
Hi Henrik,

Are page views being tracked differently now? I've seen a dramatic drop in page views for my page over the past week or so.

Thanks!

Miranda — Preceding unsigned comment added by 66.167.190.242 (talk) 12:47, 5 August 2013 (UTC)

No stats for July 23?
Just wondering. Thanks in advance, XOttawahitech (talk) 20:51, 24 July 2013 (UTC)
 * See . Legoktm (talk) 08:00, 25 July 2013 (UTC)
 * Thank you Legoktm - Does this mean someone is working on fixing this wiki-wide problem? XOttawahitech (talk) 14:06, 26 July 2013 (UTC)

Looks like it is still down 7/25/13, is there any update since the post and link yesterday? 99.140.180.101 (talk) 14:52, 25 July 2013 (UTC)
 * Day 3 with no stats.--TonyTheTiger (T/C/BIO/WP:CHICAGO/WP:FOUR) 04:21, 26 July 2013 (UTC)
 * I see that TonyTheTiger has posted the same question at the help desk - hopefully this report will be taken seriously by someone at  Wiki. XOttawahitech (talk) 14:08, 26 July 2013 (UTC)
 * File:Don't abbreviate as Wiki (English version).png 128.122.70.187 (talk) 14:09, 26 July 2013 (UTC)

Day 3 with no stats... Thanks a lot for some explanation. Best regards.

IP, 26 July 2013 — Preceding unsigned comment added by 84.99.243.241 (talk) 15:29, 26 July 2013 (UTC)


 * Trying to move discussion to: Village_pump_(technical), which is where such discussions are supposed to take place(?) XOttawahitech (talk) 15:39, 27 July 2013 (UTC)

page view statistics?
What happened to the page view statistics? Although the data has been collected the counts have not been reported since July 22. You are listed as the person to contact with any questions about the Beta version application that provides page view counts in the form of a bar graph. Have they been discontinued?

Thanks, — Preceding unsigned comment added by 98.118.177.145 (talk) 02:38, 27 July 2013 (UTC)


 * Henrik is a Missing Wikipedian, unfortunately. The discussion about the missing stats is here. XOttawahitech (talk) 15:45, 27 July 2013 (UTC)
 * Just removed Henrik from Missing Wikipedian, please find his last edit here. --Burkhard (talk) 09:46, 28 July 2013 (UTC)

Update for 7/28. Not sure Henrik is back since his last edit is on 7/25, and your post is 28 July. Someone has tried to restart Page view stats today for 7/24, and the numbers are abnormally low for total counts system wide, as if over half of the raw data packets are missing or lost, many data packets appear to be completely uncounted. Stats for Page view count for 7/23 were not even attempted for 7/23 for unstated reasons. 76.237.181.233 (talk) 13:59, 28 July 2013 (UTC)

page view statistics out of order
Really, I don't understand why the stats are Henrik's exclusive field ? Thanks a lot if someone knows the reason why it doesn't work. Best regards.

IP, 27 July 2013 — Preceding unsigned comment added by 84.99.243.241 (talk) 19:48, 27 July 2013 (UTC)
 * Henrik runs the site that shows the graphs. The raw figures are compiled elsewhere. -- 31.54.63.170 (talk) 19:46, 29 July 2013 (UTC)

Thanks. But "elsewhere", I don't understand... Where is elsewhere ? Best regards.

IP, 10 August 2013 — Preceding unsigned comment added by 84.99.243.241 (talk) 09:00, 10 August 2013 (UTC)


 * The bottom of all pages have the link About these stats. PrimeHunter (talk) 13:32, 10 August 2013 (UTC)

The Signpost: 07 August 2013
<div class="hlist" style="margin-top:10px; font-size:90%; padding-left:5px; font-family:Georgia, Palatino, Palatino Linotype, Times, Times New Roman, serif;">
 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 01:38, 10 August 2013 (UTC)

pageview statistics not updated since August 9
Can you please check why the pageview statistics are not updated since August 9 2013? Thanks. Eransgran (talk) 02:42, 11 August 2013 (UTC)

Confirmation of no updates on page stats for 2 days now system-wide. Repeat of issue from 2 weeks ago? 76.217.61.88 (talk) 03:29, 11 August 2013 (UTC)

Comments From the Useful Page Count Graphs
Greetings from your useful page count graphs.

This may be completely unexpected but would it not make for greater utility to have the page counts graphs printed on 7-day cycles graphs rather than simply groups of ten. If on the seven day cycles then this would correspond to weekly cycles. This would be even more useful since they could assist in understanding weekday versus week-end frequency counts.

If this sounds like it might be sensible, then the quick observation would be to use block periods as 35days-70days-105days, rather than the present 30-60-90 days. On the test cases I ran for myself (manually) aligning the evenly spaced vertical graph orientation lines looked best when they are aligned for starting on either Monday, or, Saturday (start of week-end) for the evenly spaced vertical graph orientation lines for the plotted points. Any thoughts? AutoMamet (talk) 02:45, 14 August 2013 (UTC)

The Signpost: 14 August 2013
<div class="hlist" style="margin-top:10px; font-size:90%; padding-left:5px; font-family:Georgia, Palatino, Palatino Linotype, Times, Times New Roman, serif;">
 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 11:46, 16 August 2013 (UTC)

Possible data glitch?
Is the 1.3 million views on 2013-04-13 for the IEEE 1284 article as seen here correct? or an indication of some kind of glitch? Electron9 (talk) 02:32, 19 August 2013 (UTC)

Question about data
Hi, Henrik,

I was looking at Wikipedia stats site and was wondering what software program can open a .gz file as my computer didn't recognize it. I was interested in looking at the raw data to see a Top 100 or Top 500 visited pages. Thanks in advance for any assistance you can provide. Liz Let's Talk 15:51, 20 August 2013 (UTC)
 * See gzip article. - 79.67.243.158 (talk) 22:24, 20 August 2013 (UTC)

The Signpost: 21 August 2013
<div class="hlist" style="margin-top:10px; font-size:90%; padding-left:5px; font-family:Georgia, Palatino, Palatino Linotype, Times, Times New Roman, serif;">
 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 08:26, 26 August 2013 (UTC)

Stats server clarification
An editor has pointed out at Talk:Chelsea Manning, with respect to the stats server returns, that "Chelsea vs. Bradley is 8652 vs. 3881. And because Bradley is a redirect to Chelsea, it's actually 4771 vs. 3881."

I want to be sure that I am understanding this correctly. If "Foo baer" is a title, and "Foobare" redirects there, and 100 people type in "Foo baer" when looking for the term, while 2000 people type in "Foobare", will the stats server show that "Foo baer" has 2,100 hits? Is that what is meant by the stats server FAQ statement that "redirects and moves will unfortunately split the statistics across two different statistics pages"?

Cheers! bd2412 T 12:28, 29 August 2013 (UTC)

The Signpost: 28 August 2013
<div class="hlist" style="margin-top:10px; font-size:90%; padding-left:5px; font-family:Georgia, Palatino, Palatino Linotype, Times, Times New Roman, serif;">
 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 15:50, 31 August 2013 (UTC)

The Signpost: 04 September 2013
<div class="hlist" style="margin-top:10px; font-size:90%; padding-left:5px; font-family:Georgia, Palatino, Palatino Linotype, Times, Times New Roman, serif;">
 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 00:07, 7 September 2013 (UTC)

Dramatic Stats Spikes when Editing
Hi Henrik -- thanks for your work to create stats.grok.

I'm noticing that when I make a minor edit on a page the stats can spike by as much as 5x on that day. I'm curious if that is normal bot activity, if that might be the result of someone checking "watch this page" and volunteers flooding to check / edit the page, or any other explanation?

Thanks -- jora8488 — Preceding unsigned comment added by Jora8488 (talk • contribs) 11:46, 7 September 2013 (UTC)

counting views
Hi, the information re: counting views does not show who has viewed your page. is that possible to know? Or at least their role ...for example if an editor went on the page that is useful information. do the views include me? i.e..the user that created the wiki page. I think all the views are me which is underwhelming :-) Thanks for your response! Lily — Preceding unsigned comment added by Lrh246 (talk • contribs) 19:27, 7 September 2013 (UTC)

The Signpost: 11 September 2013
<div class="hlist" style="margin-top:10px; font-size:90%; padding-left:5px; font-family:Georgia, Palatino, Palatino Linotype, Times, Times New Roman, serif;">
 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 07:37, 13 September 2013 (UTC)

Integer overflow
Hi Henrik, there seems to be a problem on stats.grok.se, with some view data like this one obviously having some kind of signed 32bit integer overflow (2^31) added to the count. Subtracted by 2^31, the values seem quite reasonable, so the data can be repaired. That buggered up my GLAM stats tools, but I am now filtering these out (for the most part). Just FYI. --Magnus Manske (talk) 08:27, 13 September 2013 (UTC)

The Signpost: 18 September 2013
<div class="hlist" style="margin-top:10px; font-size:90%; padding-left:5px; font-family:Georgia, Palatino, Palatino Linotype, Times, Times New Roman, serif;">
 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 09:19, 20 September 2013 (UTC)

Aggregated stats?
hi! thank you so much for your fabulous stats aggregator, http://stats.grok.se ! I wonder if it is possible to summarize over all years? Also I think this data is not from the beginning of wiki time, right? I saw the recent video re contributors, and one lady mentioned total number of hits her page ever got, ie hitcounter. I'm not sure we have that yet? Kissedsmiley (talk) 15:28, 20 September 2013 (UTC)
 * We used to have http://stats.grok.se/en/2013/Main_Page and the like. --Nemo 05:09, 22 September 2013 (UTC)

No graphs for 2013-09-21
In the last couple days I'm not seeing updates. despite. Or maybe it just takes a few hours more to process yesterday? --Nemo 05:09, 22 September 2013 (UTC)

The Signpost: 25 September 2013
<div class="hlist" style="margin-top:10px; font-size:90%; padding-left:5px; font-family:Georgia, Palatino, Palatino Linotype, Times, Times New Roman, serif;">
 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 09:23, 27 September 2013 (UTC)

Accessing Per-Hour Statistics 2013-09-23
Hello Henrik,

I have been using the stats.grok.se application recently, and have found it extremely useful - thanks very much for creating this!

I have a question about how to gather some additional information. Through the stats.grok.se application, I can gather daily page view information. However, in the "dumps" section (http://dumps.wikimedia.org/other/pagecounts-raw/) I see that data is actually available in an hourly format. What I would ideally like to do is to gather hourly page view information for several specific english wiki articles. Currently, the only way I know how to do this is to download each .gz file for every hour, which contains data for ALL wiki articles, and dig out the information I'm looking for. As you can imagine, this is an extremely time intensive task.

Is there a way to use the stats.grok.se format - by specific article search - or any other way that I can more easily gather this hourly page view information?

Thanks for your help! You can contact me at the address below.

David McIver Boston Children's Hospital Harvard Medical School david.mciver/at/childrens.harvard.edu 134.174.21.27 (talk) 14:35, 23 September 2013 (UTC)
 * I've edited the above email address to attract less spam. I suggest using "Email this user". But note that at the top of page, this user has been unavailable for some time, so set your hopes for a response accordingly. In the meantime, consider setting up an automated cron job to perform the downloading, and either scriptize or python-ize your expansion/filtering task. Since you're at Harvard with, presumably, a very fat data pipe, the download should only take a few seconds. If it takes longer, make a request to your IT department for a priority increase, or a new account on a Childrens server with space and a priority increase, for this sole project. This will keep the fattest file traffic off the general network. Keep it tidy by deleting intermediate files asap of the server. Alternatively, you can wend your way towards becoming a Wikipedia developer, getting an account on the http://tools.wmflabs.org server. This may let you preprocess the files you need before downloading, pre-filtering them and even further reducing unneeded network traffic for everyone. Ideally, you'd be publishing a tool and/or an API which will allow WP editors and devs to prefilter for stats for a particular article, then download those few. I'm on the outside of toolserver and wmflabs, so I don't know if the tool you need already exists. Ask on IRC at #wmflabs, I think. --Lexein (talk) 08:27, 30 September 2013 (UTC)

Just to let you know -- Missing Wikipedians
You have been mentioned at Missing Wikipedians. XOttawahitech (talk) 14:59, 29 September 2013 (UTC)
 * Looks like you're back? Liz  <sup style="font-family:Times New Roman;"><b style="color:#006400;">Read!</b> <b style="color:#006400;">Talk!</b> 15:44, 8 October 2013 (UTC)

Aggregate pageviews
Hi Henrik! I love your tool. I'd like to do two things with it. What do you think? Ocaasit &#124; c 10:43, 3 October 2013 (UTC)
 * Calculate total page views for an article since its creation.
 * Generate a combined statistic for all pageviews to any article that an editor has created.
 * very unfortunately Henrik is a Missing Wikipedians. XOttawahitech (talk) 04:55, 6 October 2013 (UTC)
 * This is not suitable for Henrik's tool, though similar things can be done with its data by "consumer tools" like Magnus' baglama. See 42259 to provide such tools with better data. --Nemo 19:49, 14 October 2013 (UTC)

Request for adding the SWWP to Wikipedia article traffic statistics
Hello there.. I've just viewed the site today - didn't know if there is a site for the Wikipedia statistics. It was awesome seeing some good stuff in it. However, I couldn't find my home Wikipedia on the list. Would you please be so kind to add the SWWP also? Only if possible. Best regards,--'''Mwanaharakati(Longa) 19:36, 3 October 2013 (UTC)
 * All Wikipedias are included, just look up the correct URL. For instance: http://stats.grok.se/'sw/top. --Nemo 19:49, 14 October 2013 (UTC)

chinese text in pagecount... is it hex? how to convert back?
Hi,

I downloaded a pagecount file to look for a chinese phrase (language = zh), but all I see is gibberish i.e.

zh %AE%D5%D1%B5 1 8583 zh %AE%E6%C4%F5%BBy%A4%E5%B0%D3%AC%EC%BE%C7%AE%D5 1 8682 zh %AFS%B9p%A6%E8%A1@%C2%F3%A7J%AE%E6%B9p%AD%7D 1 8691 zh %B0%A2%B1%B4%BF%ED%D0%D8%D3%AC%BB%A2 1 10888 zh %B0%A2%B2%BC%D4%FA%B1%C8%C4%C2%B0%CD%B4%EF%C0%AD%B7%A2%D5%B9%B9%AB%CB%BE 1 8791 zh %B0%A2%B6%FB%B8%A5%C0%D7%B5%C2%A1%A4%CE%F7%CB%B9%C0%B3 1 14821 zh %B0%A3%CB%B9%CC%D8%B9%FE%C6%EB%BF%A8%C2%E5%C0%EF%D1%A7%D4%BA 1 8757 zh %B0%B2%B5%C2%C1%D2%A1%A4%BF%C6%CB%B9%CD%D0%C0%BC%C4%E1 1 713 zh %B0%B2%B6%AB%C4%E1%A1%A4%B0%A2%B6%FB%B0%CD%C4%E1%CE%F7 1 714

Is this hex? Is there any known way of converting this back to chinese?

Regards

Stuart. — Preceding unsigned comment added by 121.75.13.95 (talk) 09:37, 5 October 2013 (UTC)


 * The raw data (so called "Domas wikistats") is not produced by Henrik, please refer to the official documentation. --Nemo 19:49, 14 October 2013 (UTC)

The Signpost: 02 October 2013
<div class="hlist" style="margin-top:10px; font-size:90%; padding-left:5px; font-family:Georgia, Palatino, Palatino Linotype, Times, Times New Roman, serif;">
 * Read this Signpost in full
 * Single-page
 * Unsubscribe
 * EdwardsBot (talk) 04:09, 6 October 2013 (UTC)

Stats down with "internal server error"
stats.grok.se - gives "internal server error" Electron9 (talk) 04:22, 6 October 2013 (UTC)
 * Yep - I posted about it at Village_pump_(technical). It's a shame this useful tool is not maintained by Wikimedia. XOttawahitech (talk) 04:44, 6 October 2013 (UTC)


 * Oops, should be fixed now. Stats for yesterday should be up soon. henrik  • talk  07:29, 6 October 2013 (UTC)
 * Nope, Henrik. You might want to take another look --- still not working...Thank you--أخوها (talk) 18:38, 8 October 2013 (UTC)

Why this useful tool is not maintained by Wikimedia ? Thanks a lot for the answer. Today, it does't work. Best regards.

IP, 09:31, 7 October 2013 (UTC)


 * Read and cc yourself to 42259 for the answer. --Nemo 19:49, 14 October 2013 (UTC)

Communications Thesis on the Consumption of Knowledge / TEL AVIV UNIVERSITY
Dear Henrik, I hope I find you well!

I am writing to you after seeing your name in the Wikipedia traffic Statistics, and was hoping that you may be able to help me..?

My name is Yuval Shani, and I am a student of Communications in Tel Aviv University. I am currently writing my thesis on the consumption of knowledge in various media, and am desperately looking for relevant statistics to back up my argument.

Do you perhaps know Where could I find information regarding the number of overall daily views in the English Wikipedia? What percentage from the overall views, do the “Top 1000” account for? And the “Top 5000”, or 10000?

Thank you so much for your time!

Sincerely, Yuval sinishani@yahoo.com — Preceding unsigned comment added by Sinishani (talk • contribs) 13:06, 6 October 2013 (UTC)


 * Maybe you're looking for WP:5000. --Nemo 19:49, 14 October 2013 (UTC)

Stats out of order
In the last couple days, it doesn't work... Why ? Thanks a lot for the answer about stats. Best regards.

IP, — Preceding unsigned comment added by 86.73.64.169 (talk) 12:35, 8 October 2013 (UTC)


 * I noticed they now usually get updates around 10-11 UTC, maybe they take more time. --Nemo 19:49, 14 October 2013 (UTC)

Question about stats
When you hit "Top" on the stats page, one is presented with a list of "Most viewed articles in 201304". Is there any way this could be updated? I've changed the dates in the search page where I click from but it still goes to a top chart from April 2013.

Also, is this cumulative, since records were kept, or just for the month of April 2013? That might seem obvious but I have no idea what the level of normal traffic is. Thanks!

P.S. I did look in your FAQs page but couldn't find an answer to this quesiton. L. Liz  <sup style="font-family:Times New Roman;"><b style="color:#006400;">Read!</b> <b style="color:#006400;">Talk!</b> 15:43, 8 October 2013 (UTC)
 * Actually, this is covered by the FAQ: User:Killiondude/stats. --Nemo 19:49, 14 October 2013 (UTC)