Wikipedia:Wikipedia Signpost/2014-04-30/Recent research

Wikipedia Usage Estimates Prevalence of Influenza-Like Illness
Researchers from Harvard Medical School have tested the possibility of predicting the number of seasonal influenza-like illness (ILI) in the U.S. using data about the traffic to a selected number of Wikipedia entries related to influenza.

They compared their models against the prediction of Google Flu Trends (GFT), one of the earliest and most famous web-based tools for predicting the evolution of seasonal influenza disease patterns. Gold standard for comparison were the public data released by the Center for Disease Control (CDC). The accuracy of GFT is increasingly under question by several authors, culminating in a recent Science commentary piece about the promises and perils of Big Data for prediction of real-world phenomena. The authors start from this observation and submit that Wikipedia searches may be less subject to the biases that affected GFT, and test this hypothesis in the present work. They find that their model is more accurate than GFT, and was able to predict the peak week of the influenza season more often. Another undoubted advantage of Wikipedia compared to GFT, the authors argue, is its public availability, which makes the present model open to public scrutiny.

Survey of academics' view on Wikipedia and open-access publishing
A study titled "Academic opinions of Wikipedia and open-access publishing" examined academics’ awareness of and attitudes towards Wikipedia and open-access journals for academic publishing through a survey of 120 academics carried out in late 2011 and early 2012. The study comes from the same authors who published a similar paper in 2012, reviewed here, which suffered from a major basic fallacy: Wikipedia is not the place to publish original research academic work. The authors, unfortunately, seem to ignore no original research policy when they write: "There are in general three models in the current movement towards open-access academic publishing: pushing traditional journals towards open access by changing policies; creating open-access journals; and using existing online open-access venue Wikipedia" and "we surveyed academics to understand their perspectives on using Wikipedia for academic publishing in comparison with open-access journals". In the final discussion segment, the authors do acknowledge the existence of the OR policy, where they suggest that certain types or academic papers (reviews) are similar enough to Wikipedia articles that integration of such articles into Wikipedia could be feasible. The authors do provide a valuable literature review noting prior works which analyze the peer-review system in Wikipedia, perceptions of Wikipedia in academia, and related issues (through said review is partially split between the introduction and discussion section).

The study provides some interesting findings regarding academics' view of the benefits of Wikipedia-style peer review and publishing. Most respondents (77 percent) reported reading Wikipedia, and a rather high number (43 percent) reported having made at least one edit, with 15 percent having written an article. Interestingly, as many as four respondents stated that they were "credited for time spent reviewing Wikipedia articles related to their academic careers" in their professional workplaces. The more experience one had with Wikipedia, the more likely one would see advantages in the wiki publishing model. Most common advantages listed were cost reductions (40 percent), timely review (19 percent), post-publication corrections (52 percent), making articles available before validation (27 percent) and reaching a wider audience (8 percent). Disadvantages included questionable stability (86 percent), absence of integration with libraries and scholarly search engines (55 percent), lower quality (43 percent), less credibility (57 percent), less academic acceptance (78 percent) and less impact on academia (56 percent).

54 percent of respondents were aware that Wikipedia had a peer-review process and about third of these considered it to be less rigorous than that of scholarly journals; none of the respondents demonstrated any significant experience with the specifics of how Wikipedia articles are reviewed, suggesting that their involvement with the Wikipedia is rather limited. 75% of the survey respondents did not feel comfortable having others edit their papers-in-progress, and over 25% expressed concern about the lack of control over changes made post-publications. Majority of respondents did not also feel comfortable with their work being reviewed by Wikipedians, with the most common concern being unknown qualifications of Wikipedia editors and reviewers.

Perhaps of most value to the Wikipedia community is the analysis of suggestions made by the respondents with regards to making Wikipedia more accepted at the universities. Here, the most common suggestion was “making the promoted peer-reviewed articles searchable from university libraries” and in general, making it more easy to find and identify high quality articles (some functionality as displaying the quality assessment of an article in mainspace already exists in MediaWiki but is implemented as opt-in feature only).

The authors conclude that the academic researchers’ increased familiarity with either open access publishing or wiki publishing is associated with increased comfort with these models; and the academic researchers’ attitudes towards these models are associated with their familiarity, academic environment and professional status. Overall, this study seems like a major improvement over the authors' 2012 paper, and a valuable paper addressing the topics of the place of Wikipedia in the open publishing movement and the relationship between Wikipedia and academia.

Briefly

 * Wikipedia use driven by new media or replacing news media?: In a series of blog posts  Oxford Internet Institute researchers Taha Yasseri and Jonathan Bright examined pageview data from before, during and after the 2009 European Parliament election on different language Wikipedias (mostly corresponding to different European countries where the election took place). They found evidence both for the theory that Wikipedia readership is driven by media coverage (people turning to Wikipedia for background information on what they say in the news) and for the theory that Wikipedia acts as "media replacement" (people looking online for e.g. election results instead of getting that information from news media).
 * New Python library for researchers: Wikimedia Foundation researcher Aaron Halfaker published a collection of software tools "for extracting and processing data from MediaWiki installations, slave databases and xml dumps."
 * "Do Famous People Live Longer?" Yes for academics, no for artists and athletes: Four researchers from Ben-Gurion University of the Negev examined 7756 biographical Wikipedia articles about people who had died between 2009 and 2011 for gender, occupation and age at death. 84% of the article subjects were male, "and the mean age of death was lower for males than females (76.31 vs. 78.50 years). Younger ages of death were evident among sports players and performing artists (73.04) and creative workers (74.68). Older deaths were seen in professionals and academics (82.63)." Two of the authors also published another preprint titled "Wikiometrics: A Wikipedia Based Ranking System", applying it to universities and academic journals in particular. The resulting rankings correlate strongly with some established metrics like impact factors.

Other recent publications
A list of other recent publications that could not be covered in time for this issue - contributions are always welcome for reviewing or summarizing newly published research.
 * "Behavioral Aspects in the Interaction Between Wikipedia and its Users" (see also our review of an earlier paper that the two authors published with others in 2012: "Science eight times more popular on the Spanish Wikipedia than on the English Wikipedia?")
 * "Bots vs. Wikipedians, Anons vs. Logged-Ins" (poster at the WWW 2014 conference)
 * "Telling Breaking News Stories from Wikipedia with Social Multimedia: A Case Study of the 2014 Winter Olympics"
 * "A classifier to determine which Wikipedia biographies will be accepted" - according to the abstract, it relies on "indicators [that] do not refer to the content itself, but to meta-content features (such as the number of categories that the biography is associated with) and to author-based features (such as if it is a first-time author)".
 * "What Makes a Good Biography? Multidimensional Quality Analysis Based on Wikipedia Article Feedback Data"
 * "Counter narratives and controversial crimes: The Wikipedia article for the ‘Murder of Meredith Kercher’" (a linguistic essay examining two different versions of the article each on the English and the Italian Wikipedia. University press release: "Scrutinising the myth of social media ‘neutrality’")
 * "Assessing the Quality of Thai Wikipedia Articles Using Concept and Statistical Features"
 * "The Genealogy of Knowledge: Introducing a Tool and Method for Tracing the Social Construction of Knowledge on Wikipedia"
 * "Wikipedia As a Tool for Disseminating Knowledge of (Agro)Biodiversity"
 * "Complementary and Alternative Medicine on Wikipedia: Opportunities for Improvement"
 * "Revision Graph Extraction in Wikipedia Based on Supergram Decomposition and Sliding Update" (earlier coverage of related papers by the same authors: "Revision graph extraction in Wikipedia based on supergram decomposition", "Unearthing the "actual" revision history of a Wikipedia article")
 * "Detecting Controversial Articles in Wikipedia " (as an exercise in an undergraduate course on graph theory)