Wikipedia:Wikipedia Signpost/2019-02-28/Recent research

Wikipedia is a second screen for Internet TV watchers

 * Reviewed by Bri

In this study it was shown that Wikipedia pageviews follow Internet TV coverage (technically, there was a high Pearson correlation between viewership and pageviews). The researchers were able to use data mining of anonymized AbemaTV logs to correlate individual viewers to Wikipedia pageviews changes. They could even determine second screen behavior: viewers were watching programs and reading related articles in near-realtime after 23:00 (viewer's time) – taking measures to exclude channel surfing viewers. This reviewer finds the use of Wikipedia as an adjunct or research tool for television watching is somewhat at odds with a naive assumption of an either-or media consumption model, and could help explain some of our media-centric Top 25 article views.

Wikidata calculates cultural diversity

 * Reviewed by Andrew Krizhanovsky

This research deals with Cultural Context Content (CCC), that is articles of Wikipedia "related to the editors' geographical and cultural context (i.e. their places, traditions, language,  agriculture, biographies, etc.)". CCC is about 25% of articles in one Wikipedia edition. These articles are exclusive and have no equivalence across language editions.

There are thousands of Wikidata properties. So, this research paper can be useful for learning Wikidata, because it describes Wikidata properties grouped by country, location, language, author, affiliation and several taxonomic relations (part of, has part). The future research of this paper could be directed to the visualization of some results via Wikidata Query Service (see examples in the Wikiversity course "Research in programming Wikidata").

Very interesting results of this work are
 * The mapping of language to territories.
 * Number of unique articles and intersections with other Wikipedias.
 * Nearest Wikipedia or which Wikipedias cover this Wikipedia (e.g. for the Tamil Wikipedia, the five other language editions which cover most of its articles are, in descending order, English, Hindi, Arabic, Simple English, and Urdu, as of October 2018).

See also below for a list of other papers related to the "Wikipedia Cultural Diversity Observatory", by the same authors

Briefly

 * By Tilman Bayer

Wikimedia research whitepapers
The Wikimedia Foundation's Research team has announced three white papers outlining research plans and priorities for the next five years, on the strategic themes of "knowledge gaps", "knowledge integrity" and "foundations".

Dario Taraborelli and Erik Zachte leave Wikimedia Foundation
Two longtime employees known for their work on statistics and research about Wikimedia projects departed the Wikimedia Foundation this month. Erik Zachte, who in 2003 began to work as a volunteer on what would become known as Wikistats, and in 2008 took up paid work as data analyst for WMF, announced his retirement. Dario Taraborelli, the Foundation's head of research (also co-founder of this research report and its associated @wikiresearch Twitter feed) announced his departure after eight years to do open science work at a different organization.

Conferences and events
See the research events page on Meta-wiki for upcoming conferences and events, including submission deadlines, and the page of the monthly Wikimedia Research Showcase for videos and slides of past presentations.

Other recent publications
''Other recent publications that could not be covered in time for this issue include the items listed below. Contributions are always welcome for reviewing or summarizing newly published research.''
 * Compiled by Tilman Bayer

"Wikipedia Culture Gap: Quantifying Content Imbalances Across 40 Language Editions"
From the abstract: "...we developed a computational method to identify articles that can be related to the editors' cultural context associated to each Wikipedia language edition. We employed a combination of strategies taking into account geolocated articles, specific keywords and categories, as well as links between articles. [...] The results show that about a quarter of each Wikipedia language edition is dedicated to represent the corresponding cultural context. Although a considerable part of this content was created during the first years of the project, its creation is sustained over time." (see also Wikidata calculates cultural diversity" above)

"Identity-based motivation in digital engagement: the influence of community and cultural identity on participation in Wikipedia"
From the abstract: "By analysing data from 15 language editions, I find that editors develop a community identity in Wikipedia and at the same time they consistently create content representing their cultural identities. Such content occupies around a quarter of each Wikipedia in number of articles, and even more in terms of edits. When editors increase their participation or become administrators, they still prefer editing content imbued with identity-based meanings, which suggests their centrality in the editing process." (see also "Wikidata calculates cultural diversity" above)

"Cultural Identities in Wikipedias"
From the abstract: "... we developed a computational method to identify articles related to the cultural identities associated to a language and applied it to 40 Wikipedia language editions. The results show that about a quarter of each Wikipedia language edition is dedicated to represent the corresponding cultural identities. The topical coverage of these articles reflects that geography, biographies, and culture are the most common themes, although each language shows its idiosyncrasy and other topics are also present. [...] An analysis of how this content is shared among language editions reveals special links between cultures." (see also "Wikidata calculates cultural diversity" above)

Students favor Wikipedia in English even when it's not their main language
From the abstract: "Seventy-seven first-year audiovisual communication students [most with Catalan and/or Spanish as their main languages] made contributions to Wikipedia as part of the assessed work in the first year course titled 'Digital Culture.' Before and after writing Wikipedia articles, the students responded to two questionnaires that enquired about their language-related habits when using the site and about their language choice for contributing to it. ... Students favor the English edition of Wikipedia when consulting it despite the fact that this is the language they assess themselves as being less proficient at in reading. More generally, our research shows that multilingual Wikipedia users move seamlessly from one language edition to another, thus refuting the cliché that relates minority languages with exclusively local and self-referential topics."

"Studying the Effect of Network Position on Efficiency"
From the abstract: "We gathered data about 2978 Wikipedia featured article editing history. We use degree centrality for both article affiliation network and editor’s affiliation network. [...] This study finds that article degree centrality [has a] negative effect the collaboration process, that suggests that article linkages with other article attracts diverse knowledge bases and editors, that require time and consensus for further moving the editing process. Editors maximum centrality in a single article editor affiliation network have a positive effect on efficiency of an article, while editor maximum degree centrality in a multi-article affiliation network have negative effect on editing process efficiency."

"Collaborative Approach to Developing a Multilingual Ontology: A Case Study of Wikidata"
From the abstract: "In this article, Wikidata has been taken as an example to understand how community-driven approach is used to develop a multilingual ontology and in the subsequent building of a knowledge base."

"Exploring Translators’ Expectations of Wikipedia: A Qualitative Review"
From the abstract: "This paper's goal is to explore potential uses that translators could expect from Wikipedia. [...] We have concluded that translators might use Wikipedia expecting to find linguistic, semantic, terminological, lexicographic and cultural information."

"Wikipedia as a translation zone. A heterotopic analysis of the online encyclopedia and its collaborative volunteer translator community"
This article is a case study focused on the construction of the English Wikipedia article about Tokyo.

"Discussing Wartime Collaboration in a Transnational Digital Space: The Framing of the UPA and the Latvian Legion in Wikipedia"
From the abstract: "[This book chapter] investigates the different framing strategies used to represent wartime memories in Wikipedia, the ways these strategies are developed by local editors’ communities, and the reception of Wikipedia’s representations of the past by national and transnational audiences. The chapter concludes with a discussion of the different forms of consensus used in Wikipedia for dealing with contentious past as well as the promises and dangers of using digital media for transnational history writing."

"Translation and the Production of Knowledge in 'Wikipedia': Chronicling the Assassination of Boris Nemtsov"
From the abstract: "Based on a set of articles about the assassination of Russian politician Boris Nemtsov from nine different editions of the encyclopaedia, the article examines the place of translation in Wikipedia and the role it plays in knowledge production. Each of the articles is likely to use a number of different information sources, including other Wikipedia articles that are already in existence, with translation contributing to knowledge production as each new article evolves. ..."

"Locating foci of translation on Wikipedia"
From the abstract: "... it is generally accepted that most Wikipedia content is the product of original writing rather than being translated from another language version of the encyclopaedia. [...] The main aim of this paper is to make a number of proposals towards a possible methodology for discovering where the main foci of this new type of collaborative translation are located. Significant methods for this include the use of the encyclopaedia’s list-based structure and of different features of page anatomy. The article [is] using Russian and Chinese to English translation as its main sources of examples."