User:Fnielsen

fnielsen is (me!) Finn Årup Nielsen of Lyngby, Denmark, an engineer working with data science. I edit Wikipedia and do research on wikis and Wikipedia.

On Wikipedia and sister projects
I have been active on the English and Danish versions of Wikipedia. My first English Wikipedia entry was on August 23, 2002, and my first Danish entry on February 11, 2002.

Among the articles I have added to are non-negative matrix factorization, ratio distribution, intraclass correlation, Niels A. Lassen, John Lykoudis and 5-HTTLPR. My English edits have usually been related to my work. On the Danish Wikipedia I usually write articles that are not related to my work. I also take photos for Wikimedia Commons. Most photos have been of Copenhagen and suburbs.

(I have a sandbox as a subpage where I try to understand Wikipedia tables, templates, substitutions, transclusions and other technical details of Wikipedia.)

Community
I am a member of Wikimedia Denmark and presently a member of its board. I have been at Wikimania in Alexandria, Gdansk and London.

Tools
I have made a tool that converts a PubMed/MEDLINE identifier, PMID, to a cite journal template. As of 2008 it is available from
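As a rough illustration of the formatting step only (this is not the actual tool), the sketch below turns an already retrieved bibliographic record into a cite journal template. The function name `cite_journal`, the dictionary layout and the placeholder record are assumptions made for this example; a real converter would first look up the metadata for the PMID in PubMed.

```python
def cite_journal(record):
    """Format a bibliographic record (a dict) as a {{cite journal}} template."""
    # Parameter names follow the cite journal template; empty fields are skipped.
    fields = ["author", "title", "journal", "volume", "issue", "pages", "year", "pmid"]
    parts = ["{}={}".format(name, record[name]) for name in fields if record.get(name)]
    return "{{cite journal |" + " |".join(parts) + "}}"


# Placeholder record, not a real article or PMID.
record = {
    "author": "Example A",
    "title": "An example title",
    "journal": "Journal of Examples",
    "year": "2008",
    "pmid": "00000000",
}
print(cite_journal(record))
```

Fields that are missing from the record are simply left out of the template, so the output stays valid wikitext when PubMed lacks, say, an issue number.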

On the Toolserver I had an online visualization service called revvis. Enabled for the Danish Wikipedia, it showed the sequential collaboration network for a Wikipedia article.
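One way to read "sequential collaboration network" is as a directed graph that links the author of each revision to the author of the next one. The sketch below builds such edge counts from a revision history; the function name, the toy user names and this reading of the term are assumptions, not a description of how revvis was implemented.

```python
from collections import Counter


def sequential_collaboration_edges(revision_authors):
    """Count directed edges from each revision's author to the next revision's author."""
    edges = Counter()
    for previous, current in zip(revision_authors, revision_authors[1:]):
        if previous != current:  # consecutive edits by the same user add no edge
            edges[(previous, current)] += 1
    return edges


# Hypothetical revision history, oldest revision first.
history = ["Alice", "Bob", "Bob", "Alice", "Carol", "Alice", "Bob"]
edges = sequential_collaboration_edges(history)
for (source, target), weight in sorted(edges.items()):
    print(source, "->", target, weight)
```

The resulting weighted edge list can be handed to any graph drawing tool for visualization.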

The most recent Toolforge tools are Scholia, Wembedder, CVRminer and Ordia.

Article-by-author matrix factorization
I participated in WikiSym 2006 in Odense and gave a short talk in the workshop about Wikipedia research organized by Jakob Voß and Angela Beesley. The slides used for the presentation are available from the department archive: Non-negative matrix factorization decomposed an article-by-author data matrix to identify clusters of authors and articles.
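The idea can be sketched with the standard multiplicative update rules for non-negative matrix factorization, here in plain Python to keep the example dependency-free. The toy edit-count matrix, the rank and the iteration count are assumptions chosen for illustration; they are not the data or settings from the talk.

```python
import random


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(a):
    return [list(col) for col in zip(*a)]


def nmf(v, rank, iterations=200, seed=0, eps=1e-9):
    """Approximate non-negative V (articles x authors) by W H using multiplicative updates."""
    rng = random.Random(seed)
    n, m = len(v), len(v[0])
    w = [[rng.random() + 0.1 for _ in range(rank)] for _ in range(n)]
    h = [[rng.random() + 0.1 for _ in range(m)] for _ in range(rank)]
    for _ in range(iterations):
        # H <- H * (W^T V) / (W^T W H), element-wise
        wt = transpose(w)
        num, den = matmul(wt, v), matmul(matmul(wt, w), h)
        h = [[h[i][j] * num[i][j] / (den[i][j] + eps) for j in range(m)] for i in range(rank)]
        # W <- W * (V H^T) / (W H H^T), element-wise
        ht = transpose(h)
        num, den = matmul(v, ht), matmul(w, matmul(h, ht))
        w = [[w[i][j] * num[i][j] / (den[i][j] + eps) for j in range(rank)] for i in range(n)]
    return w, h


def squared_error(v, w, h):
    """Sum of squared differences between V and the reconstruction W H."""
    wh = matmul(w, h)
    return sum((a - b) ** 2 for rv, rw in zip(v, wh) for a, b in zip(rv, rw))


# Toy matrix: rows are articles, columns are authors, entries are edit counts.
V = [
    [5, 4, 0, 0],
    [4, 5, 0, 0],
    [0, 0, 3, 6],
    [0, 0, 6, 3],
]
W, H = nmf(V, rank=2)
# Assign each article to the component with the largest loading in its row of W.
article_clusters = [max(range(2), key=lambda k: row[k]) for row in W]
print(article_clusters, squared_error(V, W, H))
```

Rows of W give the loading of each article on each component and columns of H the loading of each author, so articles and authors that load on the same component form a cluster.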

Scientific citations in Wikipedia
In 2007 I looked at the outbound citations from Wikipedia (using Template:Cite journal) to scientific journals. Nature and Science were the most cited. I compared the number of citations with citation information from Journal Citation Reports of Thomson Scientific, and the correlation between the two sets of values was reasonably good, i.e., a scientific journal with many citations tended to also have many Wikipedia citations. An updated scatter plot for the July 2007 database dump is available.
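The counting and comparison steps can be sketched as follows. The regular expression, the function names, the journal names and all numbers are made up for illustration; the pattern handles only simple templates (no nested templates or piped links in the journal field), and Pearson correlation is used here as one possible measure, which may differ from the one used in the study.

```python
import math
import re
from collections import Counter

# Captures the value of the journal parameter inside a simple cite journal template.
JOURNAL_FIELD = re.compile(
    r"\{\{\s*cite journal\b[^{}]*?\|\s*journal\s*=\s*([^|{}]+)", re.IGNORECASE
)


def count_journals(wikitext):
    """Count how often each journal is cited via cite journal templates."""
    return Counter(name.strip() for name in JOURNAL_FIELD.findall(wikitext))


def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


# Hypothetical wikitext with placeholder journal names.
sample = (
    "{{cite journal |title=T1 |journal=Journal A |year=2006}} "
    "{{Cite journal | journal = Journal A | title = T2}} "
    "{{cite journal |journal=Journal B |title=T3}} "
    "{{cite journal |title=T4 |journal=Journal A}} "
    "{{cite journal |title=T5 |journal=Journal C}} "
    "{{cite journal |title=T6 |journal=Journal B}}"
)
wikipedia_counts = count_journals(sample)

# Made-up external citation counts for the same placeholder journals.
reference_counts = {"Journal A": 900, "Journal B": 250, "Journal C": 100}

journals = sorted(reference_counts)
r = pearson(
    [wikipedia_counts[j] for j in journals],
    [reference_counts[j] for j in journals],
)
print(wikipedia_counts, r)
```

With real data the counts span several orders of magnitude, so taking logarithms or using a rank correlation before comparing may be more informative.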

I also tried to cluster scientific journals and Wikipedia articles and made a longitudinal comparison of the citation information. This is described in Clustering of scientific citations in Wikipedia.

The data set used for the clustering is available from the homepage, and a couple of researchers have used it to test machine learning algorithms; see the references on Google Scholar.

Wiki for representing and computing with scientific data
I am experimenting with representing scientific data in a wiki-like environment. My first attempt was a spreadsheet-like online service for personality genetics. It is available from:
 * http://neuro.imm.dtu.dk/cgi-bin/brede_wiki_personalitygenetics

I now mostly work with the "Brede Wiki".
 * http://neuro.imm.dtu.dk/wiki/

This MediaWiki-based wiki can represent spreadsheet-like data and meta-analysis is presently performed with an online script.

At Wikimania 2010 I tried to argue for a way to represent scientific data in Wikipedia: Wikipedia is not the sum of all human knowledge. I look forward to seeing how Wikidata can be used for such data.

Review of Wikipedia research
Since 2008 I have been writing a review of wiki and Wikipedia research entitled Wikipedia research and tools: Review and comments. I now collaborate over the Internet with fellow researchers (Chitu Okoli and others) to summarize and review Wikipedia research. We presently have a long working paper entitled The people's encyclopedia under the gaze of the sages: a systematic review of scholarly research on Wikipedia, see http://ssrn.com/abstract=2021326. We maintain the data for the review on a Semantic MediaWiki at http://wikilit.referata.com. The setup of this wiki was inspired by WikiPapers (http://wikipapers.referata.com) and my Brede Wiki.

I have also written a popular science overview of Wikipedia research in Danish: Wikipedia - nørdernes sejr over vandalerne?.

Blog
I maintain blogs where I sometimes write posts relating to Wikipedia. These may be opinionated or related to my work. I originally wrote on Posterous, but since it has closed down, the Wikipedia posts from my blogs can be found here:
 * http://finnaarupnielsen.wordpress.com/tag/wikipedia/
 * http://fnielsen.tumblr.com/tagged/wikipedia (nothing here yet)

Examples of blog posts are:
 * Hunting down the undead ghost of classical conductor George Richter: the story of an article about a conductor that I added to Wikipedia but that was later deleted when he turned out to be fictitious.
 * Sentiment analysis of Wikipedia pages on Danish politicians.

Links

 * "Blaming" tools:
   * WikiChanges
   * Hauptautoren de:Wikipedia:Hauptautoren
 * User edit counting:
   * "River" (Danish)
 * Overall statistics:
   * Wikichecker. User profiling
   * Xiong Changnian's Xiong's stats. Also makes some notes about user types.


 * User:fnielsen/Book:Music, example book

Conflict of Interest Statement
I am working for or have been employed by:
 * Technical University of Denmark
 * Rigshospitalet
I have received grants from:
 * The Lundbeck Foundation, partial owner of Lundbeck.
 * Villum Foundation, associated with VELUX.
 * Danish Research Councils.
 * American research councils.
I received gifts associated with:
 * Klovn
 * Frank Hvam
 * Packt

Reference
da:Bruger:fnielsen de:Benutzer:fnielsen