Wikipedia:WikiProject Wikidemia/Quant/Arch

Parser

 * Converts the zipped XML database dumps into CSV files with file specification:....
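
The conversion step above could be sketched roughly as follows in Python. This is only an illustration, not the project's actual parser. It assumes the dump follows the usual MediaWiki export layout (page, title, revision, timestamp and contributor elements) and that it is compressed with bz2. The function name and the CSV column names are hypothetical, since the file specification is not given here.

```python
import csv
import xml.etree.ElementTree as ET


def local(tag):
    """Strip an XML namespace prefix, e.g. '{ns}page' -> 'page'."""
    return tag.rsplit("}", 1)[-1]


def dump_to_csv(xml_stream, csv_stream):
    """Stream revision header fields from an XML dump into CSV rows.

    The column names below are assumptions made for this sketch.
    """
    writer = csv.writer(csv_stream)
    writer.writerow(["title", "rev_id", "timestamp", "contributor"])
    title = ""
    # iterparse reads the dump incrementally, so it never has to fit in memory.
    for _event, elem in ET.iterparse(xml_stream, events=("end",)):
        name = local(elem.tag)
        if name == "title":
            title = elem.text or ""
        elif name == "revision":
            fields = {local(child.tag): child for child in elem}
            who = ""
            contributor = fields.get("contributor")
            if contributor is not None:
                for child in contributor:
                    if local(child.tag) in ("username", "ip"):
                        who = child.text or ""
            rev_id = fields["id"].text if "id" in fields else ""
            stamp = fields["timestamp"].text if "timestamp" in fields else ""
            writer.writerow([title, rev_id or "", stamp or "", who])
            elem.clear()  # drop the revision text once its header is written
        elif name == "page":
            elem.clear()


# Hypothetical usage (file names are placeholders):
#   import bz2
#   with bz2.open("dump.xml.bz2", "rb") as src, \
#           open("headers.csv", "w", newline="") as dst:
#       dump_to_csv(src, dst)
```

Only header fields are written; the revision text is discarded, which keeps the resulting CSV files small enough for statistical packages.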

Stats

 * CSV files of header information can be read into the statistical software packages R and Stata.

Data Anomalies
In the Indonesian Wikipedia dump, usernames occasionally appear in the  tag (e.g. user:Vyasa). These cases appear to be confined to 2003. It is not clear why this occurs.
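
One way to check how far such an anomaly extends is to count affected rows per year in the parsed CSV output. The sketch below is an assumption-laden illustration: the column holding the affected field is passed in as a parameter because the tag is not named above, and the `timestamp` column name and the function name are hypothetical.

```python
import csv
from collections import Counter


def count_prefixed_by_year(csv_stream, column, prefix="user:",
                           timestamp_column="timestamp"):
    """Count rows per year whose `column` value starts with `prefix`.

    Assumes timestamps begin with a four-digit year (e.g. 2003-05-01T...).
    """
    counts = Counter()
    for row in csv.DictReader(csv_stream):
        value = row.get(column) or ""
        if value.lower().startswith(prefix):
            year = (row.get(timestamp_column) or "")[:4]
            counts[year] += 1
    return dict(counts)
```

If the anomaly really is confined to 2003, the returned dictionary should contain only the key "2003" for the Indonesian dump.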