Wikipedia talk:Contributor copyright investigations/Norden1990

Print sources
I've found several more cases where large amounts of text were copied verbatim from online English sources, or faithfully translated into English from online Hungarian sources. In these cases the identification and removal of the infringing material was relatively easy. However, I am also seeing several articles which are sourced to English- or Hungarian-language books which are not available online. Given that it could be very expensive in terms of time and money to check these for copyvios, I'm wondering if these should just be presumptively deleted. If so, what's the best way of going about this? If not, what other options do we have? —Psychonaut (talk) 10:49, 25 April 2014 (UTC)

Translation sources
Just a heads-up that many of the copyvios discovered so far are unattributed translations from the Hungarian or German Wikipedias. (Fortunately these are easy to fix; we just need to apply the translated page template to the article's talk page.) So when processing articles in this CCI, it may help to check the corresponding article on de-wiki or hu-wiki, particularly if the only citations are to German or Hungarian sources. —Psychonaut (talk) 10:35, 26 April 2014 (UTC)

Images
I've tagged around 50 problematic images uploaded here and on Commons (see Commons:Commons:Deletion requests/2012120110006287 for the ones there). I'm pretty sure I got everything on en-wiki, though the list on Commons might not be exhaustive. —Psychonaut (talk) 15:32, 27 April 2014 (UTC)
 * The above-noted images have now been deleted as copyvios. (There were attempts by a couple users to obtain permission, but hadn't been obtained in several weeks.)  A further batch of around 70, with somewhat different circumstances, is being discussed at Commons:Commons:Deletion requests/2010052110030211. —Psychonaut (talk) 20:48, 18 May 2014 (UTC)

Highlighted diffs
I have highlighted in yellow those diffs with a very high probability of containing copyright violations. These diffs all insert material referenced to politics.hu. (Norden1990 apparently admitted that many of his copyvios are from this site, and experience so far with this CCI shows that nearly all material sourced to politics.hu was in fact copied verbatim or with minimal paraphrasing.) The highlighting should allow us to clear much of the low-hanging fruit. In the meantime I'll look into automated solutions for identifying other copyvio sources, such as the translated non-free parliamentary biographies and the Google Translated versions of Hungarian and German Wikipedia articles. —Psychonaut (talk) 14:09, 19 May 2014 (UTC)