Wikipedia:Reference desk/Archives/Computing/2018 December 7

= December 7 =

Is there a way to produce a list of one thousand (or more) of the most popular words in the Norwegian language based on all the Norwegian Wikipedia articles?
I believe that such a website / tool might be of great help for people who want to learn a completely new language, as they would be able to begin by learning the most common words in that language (in this instance Norwegian, but it could be any other language that has a Wikipedia site) and how to spell those most-used words according to the most common spelling.

I tried googling whether such a program / tool exists, and all I have found so far is that there is such a tool on GitHub for the English Wikipedia. I'm not sure whether this particular tool could be adapted to do the same thing with a different Wikipedia (the Norwegian Wikipedia in this instance), and I am also not sure how to run GitHub repositories. Do any of you know a simpler solution? (Maybe there is some other, easier tool / website?) ויקיג'אנקי (talk) 21:23, 7 December 2018 (UTC)


 * I'm sorry, I don't know. However, two points: (i) Norwegian orthography is unusually problematic. (ii) The description that you point to says nothing about recognizing two or more strings of letters as alternative spellings of the same word. It's not easy to recognize that "traveler" and "traveller" (or indeed "jail" and "gaol") are a pair of alternatives. -- Hoary (talk) 00:34, 8 December 2018 (UTC)
 * The unusual problem with Norwegian that Hoary mentions is so great that there are actually two Norwegian Wikipedias. Please see no.wikipedia.org for the Bokmål version and nn.wikipedia.org for the Nynorsk version. Nyttend (talk) 19:30, 9 December 2018 (UTC)
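
For readers who want to try the counting step themselves, here is a minimal Python sketch of the idea asked about above. It is not the GitHub tool mentioned in the question. It assumes the article text has already been extracted to plain text (for example from a dump of one of the two Norwegian Wikipedias), and the names `top_words` and `variants` are hypothetical, chosen only for this illustration. The optional `variants` mapping is a hand-made table that folds alternative spellings together; as Hoary notes, nothing here detects such pairs automatically.

```python
import re
from collections import Counter

# Matches runs of Unicode letters, so Norwegian æ, ø and å are kept
# inside words; digits and underscores are excluded.
WORD_RE = re.compile(r"[^\W\d_]+")


def top_words(text, n=1000, variants=None):
    """Return the n most frequent words in text as (word, count) pairs.

    variants is an optional, hand-made dict mapping an alternative
    spelling to the spelling it should be counted under (an assumption
    of this sketch; nothing detects such pairs automatically).
    """
    variants = variants or {}
    counts = Counter()
    for match in WORD_RE.finditer(text.lower()):
        word = match.group()
        counts[variants.get(word, word)] += 1
    return counts.most_common(n)


if __name__ == "__main__":
    # Tiny made-up sample; a real run would read the extracted dump text.
    sample = "Jeg er glad. Jeg bor i Norge, og jeg lærer norsk på øya."
    for word, count in top_words(sample, n=3):
        print(word, count)
```

Because Bokmål and Nynorsk have separate Wikipedias, running the sketch on each one's text separately would give one list per written standard rather than a single mixed list.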