Wikipedia:Typo Team/moss/not English

This report highlights articles that have long passages of non-English text, or a large number of non-English words.

Tips and how-to
How does this work?
 * Click through to each article you want to work on, and use the example words to find the problem area.
 * Fix the problem or tag the article. Common resolutions that will prevent an article from reappearing on this report include:
 * Look through the history of the article and revert to the last good version.
 * Remove a large block of unsourced foreign-language text that appears to be intended as article prose.
 * Use templates to wrap foreign-language lyrics, verse, or quotations. See lang, transl, and Template:Quotation templates
 * Wrap chemical names in chem name
 * Tag the article for cleanup
 * Not English for articles and sections that just need translation (first parameter is the language name)
 * cleanup lang for non-English text that is legitimately included
 * cleanup (use the "reason" parameter)
 * Move to Wikisource for primary source texts
 * See Pages needing translation into English for more ways to tag
 * Standardize section names to prevent false alarms
 * Sometimes sections with long lists of non-English-language works appear at the end of an article, and result in the article inappropriately being listed here. Retitling the section to use a more standard name, such as one of those suggested by MOS:LAYOUTWORKS should fix this. (Contact User:Beland if you think a given title should be ignored but isn't.)
 * Delete the listing from the table so that no one else will try to duplicate your work. If someone else has already fixed the article (check the article history), delete it from the table.
 * If the article doesn't need fixing, or you have a question about what to do with it, feel free to leave a note in the last column of the table below.
 * If you want to, you can sort the table by a different metric, by suspected language, or by title.

Google Translate may be helpful to identify the language, if you don't trust the language code listed in the table. When the table adds "?" to a language code, that's based on words not found in Wiktionary, and is particularly unreliable.

About this report
Jonesey95 came up with the idea for this report after noticing an all-Spanish article had been sitting around for over a year.

The algorithm behind the report is complicated, and tries to find long passages of non-English text (ignoring paragraphs with fewer than 20 words). Short words (3 letters or fewer) are also ignored because they are difficult to classify and many are untagged mathematical expressions. It's difficult to determine what language some words come from because they are not in any dictionary, so you will see some misspellings and non-words mixed in. This is actually useful, because articles with a lot of those are generally in need of cleanup. Other words are in multiple languages, or are proper nouns or species names. As a result, the "percentage English" and "non-English word count" numbers are only approximate and are only intended to rank articles, not accurately measure them in an absolute sense.

The table includes the top 250 highest-ranking articles from both the "non-English word count" and the "percentage non-English" metrics, which have some overlap so there aren't actually 500 articles listed. You can choose which metric to rank with by clicking on the arrows in the column header. Only the first 10 example words are listed, and sometimes fewer if the system is confused about which language they are from. Only examples from the most-common non-English language are shown.