User:Robevans123/sandbox/Newton Aycliffe railway works

Built by Hitachi Rail Europe

A problem shared...
I'd been attacking a list of misspellings (insitution > institution) and found myself editing an article on a Malaysian university with the title (Universiti Tunku Abdul Rahman), with not only a typo, but also a word order based on the original Malay. An obvious candidate for moving (probably to "Tunku Abdul Rahman University").

Before doing that, I did a global (Wikipedia) search for "universiti " to see how widespread the problem was (~1,100 hits) - which was more than I was expecting. A bit more digging showed that these were mainly related to a number of Malaysian universities, and that a few related Categories and Templates were also mis-spelt and with an unusual word order (there are also a number of files and categories affected on commons, but we'll come back to that later).

A bit more investigating showed that the main culprit for generating so many typos was a nav box with about 80 blue links, of which about 20 have mis-spelt names. If all the universities listed used the nav box then there should be 1,600 typos (a bit more than were found, but obviously the source of a lot of the typos).

So, I'm guessing that it's ok for me to be bold and rename all these mis-spelt university articles to something like "University of some place in Malaysia" or "some-one/thing commemorated in Malaysia University", and to follow that up with some editing of the article so that the lede starts with "Fully anglicised name (and acronym)" in bold, followed by Malaysian name an acronym as a secondary title.

After that the rest of the article should use the anglicised name or acronym (although there may be occasional justifiable reasons for using the Malay version depending on context).

A good example of a correctly named article is National University of Malaysia which has a good first sentence in the lede but then consistently uses the Malay name...

Please let me know if I'm correct so far in my assumptions about what should be done.

Next, how to achieve this (with the least disturbance to Wikipedia) in a safe and trackable way. I think the best way is:  Move article as mentioned Edit to reflect the new name If present, edit the article to change to  Edit the article so the text uses the English name or acronym where necessary Repeat steps 1 through to 4 for each mis-spelt article Put in a request for speedy renaming of the 7 or so mis-spelt categories (reasons C2A, C2B, and C2D) Fix any remaining occurrences of "universiti" Go for a long lie down in a darkened room...</li> </ol>

Interestingly, the article where I first saw this problem (Universiti Tunku Abdul Rahman) is named after Tunku Abdul Rahman, the first Prime Minister of Malaysia after independence. This is not mentioned in the article (but I've got a source for that so I'll edit that in).

There's still the question of whether the categories and files on commons need renaming, but I'm less familiar with the naming conventions over there, but I do know that you can't do any of it yourself - it's all done by request. At least the procedure above should sort of the errors in Wikipedia and not affect the links to commons.

Many thanks in advance. Robevans123 (talk) 17:45, 7 January 2016 (UTC)

Twmbarlwm

A Log of misspelt University
Fixing a typo (insitution > institution) I realised that the article title was a bit strange (Universiti Tunku Abdul Rahman) so not only including a typo, but with a strange word order. The article should be moved sometime soon. After fixing the original typo I did a search in Wikipedia for "universiti " (so as not to pick up "universities") and found 1,168 hits. Rather more than I was expecting, and thought this was going to be a big job - I'll just fix the typos in the article and come back to the bigger problem later. There were only a couple of fixes required.

A thought was niggling away (along the lines of "hidden text" so I thought I'd do a Google search (which doesn't pick up on hidden text. The search string was

"universiti " -inurl:wiki-User -intitle:Talk -inurl:wiki-Wikipedia -inurl:wiki-WP -"redirected-from" site:en.wikipedia.org

This still gave about 600 hits (still high, but an improvement). Then I noticed that there were a lot of entries beginning "File:... Universiti ...". Ah - there must be a lot of photos on commons that are also misspelt. There were also some results beginning with things like "Category:...", and "Template:...". These wouldn't affect the text that is displayed, and wouldn't be being picked up in the Wikipedia search.

It must be hidden text somewhere. Back to original article - there at the bottom is an unopened nav box listing Malaysian universities... open it up to find a list of about 85 (roughly 60 in blue and spelt correctly, 20 in blue but misspelt, and about 25 redlinked).

If all the blue entries (80 in total) use the nav box (which they should) and the nav box contains 20 typos, then you'll get at least 1600 typos found. Actually a fair bit more than found, so some of the entries probably don't use the nav box.

So there are a number of problems to address:
 * the misspelt article names (including some that need the word order changed as well)
 * the nav box needs to be edited
 * the categories on Wikipedia should be renamed (this needs to
 * perhaps the categories on commons should be renamed
 * the links from an article to a commons category need to be changed
 * ideally, the files on commons need to be renamed (and the links to them updated)

, and also that the introduction started "Universiti Tunku Abdul Rahman (Tunku Abdul Rahman University) is a ...]]. Ah - so the name of the article looks as though it

End Log
A random selection of 20 articles that cite The Railway Magazine:
 * 1 using cite book: Francis Whishaw
 * 6 using cite journal (not necessarily exclusively): LNER Peppercorn Class A1 60163 Tornado, Rother Valley Railway, Benjamin Britten (train), Hull and Selby Railway, Chris Green (railway manager), and Transrail Freight
 * and of the remaining 13 not all necessarily used cite magazine