Wikipedia talk:Historical archive/Common words, searching for which is not possible

Outdated?
With the new Lucene search this page seems to have been invalidated? --Dittaeva 09:10, 23 Jun 2005 (UTC)

Old talk
Which means that searching for, say, Lily Brett's novel Just Like That is impossible. Is this page just presenting the status quo or intended to trigger changes? --KF 14:27 Jan 13, 2003 (UTC)


 * Presenting the status quo. I agree that improvements are needed. - Patrick 14:33 Jan 13, 2003 (UTC)


 * I see. But you've already done a very good job by (a) creating this page and (b) linking it to both Help and Searching. --KF 14:39 Jan 13, 2003 (UTC)


 * Another example is a short film produced by South Park's Matt Stone and Trey Parker, called Your Studio And You, in which Steven Spielberg and other bigtime directors made cameo appearances mocking certain annoyances of movie production bureaucracy. I believe it was one of their pre-South Park era creations, actually.  Doubt Wikipedia has an article on it anyway, but just try looking up information on Google for that film.  Every webpage in existence with the word "studio", instead, will be presented to you. --I run like a Welshman 11:35, 8 Dec 2004 (UTC)

These seem like common words, but I was curious how they were determined. Were they based on a word frequency corpus, or were they simply derived intuitively? I was wondering because, if they were derived systematically, this might be a useful list to use at http://wiktionary.org as a template for making lists of common words in other languages. We set up such a page for Basic English words already, but I was thinking this list might also be useful. Thanks. - Brettz9 03:48 Apr 22, 2003 (UTC)


 * I took them from http://cvs.sourceforge.net/cgi-bin/viewcvs.cgi/wikipedia/phpwiki/newcodebase/FulltextStoplist.php?rev=1.1&content-type=text/vnd.viewcvs-markup but do not no how that list was made. - Patrick 06:15 Apr 22, 2003 (UTC)

Isn't this an article title up with which we should not put? DJ Clayworth 18:59, 31 Oct 2003 (UTC)

The current stop word list is the default list built in to MySQL version 4.0.20. I'm currently doing some work to update it, mainly adding new words based on the peculiarities of this site and handling languages other than English. For those unfamiliar with stop words, they are words which are so common that a search for them would produce a uselessly large set of results. For example, until I add it to the stop word list, a search for 'redirect' will return more than 350,000 results. Stop word list is what I'll be using as a base page for this topic. The list in MySQL is documented as being derived from the Cornell University Smart software (smart.11.0.tar.Z), which has a brief description. One of the MySQL employees also asked for the Fulltext index statistics for en Wikipedia, which contains a list of the most frequent 5,000 words in the index (which doesn't include the stop words). Jamesday 13:23, 26 Nov 2004 (UTC)

The MySQL 4.0.20 stop word list is the list which has previously been used. See Stop word list for the ongoing work to improve it. Jamesday 22:17, 28 Nov 2004 (UTC)

noone
"Noone" should be a valid search. It is a common mistake and is not a word. Guanaco 18:20, 12 Dec 2004 (UTC)

Delete
I don't see any point to keeping this page. — Gulliver ✉ 07:11, 5 March 2006 (UTC)

Why is this linked from the standard "search failed" page?
If this page is more than two years out of date, why is it still linked from the WP Special page you get when a search returns no results? ~ MD Otley (talk) 00:57, 24 March 2008 (UTC)