User:Jason Quinn/searches

This page helps me organize my efforts to perform a bunch of small but repetitive editing tasks. Many of them are related to helping eliminate deprecated cite template parameters.

Each section below relates to a seperate small project. It helps to have these links:
 * Help:Searching
 * mw:Help:CirrusSearch
 * Elasticsearch RegEx help
 * RegEx grammar

month
insource:/\|\s*month\s*=\s*\|/ -incategory:Pages_containing_cite_templates_with_deprecated_parameters


 * Dec 09, 2014: 537 -> 498
 * Dec 11, 2014: 498 -> 478
 * Dec 12, 2014: 478 -> 458
 * Dec 13, 2014: 458 -> 399 (also 6 -> 0 for origmonth, which I don't recall ever being a valid parameter)
 * Dec 14, 2014: 399 -> 342
 * Dec 15, 2014: 342 -> 298
 * Dec 16, 2014: 298 -> 249
 * Dec 18, 2014:    -> 178
 * Dec 19, 2014: 178 -> 149
 * Dec 20, 2014: 149 -> 99
 * Jun 23, 2015: 78 ->  65

Cautions: the string "month=" can occur in URLs and some non-cite templates:
 * Certification Table Entry uses "certmonth"
 * Educational assignment uses "month"
 * FishBase genus uses "month"
 * Queen videography uses "month"

coauthor/coauthors
 insource:/\|\s*coauthors?\s*=\s*\|/ -incategory:Pages_containing_cite_templates_with_deprecated_parameters


 * Nov 28, 2014: 2865 -> 2695
 * Nov 29, 2014: 2694 -> 2529
 * Nov 30, 2014: 2597 -> 2398
 * Dec 01, 2014: 2398 -> 2297
 * Dec 02, 2014: 2297 -> 2197
 * Dec 03, 2014: 2299 -> 2099 (went up by 2 overnight)
 * Dec 04, 2014: 2096 -> 1999 (went down by 3)
 * Dec 05, 2014: 1999 -> 1940'ish
 * Dec 06, 2014: 1940'ish -> 1898
 * Dec 07, 2014: 1898 -> 1852
 * Dec 08, 2014: 1852 -> 1796
 * Dec 08–Jan 05, 2015: 1796'ish -> 1762
 * Jan 06, 2015: 1762 -> 1729
 * Feb 23, 2015: 1733 -> 1696
 * Mar 20, 2015: 1691 -> 1678
 * March'ish–Jun 21, 2015: ~1678 -> 1,626
 * July 4, 2015: 1627 -> 1599
 * Sep 21, 2015: 1598 -> 1549
 * Sep 22, 2015: 1549 -> 1499
 * Oct 2, 2015: 1503 -> 1449
 * Oct 4, 2015: 1448 -> 1399
 * ~Oct 7'ish, 2015: 1399 -> 1373
 * Oct 14, 2015: 1373 -> 1334
 * April 6-7, 2016: 1348 -> 1330
 * April 13, 2016 -> 1298
 * April to Feb 13, 2017: 1290'ish -> 1216'ish

Several themes show up: lots of actors and actresses, NBA pages, military articles, Polish, and EU topics. Probably due to particular editors with corresponding interests who were using templates.


 * October'ish 2020 -> November 13: 1400'ish to 1085
 * November 20, 2020 -> 932

(I've been slowly nipping at these since 2017 but only recently started trying to really nail them down again. Maybe I can get them all done in a few months.)

edition

 *  insource:/\|\s*edition\s*=[a-z0-9\t ]*(ed\.|edition)[a-z0-9\t ]*\|/i 
 * finds (most) values containing "ed." or "edition" (case insensitively, note "i" at end of search)


 * Dec 2, 2014: 376 -> 352
 * Dec 5, 2014:     -> 327
 * Dec 14, 2014: 315 -> 1014 (expanded search to include A-Z in character classes and "edition")

page and pages
Note 1: Be careful not to make your regex too general here. For instance page names like "A9" or "9B" (among many other possibilities) might be perfectly valid.

Note 2: It's not documented but I think you can have multiple character ranges in a specified character class, e.g.,, as is common in many regex engines.

Note 3: A character class like [a-z0-9\t ,\.\+\?\*] is more powerful.


 *  insource:/\|\s*pages?\s*=[a-zA-Z0-9\t ]*pp?\.[a-zA-Z0-9\t ]*\|/ 
 * finds values that include strings "p." or "pp."


 *  insource:/\|\s*pages?\s*=[a-z0-9\t ]*(chap|sect)[a-z0-9\t ]*\|/i 
 * finds values that includes string "chap" or "sect" case insensitively (note the "i" at end of search)