User:John of Reading/AWB settings/Pages using infoboxes with thumbnail images

These settings help to deal with Category:Pages using infoboxes with thumbnail images.

Rules 1 to 5 try to identify an -like parameter followed by a   construction that isn't too unusual - no alt text; size, if specified, is between 200 and 259 pixels; caption, if present, contains no templates or wikilinks that might confuse the regular expressions.

Rule 1 fires when there is no caption, leaving only. This is usually good, but fails when  is followed by something else, such as second image.

Rule 2 tries to guess when a caption can safely be ignored. Typically the image is at the top of the infobox just underneath the name of the person, school or company, so captions such as "John Doe", "Logo of FooBar plc" or "SomeVille High School Logo" can probably be dropped. My "John Doe" example, at right, is not that unusual.

Rule 3 looks ahead for a blank -like parameter, and moves the caption there. This is often correct, but there is no guarantee that the rule has found the correct parameter name.

Rule 4 moves the caption to a following -like parameter so that the two captions can be merged by hand. The guideline is at MOS:CAPTION.

Rule 5 moves the caption to a new  parameter. There is no guarantee that this is the correct parameter name, or that there isn't another parameter of this name somewhere else in the infobox.

Rule 6 tries to draw attention to cases where rules 1 to 5 are inadequate or have decided not to trigger, ready for manual copyediting.

These rules often help towards a correct edit, but there is usually more to do. In particular, if there is an -like parameter, it was being ignored before the edit and should probably be removed. I have been checking both the diff and the preview.

  false false   true    false  <Variants /> <ContextChars>20</ContextChars> </Disambiguation> <Special> <namespaceValues /> <remDupes>true</remDupes> <sortAZ>true</sortAZ> <filterTitlesThatContain>false</filterTitlesThatContain> <filterTitlesThatContainText /> <filterTitlesThatDontContain>false</filterTitlesThatDontContain> <filterTitlesThatDontContainText>/</filterTitlesThatDontContainText> <areRegex>false</areRegex> <opType>0</opType> </Special> <Tool> <ListComparerUseCurrentArticleList>0</ListComparerUseCurrentArticleList> <ListSplitterUseCurrentArticleList>0</ListSplitterUseCurrentArticleList>