User:TomTheHand/Unit tests for AWB regexes/General

This section contains regular expressions that make general fixes, not limited to a particular topic or type of unit.

Unicodifying
There may be some cases where the text or HTML may be preferable to Unicode; be careful of those situations.

Vulgar fractions
I feel that in many cases vulgar fractions from sources are worth retaining as Unicode symbols rather than converting to a decimal. For historical articles, vulgar fractions feel appropriate, and they give level of precision that is lost on conversion to a decimal. For example, converting 5⅞ to 5.875 implies precision to the thousandth when you only actually have precision to the eighth. If you were to convert to 5.9 instead, you're losing information and still implying higher precision than the measurement actually provides.

Superscripts
Please read this section of the Manual of Style on Mathematics before using these regular expressions. If the article you are editing uses higher powers as well, use &lt;sup&gt;&lt;/sup&gt; tags, because these Unicode symbols will not match superscripts for higher numbers. If the article only contains ² and ³, and will never contain higher powers, using Unicode symbols can be more compact and easier to understand. An article completely unrelated to mathematics which happens to include an area in km² has no need to support higher powers.