User:Zoeannl/Help project

=Help @ Wikisource=

=Distributed Proofreaders Transfer=

Making the transition from DP to Wikisource as smooth as possible.
 * DP proofreaders can get more proofreading practice towards P2.
 * Wikisource gets pre-trained proofreaders with consistent standards.
 * DP proofreaders get introduced to wiki skills (html markup, concepts such as transclusion, templates and programming stuff) that could support Post Processing at DP
 * Learning wiki skills at Wikisource can support engagement in Wikipedia. Proofreaders come with a skill-that encourages them to learn how to work in wiki.
 * The support provided to newcomers at DP is exemplary. Help @ Wikisource uses that as a model which may improve retention of new wikipedians. Wikisource would be a good introduction to Wikipedia for those who are overwhelmed by the acquisition of a new skill set- the content is already provided so it is an easier process to learn.


 * '''TO Do: page on DP Wikisource re Wikisource http://www.pgdp.net/wiki/Main_Page

Comparison of Wiki and DP standards of Proofreading and Formatting
DP Proofreading Guide DP Formatting Guide

Introduction
Transferring from Distributed Proofreaders (DP) to Wikisource requires a few adjustments. This guide assumes we will keep DP standards where relevant. Where there is a definite clash, it will point it out. Anyone starting Proofreading at Wikisource, I STRONGLY RECOMMEND you do some training at DP to get the basics under you belt.

Then main differences between DP and Wikisource is the amount of freedom that Wikisource allows. This has its pluses and minuses. In Wikisource each page of a book is Created, Proofread, and Validated. The pages are then Transcluded-joined together to form articles, chapters or books. The pages remain editable, always and forever. This makes it possible to create a book with Annotations, which has huge possible educational and artistic possibilities. The Wikisource policy is to NOT annotate in any way the "Original". Annotated versions are titled "Original Annotated x". The "Original" is true to the printed copy and anything missed in the 2 proofreading passes can be adjusted later. Wikipedia has a philosophy of "give it a go" and has a supporting monitoring system that picks up malingers and beginners to keep editing appropriate.

When Proofreading a page, the expectation is to reproduce the page as closely as you are able. This means, in the DP sense, that Proofreading and Formatting are done in the same pass. Yon can leave the too-hard bits for the Validater. You can have a go-we strongly recommend you use the Preview option before you save. You can ask for help online-the Wiki way is very responsive to specific queries. Often a solution is applied to the page you are working on, for you to study and apply later. It's very much a mucking in approach. Because everything can be edited again if it needs fixing. No pressure.

Wikisource help

 * Find a page to proofread
 * Page navigation
 * Places to ask
 * 'Talking' on wikis
 * Linking pages within your wiki - home, without home


 * Proofreading pages
 * Process - scan/page layout
 * Procedure - scroll cursor through page for spellcheck to work

Proofreaders Summary

 * TO DO:Wiki Proofreaders Summary As per DP http://www.pgdp.net/c/faq/proofing_summary.pdf

Proofreaders Guide

 * TO DO:As per DP

List of significant difference

 * The formaters, post processors -->the validater, the transcluder, and any subsequent editors
 * There is no final work
 * We use  for italics, ' for bold if you need to italicise a ', insert ANSI code for ' that is, &#39
 * Closing of abbreviations (not Acronyms) such as I.e. and E.g, so they are not split over a line. Ref Wikipedia Manual of Style

=Proofreading of pages. === Dictionary and spell check === OCR shortcomings ===Paragraphs
 * italics and bold do not span lines
 * Using the spell check of the browser is sufficient.
 * Bad spelling in the original is indicated by [sic]. The sic template is invisible in read mode, but in edit mode indicates that a previous editor was aware of the error.
 * Outdated, but correct spelling, is left as is.
 * Spelling variations of English words are to be accepted as it is.
 * An alphabetic list of archaically spelled words and proper names collected from the Volumes can be found on this page.
 * An alphabetic list of archaic spellings and proper names collected from Volume 1, can be found on this page, although the list need to be cleaned up.
 * Recommended word reference Wiktionary
 * The OCR process has difficulty in distinguishing certain characters and commonly misreads the following:
 * Words beginning especially beginning with "W are preceded by a double quotation mark. Compare to the original.
 * Short words beginning with 'w' are occasionally garbled as in 'w T here', which is supposed to be 'where'. Correct these by searching for ' w ' surrounded by spaces.
 * Occasionally, the lowercase 'h' is rendered as 'b'.
 * Words containing 'g' is problematic.
 * Words containing 'p' are often rendered as 'jj'.
 * Uppercase 'N' is often rendered incorrectly.
 * The uppercase "R" is often rendered as 'K' 'E', or 'B'. Spell check finds the error, unless the change is a meaningful word.
 * Ligatures.


 * Contrary to the original scan, proofread paragraphs are not indented. However, there are exceptions in poems in which alternate lines are indented, and indented lists, where inserting a table is not warranted. In such cases there are two templates available:
 * Use gap template where there is a wide gap or indent in the text.
 * Use spaces template where there is a short gap or indent in the text.
 * Use the {{tl|Dropped initial) or {{tl|Di}} template to format an article's first letter.
 * The double height row template {{tl|Dhr}} is used in places where 2 or more empty lines separate paragraphs. Such as where there is a gap beteen paragraphs (not just indented)This template also accepts a height specification if the space between two sections {{color|blue| {{|Dhr|4em}} }} indicates a vertical spacing of 4em. Click to see this page and open it in edit mode.
 * If the end of a paragraph is also the end of the page, terminate a page with the {{tl|nop}} template. This prevents the transclusion process from joining this paragraph to the subsequent paragraph. This template must be placed on it's own line and must not be followed by any character or space.

=== Paragraph titles


 * Paragraph titles of monthly recurring sections are CAPITALIZED LETTERS centered, and enclosed with the {{tl|fs85}} template.
 * Paragraph titles of the Editor's Table are also the same font size, but the title is italicized.

=== Paragraph spacing and separators


 * Where a line separates topics in the original, paragraph separators are standardized to be {{tl|rule}} of 4em in length, and padded before and after with {{tl|Dhr}}.

End of topic space {{Dhr}} {{rule|4em}} {{Dhr}} space Start of topic (Header)

=== Hyphenation


 * Hyphenated words, which by themselves are correct, are left as is, being the typesetting style at the time.
 * A hyphen at the end of line is often used to justify the text. Use your judgement if the words should be hyphenated.
 * If the last word of the page is hyphenated, check following page for the complete word and enclose the first part of the hyphenated word at the bottom of the page {{color|blue| {{hyphenated word start|comp|complete}} }} and enclose the second part of the hyphenated word at the top of the following page {{color|blue| {{hyphenated word end|lete|complete}} }} and this will merge the two parts into the word {{color|blue|complete}} when transcluded in the main namespace.
 * The abbreviated form of the hyphenation templates are {{tl|hws}} and {{tl|hwe}} {{sc|Click to see this example}}Pages 16 and 17

=== Single and double quotes


 * Use standard English typewriter double quotes " (ANSI 034) and not international glyphs or the Unicode slanted/curved quotes.
 * Check for matching opening and closing quotes and close up the space between the marks and the enclosed text.
 * There is an occasionally used typographical style applied to a series of paragraphs where the beginning of each paragraph is opened without a closing double quotation mark.
 * For single quotes use the standard English typewrite single quote ' (ANSI 039) They are used to enclose text, within, or in place of, double quotation marks.

=== Typographic characters, ligatures, and symbols
 * Zoom with ctl scroll.
 * In some volumes, symbols and characters are ignored by the OCR. These include the em dash (—), currency symbols ($ and £), the temperature indicator º, and the centered decimal point, etc.
 * Check for {{color|blue| italics }} in the text. Referenced publication names are always italicized.
 * Check for the missing 'em—dash' — (ANSI 0151) character. This is available on the advanced editor toolbar, or, by request can be added to the user's Charinsert preference, or use the {{tl|--}}template.
 * Check for ambiguous text. They may be incorrectly rendered scientific, technical, or currency symbols like, fractions, degrees '°' (ANSI 0176), currency '£' symbols {ANSI 0163} or centered decimal points '·' (ANSI 0183).­­
 * Check for the characters 'ae', and 'oe', which are most likely to be the ligatures of 'æ', (ANSI 0230) and the 'œ' (ANSI 0156). Assumptions can be made of their existence based on the article's subject matter.
 * Characters, symbols and ligatures can also be inserted by using their HTML equivalents. See References for the HTML ANSI codes.
 * All the above mentioned symbols can be had in the "User" selection of the Charinsert gadget. Just post a request in the Scriptorium/help.

=== HTML & ANSI characters and symbols

{{c|This list was copied (without permission) from User:Mpaa's home page.}}

{{Dhr}}

{{Dhr}} ==Typographic formats === References, footnotes and endnotes


 * Use the {{tl|smallrefs}} template in the page footer to render footnotes in small font. Footnote references are automatically numbered.
 * Footnotes which span over pages requires a named reference tag on the the page where it begins {{color|blue| }} and a "follow" reference tag on the subsequest page(s) as {{color|blue| }}. Click to see an example of a footnote spanning two pages.
 * In the main namespace footnotes are converted into numbered endnotes.

=== Poems


 * Poems, without exception, are wrapped starting innermost with the {{tl|fs90/s}} {{tl|fs90/e}} font template, followed by the {{color|blue| }} tags, and then enclosed in {{tl|block center/s}} {{tl|block center/e}} templates.
 * The template order is necessary because the font template line height is not applied to the contents, unless it is the innermost template.
 * The {{tl|block center/s}} template is the most versatile template for multiple paragraphs and page spanning.
 * The {{color|blue| }} tags can't span pages. In poems that span pages the tag must be terminated at the last line of the poem and inserted anew in the following page.

{{fs90/s}} Lorem ipsum dolor sit amet, consectetur adipisicing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. {{fs90/e}}

{{fs90/s}} Lorem ipsum dolor sit amet, consectetur adipisicing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. {{fs90/e}}

== Tables


 * As an introduction, this page contains a written conversation about a particular advanced table design:Page talk:Mexico, Aztec, Spanish and_Republican, Vol 2.djvu/178

=== Layout templates of tables styles with borders used in {{TPSMProject}}.

=== Formatting codes declared in the table header (which affects the whole table)


 * For complete reference List of table style shorthand codes

{{Dhr}} {{Dhr}}
 * mc = centers the table on page the order of values are clockwise Top, Right, Bottom and Left. (margin:0 auto 0 auto;)
 * ar|al|ac|aj = aligns the contents of all cells. Base this universal alignment by the content alignment of the majority of columns.
 * bc = border collapse. If omitted, cell borders are double.
 * bt|br|bb|bl = single line border around the table.
 * |-ac|bb = when declared on a table row indicator, aligns content with a single line bottom border of a row of cells.
 * border/border=1 = single line border of every cell.
 * sm90|lh12 = font size of 90% and the matching line height of 120%.
 * sm85|lh1 = font size of 85% and the matching line height of 100%.
 * pt.5|pb.5 = cell padding of .5em top or bottom.
 * pr1|pl1 = cell padded 1em on the right or left.
 * The template top row is the header for centered column titles with padding top and bottom of the cells.
 * The template second row is the first row padded on the top.
 * The template third row is no padding.
 * The template fourth row is the last table row, padded on the bottom.

=== Table layout for tables with single line borders for various font sizes, and matching line height.

{|{{ts|mc|ar|bc|bt|br|bb|bl}} (standard 100% font size and matching standard line height of 140%)

{{Dhr}} Click to see an example of a single image with two short captions. {{Dhr}}

=== Two separate images with two captions

{|align=center width="430" {{ts|sm85|bc|lh95}}
 * width=210px ||frameless
 * width=10px|
 * width=210px ||frameless

=
Centered captions


 * }
 * }
 * }
 * }

=
Hanging indent justified text captions


 * }
 * }
 * }
 * }

Click to see an example of three images side by side. ==Font sizes
 * Fonts larger than 100% are of no concern. Use any size deemed to be matching the original.
 * For font sizes that are less than 100%, the following list of templates were designed for the project because they include line heights proportional to the font size.

==Linking ===Anchors === Relevant namespaces, a short explanation
 * fs90 is used to enclose Author names,
 * fs90/s fs90/e used to enclose a bloc of paragraphs and/or span pages. When used to span pages, the is placed in the footer of the first page, to terminates the block and  is placed in the header of the following page to begin the new block. This way the transcluded text in the main namespace will be enclosed with a single set of templates because headers and footers are excluded.Click on this link to see an example.
 * fs85 85% font size and 100% line height. - Used exclusively for image captions and subtitled sections of recurring monthly features.
 * fs75 75% font size and 95% line height. - Used to enhance the diversity of font sizes of article sub-titles.
 * fs70 70% font size and 90% line height. - Used inline to match the line height of fraction templates and.
 * Named font templates are not used in the PSM project
 * Link to the 100% and smaller font size and style comparisons table.


 * Index namespace - Container storing the individual pages of a book.
 * Page namespace - page by page storage contained by the Index container. It is where most of the work takes place, the proofreading of the pages.
 * Transclusion process - links the proofread pages and the Main namespace.
 * Main namespace - Assembled display of the transcluded pages from the Page namespace.

=== Titles


 * Article titles are set by templates. See list of title templates in the Reference section.
 * This page provides examples of the article title, author, subtitles and Roman numeral paragraph numbering.

=== Author names


 * The main title is followed by the author's name for which there is no template. The name is enclosed in the small caps template, then the set to font size is 90% always using the fs90 template and then centered on the page.

=== Secondary title font sizes


 * If there is a subtitle below the author's name, it is centered and wrapped with the is 75% font size, using the fs75 template.
 * If there is a secondary subtitle, center it and wrap it with a font size comparable to the original. This may be 100% or if smaller, use the 85% font size fs85 template.
 * This page has one main and five sub titles. Otherwise, article titles consist of one main and, at most, three sub-titles. Since the styles differ, there is good visual contrast, even when the font-size difference is less than 10%. New articles can start anywhere on a page.

== Tags


 * When a page specific code is required, the .djvu number of the page is used because it guarantees uniqueness. Printed page numbers are not unique or accurate. If the code contains no spaces, then enclosing the code with quotes can be omitted.

=== Section tags


 * The codes are made up of the following segments:


 * End of article begin and end section code segments:

E = End of article 27 = .djvu page number


 * The article following on the same page uses the code segments, except prefixed by 'B' to indicate the beginning section of the article.

B = Beginning of article 27 = .djvu page number


 * Click to see an example of the above coding scheme in edit view of .djvu page 27/Page 17. The final results are visible on this transcluded page.

=== Anchors and how they are numbered


 * Consider all article titles, and monthly recurring sections to be anchored for linking to the index at the back of the volume. Click to see an example of anchoring - open page in edit mode.
 * The anchor is made up of the .djvu number of the page preceded by the uppercase "D" without space, and followed by the hyphenated number beginning with "1" indicating the order of appearance on the page. Click to see an example of two anchored titles on the same page.


 * Articles main title anchors are always "1" as their order of appearance because two titles never appear on the same page.
 * Recurring monthly section titles are "0", as in If there are two recurring sections on a page, (which does happen), the second is identified with two zeros, "00" as in . The reason for this is that recurring monthly section titles can fall between two anchored paragraphs of two different sections.
 * Recurring monthly sections can have several anchors on the same page and they are numbered in their order of appearance from top to bottom as in ,, etc., regardless of which section they belong to. Thus the anchors always follow the order of appearance.Click to see an example of anchoring - open page in edit mode.

=== Volume index anchors


 * Volume index anchors are generated from the same database as the article titles. Since their order of appearance on the page is not known, they are always assigned "1". After the titles are anchored, Most of the links are correct and the rest are edited while the index is in edit preview mode, which permits editing and testing the link simultaneously.


 * Attempt was made to use Templates almost exclusively. HTML tags are limited to the minimum where a template wasn't available or possible.