User:Headbomb/JCW

Journals Cited by Wikipedia: Evaluating the Impact of Journals on Wikipedia


 * Presenter: Gaëtan Landry (User:Headbomb)
 * Background: Acadian (French-Canadian), born in Grande-Anse, New Brunswick, Canada
 * Work: Physics Lab Instructor (Dalhousie University, Truro, Nova Scotia, Canada)
 * Education: BSc + MSc in Physics (Université de Moncton), P. Phys.
 * Wikipedia: WikiProject Academic Journals, WikiProject Physics, Bot Approvals Group, + a bazillion other things ($200,000$+ edits)

Intro

 * Main page: WikiProject Academic Journals/Journals cited by Wikipedia (Shortcut: WP:JCW)
 * Original Bot/Coder: User:WikiStatsBOT/User:ThaddeusB (2009−10)
 * Current Bot/Coder: User:JL-Bot/User:JLaTondre (2011−present)
 * Main purpose: Help WikiProject Academic Journals prioritize work

What is JCW?

 * A searchable compilation of all journal parameters from templates on en-wiki
 * Based on citations like
 * Ignores named reference repeats like
 * Ignores "manual" citations like
 * Available at WP:JCW (with disclaimers / caveats)
 * By alphabetical order WP:JCW/ALPHA
 * By popularity (count) WP:JCW/POP ← Look at this one
 * By missing entries WP:JCW/MIS
 * By target WP:JCW/TAR
 * Updated twice per month
 * See also WP:MCW for Magazines cited by Wikipedia


 * Designed to identify high-interest areas lacking coverage, and high-interest problem areas
 * Framework gives several side benefits, or puts them within easy reach
 * Creation and identification of redirects
 * Abbreviations J. Phys. &rarr; Journal of Physics
 * Former names Annales de Physique &rarr; European Physical Journal H
 * Common typos/spelling variants Annual Reviews of ... &rarr; Annual Review of ...
 * Categorization of redirects and corresponding targets (more on this later)

Identification/Categorization

 * Basic status
 * Bold = Article
 * Italics = Redirect
 * Underline = Disambiguation
 * Categorization (see exact rules)
 * Hierarchical keyword-based approach
 * ISO 4 > Journals > Magazines > Newspapers > Websites > Books > Databases > Publishers
 * Looks for R from ISO 4 first (extremely reliable)
 * Looks for  disambiguator in title (very reliable)
 * Looks in foobar in categories (reliable)
 * Category keywords are a bit different (plurals/less variations)
 * Looks for foobar in titles (not very reliable)
 * Category keywords are a bit different (plurals/less variations)
 * Looks for foobar in titles (not very reliable)

Filtering

 * Bar is treated as Bar
 * markup and whitespace is stripped and normalized
 * journal is often misused for books, magazines, websites, or contains wrong/extraneous data like authors/publisher/volume/page
 * journal also has many abbreviated variants (standard and non-standard), spelling variants, punctuation variants, typos
 * Redirect creation, cleanup efforts, etc.

Highlights

 * Breakdown
 * 3 databases
 * 1 encyclopedia (Wikipedia)
 * 73 journals (standalone)
 * 12 journal series
 * 8 magazines
 * 3 newspapers


 * Top 100 entries
 * 16 open access journals (1 research-articles only)
 * 36 closed access (pay option for all/most of them)
 * Embargo
 * 06 months – 13 journals
 * 12 months – 27 journals
 * 24 months – 2 journals
 * 36 months – 1 journal
 * ?? months – 1 journal
 * 7 non-journals / indeterminate status


 * Publishers in the top 100
 * Elsevier (×10)
 * Cell Press (×5)
 * Academic Press (×1)
 * Oxford University Press (×7)
 * Nature Publishing Group (×6)
 * Springer Nature (x1)
 * American Society for Microbiology (x3)
 * ACS Publications (×3)
 * John Wiley & Sons (×3)
 * Wiley-VCH (×1)
 * American Physical Society (×2)
 * American Psychological Association (x2)
 * Cold Spring Harbor Laboratory Press (x2)
 * IOP Publishing (×2)
 * Royal Society (×2)
 * Taylor & Francis (x2)
 * Springer Science+Business Media (x1)
 * Springer Nature (x1)
 * 48 others appearing only once