Wikipedia:Meetup/DC/SIL Wikidata Workshop 2020

The Smithsonian Libraries is hosting a private online event for staff and collaborators to work on creating a data model for the Smithsonian Institution's Wikibase structured data project (under construction), based on Wikidata.

Date
10 June 2020

Time
10:00 EDT (Eastern (US & Canada)) | 16:00 CET (Berlin)

Where
Remote

Register
Participation by invitation only

Agenda

 * Data modeling in Wikibase (understanding items and properties, and how to map your structured data to items/properties)
 * Understanding different data types
 * How to do bulk uploading of data (existing tools and challenges)

Background
The above topics are to help participants working through the data modeling exercises so that the design of Wikibase will meet the Smithsonian Institution's collective needs. The Smithsonian Libraries' Wikidata team has also added a few topics that the team are working through. These may or may not be addressed at the workshop.

Wikibase's data model consists of items and properties. It follows the triple statement structure that items are linked via property. The initial need of the Smithsonian Libraries' Wikibase pilot is 1) to understand the Wikibase data model and 2) to customize Smithsonian-related authority data. We will identity needs for personal, family, geographic and corporate body names, with their necessary mapping from respective in-house databases.

The Smithsonian Libraries has begun a list of over 200 properties potentially useful to the pilot. The team has also identified names from potential collections that are candidates for Wikibase item generation.

Through this workshop, we will learn how to prepare the various types of data that SIL needs for bulk uploading into our Wikibase.

All of the referenced links are found in the (privately shared) Workshop Google document for ease of access.

Facilitators

 * Uncommon fritillary (talk) 17:33, 15 May 2020 (UTC) | Diane Shaw, Special Collections Cataloger, Smithsonian Libraries (facilitator)
 * J Shieh, Descriptive Data Management Librarian, Smithsonian Libraries | § ShiehJ (talk) 12:46, 18 May 2020 (UTC)

Participants
Please note: prior to the workshop, every participant must have:


 * a Wikipedia/Wikidata account with username and password (ask Diane for help if you have questions on how to do this)
 * a general understanding of RDF triple statements in the Wikidata environment. Links in the Wikidata training section of the Meetup page provide basic understanding of Wikidata (structure, creation and editing of items, etc., & linked data)

Discovery Services (DISC): Heidy Berthoud: Head of Resource Description Amy Watson: cataloging technician Mario Rups: librarian Digital Programs and Initiatives (DPI): Suzanne Pilsk: Head, Metadata Services Bess Missell, Librarian, Metadata Services Sue Graves, Metadata Technician, Metadata Services Richard Naples: Technical Information Specialist, Metadata Services Freer Gallery of Art and Arthur M. Sackler Gallery Library (FSG): Yue Shu: librarian for Chinese language Kathryn Phillips: librarian for South Asian, Southeast Asian and Middle Eastern art and architecture Mike Smith: librarian for Korean art, American art, Conservation and Scientific Research, and serials Reiko Yoshimura: head librarian, Freer | Sackler and librarian for Japanese art Smithsonian American Art & Portrait Gallery Library (AAPG): Anne Evenhaugen: head librarian National Portrait Gallery (NPG): Sue Garton, Collections Information Office Office of the Chief Info Officer: (OCIO): Nancy Kennedy (SIRIS/SIAC: Smithsonian Institution Research Information Service/Archives Center      Office of the Undersecretary for Museums and Culture/American Women’s History Initiative      Effie Kapsalis Senior Digital Program Officer   Smithsonian Archives (SIA)          Tammy Peters, Interim Director   National Museum of Natural History (NMNH)      Rebecca Snyder, Acting Informatics Branch Chief   Smithsonian Archives of American Art (AAA)      Tobias (Toby) Reiter: Web Developer      Jess Purkis: Digital Initiatives Archivist

Links of interest:

 * d:Wikidata:WikiProject_Linked_Data_for_Production
 * d:Wikidata:WikiProject_Linked_Data_for_Production/2020-05-15_Wikidata_Working_Hour
 * MediaWiki Data Model Primer https://www.mediawiki.org/wiki/Wikibase/DataModel/Primer
 * Wikidata list of properties List_of_properties
 * Wikidata propbrowse https://tools.wmflabs.org/hay/propbrowse/
 * Collections and Items from User:Ruthbrarian
 * "Integrating Wikidata at the Library of Congress," blog post by Matt Miller
 * "Authority Control Meets Wikibase-GND", Deutsche Nationalbibliothek
 * BnF FNE, French national entities file (authorities) + Wikibase
 * The Koninklijke Bibliotheek's (KB) “Using Wikidata for entity search in historical newspapers”
 * Cradle Tool

SourceMD
SourceMD is the "source metadata" tool for copying citation information to Wikidata.


 * instantly import DOIs and PMIDs
 * batch import paper identifiers, ISBNs, and ORCIDs

Data models
In general, a data model draws the representation of data, "the things", and the relationship between them. The exercises of data modeling are efforts to capture the semantics of an organization's business needs. For instance, in SI environment, the data modeling is conducted for the purpose of communicating both with the community needs and providing architecture for databases and system design. Semantic is important for systems to be able to communicate with one another. Ontologies, thesauri, glossaries serve as means to carry out their respective activities.

In Wikidata, a "data model" is a recommendation for what properties to apply to Wikidata items. Event participants can engage in Wikidata without consulting the below. However, if anyone wants inspiration when editing Wikidata items for people, publications, and institutions, etc., check the below:

Some Wikidata projects of interest to libraries:
 * d:Wikidata:WikiProject Books
 * d:Wikidata:WikiProject Libraries
 * d:Wikidata:WikiProject_Archival_Description
 * d:Wikidata:WikiProject Source MetaData
 * d:Wikidata:WikiProject Authority control
 * d:Wikidata:WikiProject Biographical Identifiers
 * d:Wikidata:WikiProject Q5
 * d:Template:Bibliographic_properties

Other relevant links

 * Smithsonian Libraries