User:Daniel Mietchen/Talks/JATS-Con 2014/Reuse

Databases are not dead ends: build them with reuse in mind

 * JATS was originally designed, and is primarily used, to enable publishing of scholarly work, and to exchange information between authors, publishers, and archives.


 * The needs of those who desire to do large-scale reuse of scholarly content are important!


 * History repeating itself?
 * The PDB example - emphasis on getting content in caused reuse problems later on, and corpus correction was costly.
 * Time to reconsider JATS-related procedures?
 * Other databases?
 * We recommend that the concept of use cases, and that collaboration with suitable use case partners, be adopted and incorporated into the workflows for development of the JATS standard, documentation, and various tools. Possible use cases to start with are:
 * The Open Access Media Importer Bot discussed in our paper,
 * Species disambiguation
 * "an open source, stand-alone software system capable of recognizing and normalizing species name mentions with speed and accuracy, and can therefore be integrated into a range of bioinformatics and text-mining applications."
 * Citation analyses
 * "the set of citing sentences for a given article [..] can be used as a surrogate for the actual article in a variety of scenarios. It contains information that was deemed by peers to be important. "
 * Automated query formulation
 * "We propose improving the formulation of full-text queries by using the open access literature as a proxy for the literature to be searched."

Signalling reusability of cited references

 * On Wikipedia:

Reference
Note the icons and links complementing the bibliographic information.

Outlook

 * PLOS are working on implementing a similar system
 * Both approaches are mindful of the NISO Working Group on Open Access Metadata and Indicators (OAMI)
 * How about implementing something similar elsewhere too?