User:Pelagic/Journal/2021/05

May–July 2021.

local timestamp: (+10 hours, AEST)

Fri 30 Jul
"Citations are the foundation of Wikipedia’s reliability: they trace the connection between content added by our community of volunteer contributors and its sources. For readers, citations provide a mechanism to validate and check for themselves that what Wikipedia says is sound and trustworthy: they act as a gateway towards a broader ecosystem of reliable knowledge. [Redi et al.]"

"Wikipedia relies on all kinds of sources, including sources whose access is restricted by a paywall. However, open access scientific sources are especially important. Since they do not require payment to access, they are immediately verifiable by a wider number of Wikipedia editors and readers."

(03:29 Sat 31, AEST)

"Wikipedia aims to be an open-access summary of all reliable knowledge—not a summary of only open-access knowledge."

The article by Orlowitz & Stinson also has interesting info about fair-use newspaper clippings at Newspapers.com and Newspaperarchive.com. See also

"The world is cold and lonely and Wikipedia is this generation’s only popular friendly player in nonprofit media. [blueras, ibid.]"

"Lay public access is so great, so disrupting, and so completely outside the professional experience of scientists or the commercial experience of scholarly publishers that they would not even know how to respond to a public demand for this content. [blueras, ibid.]"

My first mentee question
User talk:Pelagic, User talk:I-U-She Thomas

Mix n Match
Made my first catalogue – https://mix-n-match.toolforge.org/#/catalog/4533

The thing is: MnM only imports ID, Name, and Description; it doesn't scrape other properties from the source.

I can't for the life of me find how to get back to the scraper and edit it. (Note: .*? is a non-greedy match.)
 * Level 1: range 483592–483593
 * URL Pattern: https://www.neram.com.au/artwork-details/$1
 * Regex Block: (.*?)
 * Regex Pattern: Name/Title (.*?)  Maker (.*?)  (.*?) Date Made (.*?)
 * ID=$L1, Name=$1, Desc=$4 art work by $2
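To make the non-greedy note concrete, here is a rough Python sketch of what a pattern shaped like the one above does to a page's text. The sample text, the " Medium" terminator, and the scrape helper are all made up for illustration; the real page layout and Mix'n'match's own regex engine are not recorded here, so this only shows the general technique.

```python
import re

# Greedy vs. non-greedy on a generic string.
TEXT = "<b>one</b> and <b>two</b>"
greedy = re.search(r"<b>(.*)</b>", TEXT).group(1)   # runs to the LAST </b>
lazy = re.search(r"<b>(.*?)</b>", TEXT).group(1)    # stops at the FIRST </b>

# Hypothetical flattened page text; not copied from the real site.
SAMPLE = "Name/Title Sample Title  Maker Sample Maker  Painting Date Made 1900 Medium Oil"

# Same shape as the scraper pattern, plus an assumed " Medium" terminator
# so that the last lazy group has something to stop at in this sketch.
PATTERN = re.compile(r"Name/Title (.*?)  Maker (.*?)  (.*?) Date Made (.*?) Medium")

def scrape(page_id, text):
    """Return an ID/Name/Desc record, mirroring ID=$L1, Name=$1, Desc=$4 art work by $2."""
    m = PATTERN.search(text)
    if m is None:
        return None
    name, maker, _kind, date_made = m.groups()
    return {"id": page_id, "name": name, "desc": f"{date_made} art work by {maker}"}

entry = scrape("483592", SAMPLE)
print(greedy)
print(lazy)
print(entry)
```

Only three fields come out, which matches the ID, Name, and Description limit noted above; the third capture group is matched but thrown away.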

Tried making a Follow level that searches https://www.neram.com.au/search-nerams-collections?eHive_query=streeton for pattern https://www.neram.com.au/artwork-details/(\d+), and it only matched two hyperlinks in the test. Saved as https://mix-n-match.toolforge.org/#/catalog/4534.
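A quick way to check a link pattern like this offline is to run it over a saved copy of the search page. The sketch below uses a made-up HTML snippet (the real page markup is an assumption) and a hypothetical helper, extract_ids, to pull out the numeric IDs and drop repeats.

```python
import re

# Same pattern as the Follow level, with the dots escaped for Python's re module.
LINK_RE = re.compile(r"https://www\.neram\.com\.au/artwork-details/(\d+)")

def extract_ids(html):
    """Return the artwork IDs found in html, in page order, without duplicates."""
    return list(dict.fromkeys(LINK_RE.findall(html)))

# Hypothetical markup standing in for a saved search-results page.
SAMPLE_HTML = """
<a href="https://www.neram.com.au/artwork-details/483592">first</a>
<a href="https://www.neram.com.au/artwork-details/483593">second</a>
<a href="https://www.neram.com.au/artwork-details/483592">first again</a>
<a href="https://www.neram.com.au/search-nerams-collections?eHive_query=streeton">search</a>
"""

ids = extract_ids(SAMPLE_HTML)
print(ids)
```

The pattern matches any artwork-details link on the page, wherever it appears, so it cannot tell a search result from a link in a sidebar.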

Surprisingly, after letting it run, it reports 21 results! Seems like it didn't loop across the two results pages like I'd hoped, but it fetched everything from the first page. Unfortunately, in addition to the 12 real results, it also picked out the see-also links for other artists. The item links have the wrong URL; I must have messed up my $1 and $2 (facepalm).

Preliminary matching is understandably hit-and-miss, for example Cremorne matches the place. I wonder, had I set the scraper to make the items instances of, would it have matched more? Two of the good matches are to items that I previously created by hand: and.

There's another danger here for the unwary. Streeton often made two versions of a painting: a smaller plein-air study and a larger studio version. The two usually end up in different museums, but if the data item doesn't have recent holding/owner info, then which one is it? E.g. and.

Then you have paintings with the same name, artist, and year, but very different views of the subject.

In Powershell, I can do

Artstor
https://library.artstor.org/public/26756284 compare.

Errant nonsense

 * Wondering if knowing Dr Ravi lol and thankful to Google
 * MOMOMOM... reverted and blocked

Sun 23

 * There is a wikimedia mailing-list archive running Hyperkitty at https://lists.wikimedia.org/hyperkitty/
 * Until now, I've normally used (Pipermail?) at ...
 * Oh, Freenode!
 * How newbies see templates

(03:29 Mon 24, AEST)
 * They're changing the default search on Commons

Don’t Scale
Do Things That Don’t Scale is about startups, but has some good quotable phrases. I wonder how some of the ideas can transfer to other situations.
 * You can be ornery when you're Scotty, but not when you're Kirk.
 * you'll find that delighting customers scales better than you expected
 * you can and should give users an insanely great experience with an early, incomplete, buggy product, if you make up the difference with attentiveness

Growth features

 * Mentorship Module questions in recent changes
 * Mentorship Panel questions in recent changes
 * Suggested Edits in recent changes
 * https://en.wikipedia.org/wiki/User:Tony1/Build_your_linking_skills

Thu 20

 * Looking through recent newsletters.
 * MediaWiki has a definition for when "a proposed blocker [should] be considered as possibly changing the course of a software project": Wikimedia Product Guidance/Community involvement (by ELappen). Some other interesting views on deployment process there.

Misc, open tabs

 * https://controlleddigitallending.org/
 * http://baskauf.blogspot.com/2021/03/writing-your-own-data-to-wikidata-using_11.html (might have noted this series of blog posts earlier)
 * Strategy/Wikimedia movement/2018-20/Transition/Global Conversations/Lessons Learned
 * https://www.australian-coins.com/collecting-coins/australian-coin-portraits-queen-elizabeth-ii/

2039 rule

 * https://www.gov.uk/government/publications/copyright-notice-duration-of-copyright-term/copyright-notice-duration-of-copyright-term
 * https://www.thebrandprotectionblog.com/the-2039-rule-uk-keeps-millions-of-very-old-unpublished-works-under-copyright-protection-for-another-24-years/

Edit summaries and changeset comments
OpenStreetMap's advice parallels our use of edit summaries. (12:36 Sat 01, AEST) "Since April 21, 2009, users can attach Wikipedia-like edit summaries to their edits" (14:20 Sat 01, AEST)