User talk:Andrew Gray/Archives/121

sports problem
Hi, Andrew! I was reading the threads about your analysis at GGTF and WiR, and I started to kind of pull the discussion off track there with my statistical nerdiness so I thought I'd ask here. I was just astounded that 46% of BLPs are athletes. I'm trying to reconcile this in my head, because I'm reeling. Can it possibly be that almost half of BLPs are notably primarily due to their athletic career? That is, this isn't Donald Trump being coded as a golfer and therefore part of that 46%? If 46% of wikipedia's BLPs are for people who are notable primarily for their athletic career, that's not a gender problem, it's a sports problem. --valereee (talk) 09:58, 18 June 2019 (UTC)


 * It's really quite a startling figure, isn't it? I vaguely suspected it would be high, but nothing like that level. There are 536k athlete articles in total - 33% of all biographies, and about 9% of all articles. (It's a higher proportion of BLPs than all biographies because of the strong recency bias)
 * One caveat is that our definition of "athlete" is a little debatable. The Wikidata ontology, which we're using here, considers dancers to be athletes, which I guess is something you could argue either way; ditto chess players, racing drivers, etc. I would estimate no more than 20-40k people fall into these "maybe or maybe not what we mean when we say athlete" groups; the headline figure is probably reasonable, give or take 5-10%.
 * The second caveat is the one you raise - are we picking up people who are really notable for other reasons? In theory the Wikidata "occupation" field should not be used for people unless it is a significant part of their life - my feeling is that there will inevitably be some miscategorisation here, along the lines of your golf example, but it's relatively low level.
 * I've done some tests by looking for people who have some kind of other non-athletic occupation listed as well. Other than things like "football manager", the most common non-sport-related single occupation was "politician" (2.5k) or "actor" (3.5k, and a third of those are dancers). Given that some athletes do go on to become politicians (or even vice versa), this seems like a plausible sort of number and suggests the classification is reasonably clean.
 * Looking at the breakdown by field, I make it 171k (association) footballers, 33k (gridiron) footballers, 32k "athletes" in the more restrictive sense, 31k cricketers, 26k baseball players, 20k rugby players, 20k basketball players, 19k ice hockey players. Those groups together give us about two thirds of the total - there will be some overlap for individuals who're listed twice, of course, but it highlights the sort of thing that drives the numbers - large well-documented team sports. And of those, by far the largest proportion seems to be (historic?) football players... Andrew Gray (talk) 22:51, 18 June 2019 (UTC)
 * As I asked at WiR talk, where does this 46% figure come from? I'm not seeing it at all. The Denelezh figures look like 22% "sports figures" to me, & that's not BLPs, but people born after 1800. So, no, it can't possibly be. We have to be careful with things like this, as the next thing you know, claims like this are plastered over the world's media, and believed. Johnbod (talk) 00:03, 19 June 2019 (UTC)
 * , I actually think 'wikipedia has a sports problem' would be a better thing to be plastered all over the world's media than 'wikipedia is sexist.' --valereee (talk) 10:24, 19 June 2019 (UTC)
 * Not if it's 'wikipedia has a sports problem (that's twice as big as it actually is)'. Plus I think you'll find journalists and editors won't be very interested in that story. Can we confirm what the actual figure is? To avoid misleading quick skimmers, it should be corrected where you have posted it. Johnbod (talk) 13:27, 19 June 2019 (UTC)
 * I haven't written this up properly yet - hoping to find some time soon - but it followed on from the gender work here, when I was trying to extend it with some numbers on occupational subgroups. (It worked for politicians and athletes, broke down for researchers, & as I was most interested in them I didn't push further). A worked set of numbers follows, to make it clear how I got there...
 * As you note, denelezh can't get us "living people". For all enwiki biographies, denelezh gives us 536,346 athletes out of 1,632,072 people - ~33% of all biographies on enwiki. The 22% figure you've got, I think, is the share of "all biographies with any sitelink" which are athletes. (Apparently other projects are less sports-oriented than we are, though "any sitelink" includes Commons & Wikisource so those will probably inflate the share for artists/writers). NB these numbers do include pre-1800 data, it's just that it's not shown on the breakdown lines.
 * To find the numbers for BLPs, I looked at the intersection of articles identified as athletes using the same process as denelezh (occupation tagging in Wikidata), and articles in Category:Living people. Overall, I got 64,871 matches for "female athletes", and 348,968 for "male athletes"; total 413,839. At the time I did this there were 906,720 entries in Category:Living people, so 45.64% of BLPs are identifiable as athletes, which rounds up to our 46% figure. (I was only looking at M/F ratios so this does not include anyone not coded as M/F, but I do not anticipate that would change the overall figures substantially - at most ±0.1%).
 * As valereee noted, there is still the underlying question of people who are counted as "sports" but who we're mostly interested in for other reasons. It's hard to say for sure how many of these there might be, but my feeling is that it's not overwhelmingly high; the relatively low rates of overlap between athletic and non-athletic occupation entries seem to support that. It would be quite reasonable to round down the overall numbers to take account of this, but even with a generous estimate for "people who are primarily famous in other fields, I wouldn't think it would get much below, say, ~40% of BLPs. I'll see if I can think of more rigorous ways to investigate this. As I said, these numbers did surprise me somewhat, but overall, I find them reasonably plausible. Andrew Gray (talk) 20:37, 19 June 2019 (UTC)
 * Thanks, Andrew, but I must say I find it very hard to believe a figure this high. I wonder if there is a double-counting issue? Have you accounted for the lack of death dates problem? I suspect this affects athletes much more than other categories, for obvious reasons. What is the athletic % in the over-95 group for example, or over 100? One often finds people missing from Category:Living people btw - many editors don't add it.  But thanks for doing this stuff, no doubt there are many wrinkles we will learn to identify.  It's certainly worth nailing a correct figure.  Johnbod (talk) 21:31, 19 June 2019 (UTC)
 * Other than the issue of including people who're "not primarily athletes", there shouldn't be any double-counting based on people being athletes in multiple different ways - the methodology spits out a single list of distinct WP page titles/IDs and I've confirmed they're all unique.
 * WRT dates, I've only tested against the presence of Category:Living people and trusted to that being reasonably well maintained - at least, probably better maintained than the alternatives! I definitely agree that "we don't know if they're still living" may be an issue, particularly with all the "man who played two games for Partick Thistle in 1951 and then became a welder" type articles. I'll see if I can work out some way to isolate that group (say "born before 1940 or active before 1960, believed living"?) and run the stats seperately, though it may be tricky to do so. Andrew Gray (talk) 11:46, 20 June 2019 (UTC)

Wikidata weekly summary #370
Here's your quick overview of what has been happening around Wikidata over the last week. 
 * Discussions
 * Closed request for comments: semi-protection to prevent vandalism on most used Items


 * Events
 * Upcoming: 1st Iberoamerican Knowledge Graphs and Semantic Web Conference keynote on Wikidata by José E. Labra Villa Clara, Cuba 24-28 June
 * Upcoming: Wikidata meeting for researchers in Sfax University, Tunisia, 25-27 June 2019
 * Upcoming: Wikidata meetup in London, June 29th
 * Upcoming: July 3rd: UGent Wikidata and Wikibase Workshop 2019


 * Press, articles, blog posts
 * Demonstrating Spindra: A Geographic Knowledge Graph Management System, Yuhan Sun, et al (in 2019 IEEE 35th International Conference on Data Engineering)
 * Comparing DBpedia, Wikidata, and YAGO for Web Information Retrieval, Sini Govinda Pillai, et al.
 * WikiDataSets: Standardized sub-graphs from WikiData, Armand Boschin, in ArXiv
 * Ordia: A Web application for Wikidata lexemes, Finn Årup Nielsen
 * Combining embedding methods for a word intrusion task Finn Årup Nielsen, et al
 * Query expansion using Wikidata attributes’ values Sarah Dahir, Abderrahim et al., in ICCWCS'19
 * Automatic Question Generation based on MOOC Video Subtitles and Knowledge Graph Lin Ma, Yuchun Ma, in ICIET 2019
 * Wikidata as a linked-data hub for Biodiversity data, Andra Waagmeester, et al.
 * Using Crowd-curation to Improve Taxon Annotations on the Wikimedia Infrastructure, Andra Waagmeester, et al.
 * ''Using Shape Expressions (ShEx) to Share RDF Data Models and to Guide Curation with Rigorous Validation, et al. Best in-use paper award ESWC
 * From “an” Identifier to “the” Identifier, Theo van Veen
 * Developing workflows for local authority file conversion from MARC to Wikidata


 * Other Noteworthy Stuff
 * Structured Data on Commons: qualifiers for depicts support have been enabled on June 20th
 * Result format change for Query Service JSON query output
 * Filename scheme for Wikidata RDF entity dumps will change
 * The development of Wikidata Bridge (editing Wikidata's data from Wikipedia) started
 * CheckShex userscript adds an in-page way of checking if an item fits an Entityschema


 * Did you know?


 * Newest properties:
 * General datatypes: has written for, motif represents, solar irradiance, effective temperature
 * External identifiers: JIS standard, Réunionnais du monde ID, Corporate Number (South Korea), dbSNP ID, digilibIT author ID, Digital Prosopography of the Roman Republic ID, eBiodiversity ID, eurobasket.com coach ID, euroleague.net coach ID, Gamepedia Wiki ID, Hoopla artist ID, Hoopla publisher ID, Latvian National Encyclopedia Online ID, ECI Lok Sabha constituency code, IntraText author ID, Musixmatch artist ID, MAHG ID, Amburger database ID, ATP tennis tournament edition ID, Rugby League Project ID (general), RIAA artist ID


 * New property proposals to review:
 * General datatypes: extracted from, Instagram hashtag, Historical Archives of the European Union ID, number of pins, number of pin positions, Representative in legislature, subscribers
 * External identifiers: nchdb asset id, Biblioteca Nacional Aruba ID, Bangladesh administrative division code (2017-), Goodreads series ID, Heritage Gazetteer of Cyprus, Retrosheet ID, The DJ List artist ID, BBC artist ID, The Independent topic ID, NME artist ID, Metro topic ID, MangaSeek person ID, Soccerway coach ID, Nederlandse Top 40 artist ID, Dutch Charts artist ID, hitparade.ch artist ID, Aviation Safety Network Wikibase ID, Find & Connect ID, SA Flora ID, ATRF ID, geograph, ArtBrokerage artist ID, Indian gallantry awardee ID, Scandipop topic ID, iTunes movie collection ID, SLNSW unpublished item ID, LB.ua dossier, FPBR person ID, RBF amateur boxer ID, RBF professional boxer ID, Roskomnadzor media license number, 2014 Commonwealth Games athlete ID, Music certifictions ID, El portal de Música artist ID


 * Query examples:
 * [https://query.wikidata.org/embed.html#SELECT%20%3Ftotal%20%3Fcount%20%3Fdepicts_class%20%3Fdepicts_classLabel%20%3Fprop%20%3FpropLabel%20%3Fsamp%20%0A%0AWITH%20%7B%0A%20%20SELECT%20%3Fitem%20%3Fdepicts_stmt%20WHERE%20%7B%0A%20%20%20%20%20%20%3Fitem%20p%3AP180%20%3Fdepicts_stmt%20.%0A%20%20%7D%20%23%20LIMIT%20100000%0A%7D%20AS%20%25stmts%0A%0AWITH%20%7B%0A%20%20SELECT%20%3Fprop%20%28COUNT%28DISTINCT%28%3Fdepicts_stmt%29%29%20AS%20%3Ftotal%29%20WHERE%20%7B%0A%20%20%20%20INCLUDE%20%25stmts%20.%0A%20%20%20%20%3Fdepicts_stmt%20%3Fqual%20%5B%5D%20.%0A%20%20%20%20%3Fprop%20wikibase%3Aqualifier%20%3Fqual%20.%0A%20%20%7D%20GROUP%20BY%20%3Fprop%0A%7D%20AS%20%25totals%0A%20%20%20%20%0A%0AWITH%20%7B%0A%20%20SELECT%20%3Fdepicts_class%20%3Fprop%20%28COUNT%28DISTINCT%28%3Fdepicts_stmt%29%29%20AS%20%3Fcount%29%20%28SAMPLE%28%3Fitem%29%20AS%20%3Fsamp%29%20WHERE%20%7B%0A%20%20%20%20INCLUDE%20%25stmts%20.%0A%20%20%20%20%3Fdepicts_stmt%20ps%3AP180%20%3Fdepicts_val%20.%0A%20%20%20%20%3Fdepicts_val%20wdt%3AP31%3F%20%3Fdepicts_class%20.%0A%20%20%20%20%3Fdepicts_stmt%20%3Fqual%20%5B%5D%20.%0A%20%20%20%20%3Fprop%20wikibase%3Aqualifier%20%3Fqual%20.%0A%20%20%7D%20GROUP%20BY%20%3Fdepicts_class%20%3Fprop%0A%20%20ORDER%20BY%20DESC%28%3Fcount%29%20%0A%20%20LIMIT%205000%0A%7D%20AS%20%25data%0A%0AWITH%20%7B%0A%20%20SELECT%20%3Fdepicts_class%20%3Fprop%20%28COUNT%28DISTINCT%28%3Fdepicts_class1%29%29%20AS%20%3Frank%29%20WHERE%20%7B%0A%20%20%20%20%20INCLUDE%20%25data%20.%0A%20%20%20%20%20%7B%0A%20%20%20%20%20%20%20%20SELECT%20%28%3Fdepicts_class%20AS%20%3Fdepicts_class1%29%20%3Fprop%20%28%3Fcount%20AS%20%3Fcount1%29%20WHERE%20%7B%0A%20%20%20%20%20%20%20%20%20%20%20INCLUDE%20%25data%20.%0A%20%20%20%20%20%20%20%20%7D%0A%20%20%20%20%20%7D%0A%20%20%20%20%20FILTER%20%28%3Fcount1%20%3E%3D%20%3Fcount%29%20.%0A%20%20%20%7D%20GROUP%20BY%20%3Fdepicts_class%20%3Fprop%0A%7D%20AS%20%25ranks%20%0A%20%20%20%20%20%20%20%20%20%20%20%20%0AWHERE%20%7B%0A%20%20INCLUDE%20%25data%20.%0A%20%20INCLUDE%20%25ranks%20.%0A%20%20INCLUDE%20%25totals%20.%0A%20%20FILTER%20%28%3Frank%20%3C%2011%29%20.%0A%20%20SERVICE%20wikibase%3Alabel%20%7B%20bd%3AserviceParam%20wikibase%3Alanguage%20%22%5BAUTO_LANGUAGE%5D%2Cen%22.%20%7D%0A%7D%20ORDER%20BY%20DESC%28%3Ftotal%29%20%3Fprop%20DESC%28%3Fcount%29%0A Most commonly used qualifiers on depicts (P180) statements on Wikidata, with top 10 classes or items that each one is used on] (source)
 * Most commonly used properties connecting military people to military organisations (source)
 * Bubblechart of most frequent WMF import sources
 * French male first names ending with "ée" (source)
 * Newest WikiProjects: WikiProject Australia
 * Newest database reports: Property completion by country leaderboard


 * Development
 * Beginning of wb_terms migration (T221764)
 * Fixed a bug with phan on Wikibase (T226083)
 * Enable bugfix for wbeditentity setting aliases to empty array (T223303)
 * Fix an issue with pipe character on some special pages (T223270)
 * Fix an issue with removed constraint still displayed on Lexeme (T223372)
 * Make edits to EntitySchema pages autopatrolled (T224495)
 * Work on creating better edit summaries for wbeditentity API endpoint (T224010)
 * Deploy the work environment for new mobile termbox
 * Start setting up the technical environment for Wikidata bridge

You can see all open tickets related to Wikidata here. If you want to help, you can also have a look at the tasks needing a volunteer.


 * Monthly Tasks
 * Add labels, in your own language(s), for the new properties listed above.
 * Comment on property proposals: all open proposals
 * Suggested and open tasks!
 * Contribute to a Showcase item.
 * Help translate or proofread the interface and documentation pages, in your own language!
 * Help merge identical items across Wikimedia projects.
 * Help write the next summary!

Read the full report &middot; Unsubscribe &middot; Lea Lacroix (WMDE) 15:21, 24 June 2019 (UTC)

Articles you might like to edit, from SuggestBot
Note: All columns in this table are sortable, allowing you to rearrange the table so the articles most interesting to you are shown at the top. All images have mouse-over popups with more information. For more information about the columns and categories, please consult the documentation and please get in touch on SuggestBot's talk page with any questions you might have.

SuggestBot picks articles in a number of ways based on other articles you've edited, including straight text similarity, following wikilinks, and matching your editing patterns against those of other Wikipedians. It tries to recommend only articles that other Wikipedians have marked as needing work. We appreciate that you have signed up to receive suggestions regularly; your contributions make Wikipedia better — thanks for helping!

If you have feedback on how to make SuggestBot better, please let us know on SuggestBot's talk page. -- SuggestBot (talk) 23:32, 17 June 2019 (UTC)

Articles you might like to edit, from SuggestBot
Note: All columns in this table are sortable, allowing you to rearrange the table so the articles most interesting to you are shown at the top. All images have mouse-over popups with more information. For more information about the columns and categories, please consult the documentation and please get in touch on SuggestBot's talk page with any questions you might have.

SuggestBot picks articles in a number of ways based on other articles you've edited, including straight text similarity, following wikilinks, and matching your editing patterns against those of other Wikipedians. It tries to recommend only articles that other Wikipedians have marked as needing work. We appreciate that you have signed up to receive suggestions regularly; your contributions make Wikipedia better — thanks for helping!

If you have feedback on how to make SuggestBot better, please let us know on SuggestBot's talk page. -- SuggestBot (talk) 23:38, 24 June 2019 (UTC)

The June 2019 Signpost is out!
 * Read this Signpost in full * Single-page * Unsubscribe * MediaWiki message delivery (talk) 15:51, 30 June 2019 (UTC)

Wikidata weekly summary #371
Here's your quick overview of what has been happening around Wikidata over the last week. 
 * Events
 * Upcoming: July 3rd: UGent Wikidata and Wikibase Workshop 2019
 * Upcoming: OSM TW x Wikidata TW Meetup, June 3rd, Taipei, Taiwan - Facebook event
 * Upcoming: Wikidata Lab XVII in São Paulo, Brazil, July 11th
 * Upcoming: Inventaire & OpenData Week 2019 in Wurzen, Germany, July 18th-25th
 * State of the Map 2019: early bird tickets are available until July 7th. The program will include a Wikidata workshop


 * Press, articles, blog posts
 * Placing EveryPolitician on hold by MySociety ("it's clear that Wikidata should be the natural global home for this type of data")
 * Celebrity Profiling by Matti Wiegmann et al.,
 * Published in ArXiv:
 * DocRED: A Large-Scale Document-Level Relation Extraction Dataset by Yuan Yao et al.
 * ConTrOn: Continuously Trained Ontology based on Technical Data Sheets and Wikidata by Kobkaew Opasjumruskit et al.
 * Wikidata as opportunity for special collections: the 20th Century Press Archives use case. Presentation at LIBER 2019 (LOD working group of the Association of European Research Libraries), by Joachim Neubert


 * Other Noteworthy Stuff
 * The Wikibase community Telegram group has been recreated, here's the invitation link


 * Did you know?


 * Newest properties:
 * General datatypes: LilyPond notation, target muscle, historical region, writing language, match interval, microarchitecture, literacy rate, era name, seconded by, moved by
 * External identifiers: MELPA package ID, Nchdb asset ID, National Film Board of Canada director identifier, Fundamental.org Baptist Church ID, Offizielle Deutsche Charts artist ID, Beatport artist ID, Bangladesh administrative division code (2017-), Djshop artist ID, NeoGeoSoft ID, New York City Neighborhood Tabulation area ID, NicoNicoPedia, NooSFere edition ID, ArtBrokerage artist ID, ATRF ID, Bebo profile ID, National Library of Aruba ID, BVLarramendi ID, Cameo ID, Charts in France artist ID, CIN ID, Dutch Charts artist ID, Equipboard artist ID, Facebook Gaming game ID, Gaana.com artist ID, Gambay ID, Heritage Gazetteer of Cyprus, Historical Archives of the European Union ID, hitparade.ch artist ID, Indian gallantry awardee ID, Juno Download artist ID, MangaSeek person ID, Metro topic ID, Moov artist ID, Murfie artist ID, Musicalics composer ID, Nederlandse Top 40 artist ID, NME artist ID, PCE Daisakusen ID, CUT code, Pro Football Hall of Fame ID, Repology project name, Rogerebert.com film ID, SA Flora ID, Syriac Biographical Dictionary ID, The DJ List artist ID, Who's Who of American Comic Books ID, SNBP ID


 * New property proposals to review:
 * General datatypes: double major, Poster, Throwing handedness, Batting handedness, Institut-ID in der Unternehmensdatenbank der BaFin, Wolfram Language unit code, National Transportation Safety Board report, enemy of, PHI Latin Texts author ID, default description for instances
 * External identifiers: ACNP journal ID, Personnel de l'administration préfectorale ID, Bursa Malaysia stock code, etrain.info station ID, Rupa Publications author ID, Pro Kabaddi League player ID, OYO Hotel ID, Anime Characters Database, IMFDB work ID, Australian Faunal Directory publication ID, Ident.Nr., Sverigetopplistan artist ID, Musiikkituottajat artist ID, Radio Courtoisie show ID, Biyografya ID


 * Query examples:
 * Nouns without grammatical gender in a chosen language (source)
 * Current UK Members of Parliament ranked by number of Wikimedia sitelinks (source)
 * Architectural drawings depicting cathedrals in France (source)
 * Music bands named after Tolkien's universe, by genre (source)
 * Newest database reports: education by country

You can see all open tickets related to Wikidata here. If you want to help, you can also have a look at the tasks needing a volunteer.
 * Development
 * More work on preparing the technical setup for Wikidata Bridge development
 * Implement Feature Flag for Wikidata Bridge (T225935)
 * Discuss the best way to connect template parameter to corresponding Wikidata property (T224832)
 * Improve the documentation for Wikibase sites setup (T218282)
 * Prepare the last steps for reading from new term store implementation (T225603) according to the migration plan (T221765)
 * Fixed a bug and reduced significantly the amount of addUsagesForPage jobs in our jobs queue (T205045)
 * Removed feature flag for the bugfix for wbeditentity setting aliases to empty array (T223305)
 * Decreased EntityUsageTable addUsage batch size to 100 (T225500)
 * Configured query service lag for Wikidata maxlag (T222193)
 * Helped fixing an issue with moving files on Commons (T226672)


 * Monthly Tasks
 * Add labels, in your own language(s), for the new properties listed above.
 * Comment on property proposals: all open proposals
 * Suggested and open tasks!
 * Contribute to a Showcase item.
 * Help translate or proofread the interface and documentation pages, in your own language!
 * Help merge identical items across Wikimedia projects.
 * Help write the next summary!

Read the full report &middot; Unsubscribe &middot; Lea Lacroix (WMDE) 15:35, 1 July 2019 (UTC)

WikiCup 2019 July newsletter
The third round of the 2019 WikiCup has now come to an end. The 16 users who made it to the fourth round needed to score at least 68 points, which is substantially lower than last year's 227 points. Our top scorers in round 3 were:


 * 🇳🇫 Cas Liber, our winner in 2016, with 500 points derived mainly from a featured article and two GAs on natural history topics
 * Adam Cuerden, with 480 points, a tally built on 16 featured pictures, the result of meticulous restoration work
 * SounderBruce, a finalist in the last two years, with 306 points from a variety of submissions, mostly related to sport or the State of Washington
 * 🇺🇸 Usernameunique, with 305 points derived from a featured article and two GAs on archaeology and related topics

Contestants managed 4 (5) featured articles, 4 featured lists, 18 featured pictures, 29 good articles, 50 DYK entries, 9 ITN entries, and 39 good article reviews. As we enter the fourth round, remember that any content promoted after the end of round 3 but before the start of round 4 can be claimed in round 4. Please also remember that you must claim your points within 14 days of "earning" them, and it is imperative to claim them in the correct round; one FA claim had to be rejected because it was incorrectly submitted (claimed in Round 3 when it qualified for Round 2), so be warned! When doing GARs, please make sure that you check that all the GA criteria are fully met.

If you are concerned that your nomination—whether it is at good article nominations, a featured process, or anything else—will not receive the necessary reviews, please list it on WikiCup/Reviews Needed (remember to remove your listing when no longer required). Questions are welcome on Wikipedia talk:WikiCup, and the judges are reachable on their talk pages or by email. Good luck! If you wish to start or stop receiving this newsletter, please feel free to add or remove your name from WikiCup/Newsletter/Send. Godot13 (talk), Sturmvogel 66 (talk), Vanamonde (talk) and Cwmhiraeth (talk). MediaWiki message delivery (talk) 20:11, 2 July 2019 (UTC)

Articles you might like to edit, from SuggestBot
Note: All columns in this table are sortable, allowing you to rearrange the table so the articles most interesting to you are shown at the top. All images have mouse-over popups with more information. For more information about the columns and categories, please consult the documentation and please get in touch on SuggestBot's talk page with any questions you might have.

SuggestBot picks articles in a number of ways based on other articles you've edited, including straight text similarity, following wikilinks, and matching your editing patterns against those of other Wikipedians. It tries to recommend only articles that other Wikipedians have marked as needing work. We appreciate that you have signed up to receive suggestions regularly; your contributions make Wikipedia better — thanks for helping!

If you have feedback on how to make SuggestBot better, please let us know on SuggestBot's talk page. -- SuggestBot (talk) 23:25, 1 July 2019 (UTC)

Wikidata weekly summary #372
Here's your quick overview of what has been happening around Wikidata over the last week. 
 * Discussions
 * Open request for adminship: ZI Jony


 * Events
 * Upcoming: Wikidata IRC office hour, Tuesday, June 16th, at 16:00 UTC (18:00 Berlin time), on the #wikimedia-office channel
 * Past: Celtic Knot Conference 2019 - minority languages on Wikimedia projects. Slides to be added in the Commons category


 * Press, articles, blog posts
 * A new Wikidata map and comparison with previous maps, by Addshore
 * The story of building the inteGraality tool at the Wikimedia Hackathon, by JeanFred
 * Des identifiants ouverts pour la science ouverte by the Comité pour la Science ouverte (Open Science Committee), June 2019. Extract: "Since 2012, the Wikidata database has gradually become the global point of convergence for open identifiers." (Depuis 2012, la base Wikidata est devenue progressivement le point de convergence mondial des identifiants ouverts.)
 * Lydia Pintscher submitted a chapter for the collaborative book Wikipedia@20, called Wikidata - Wikipedia’s not so little sister is finding its own way. Feel free to have a look, review is open until July 19th
 * You can also check out Denny Vrandečić's submission about Abstract Wikipedia and Wikidata


 * Other Noteworthy Stuff
 * Because of a server switch, Wikidata will be in read-only mode on July 30th from 05:00 to 05:30 AM UTC (T227063)
 * Change in the name pattern of new Wikidata RDF dumps: starting on July 15th, dumps will have a new name format (T226153)
 * Reminder for tool builders: Python tools should use a user-agent to access the Query Service. If you recently encountered issues with the Query Service, please check that your tool is compliant to User-Agent policy.
 * New user script for lexicographical data to add Forms on Lexemes that don't have any, by suggesting and filling out templates


 * Did you know?


 * Newest properties:
 * General datatypes: animator, video system, announcement date, access status, literary motif, review of, middle family name
 * External identifiers: FPBR person ID, PHI Latin Texts author ID, VG-lista artist ID, Bloodhound ID, Steam profile ID, Find & Connect ID, Goodreads series ID, Scandipop topic id, 2014 Commonwealth Games ID, El portal de Música artist ID, Balochistan Education Management Information System code, Khyber Pakhtunkhwa Education Management Information System code, Punjab Education Management Information System code, Sindh Education Management Information System code, IMVDb artist ID, Institute-id in the BaFin company database, Musiikkituottajat artist (certyfication) ID, IFPI Austria artist ID, IFPI Danmark artist ID, SNEP artist ID, hitparade.ch artist (certyfication) ID, BVMI artist ID, Music Canada artist ID, IFPI Norge artist ID, Napster artist ID, Personnel de l'administration préfectorale ID, RBF professional boxer ID, Retrosheet ID, SLNSW unpublished item ID, LB.ua dossier ID, ACNP journal ID, Australian Faunal Directory publication ID, Sverigetopplistan artist ID, Rupa Publications author ID, OYO Hotel ID, Hungarian National Namespace place ID, Hungarian National Namespace person ID, Hungarian National Namespace organisation ID


 * New property proposals to review:
 * General datatypes: catalogue raisonné, Wikia Article URL 2, Wikispecies template for this work, imprimatur, periphrasis, terms of service, privacy policy, Musisque Deoque author ID, SAR Value, IAAF competition category, Academic rank, MPG ID, newspaper archive, unabbreviated text, identifiant Q-Codes, ft.dk politician identifier
 * External identifiers: Biyografya ID, Magazine in BitArkivo.org, PharmGKB ID, World Encyclopedia of Puppetry Arts ID, identifiant Missions étrangères de Paris, DigitalNZ ID, Handball123 player ID, ScOT ID, Atlas of Living Australia ID, номер российской организации, AtlasFor ID, Repertorium van ambtsdragers en ambtenaren id, Adventure Gamers ID, Bebo profile numeric ID, MovieLens ID, WikiApiary, Scribd Book ID, Penguin India author ID, Spanish Catalogued Heritage


 * Query examples:
 * 762 people made edits on lexicographical data, 571 created at least one Lexeme (source)
 * [https://query.wikidata.org/#%23defaultView%3AMap%7B%22hide%22%3A%5B%22%3Fgeoshape%22%2C%22%3Frgb%22%2C%22%3FWelshSpeakerPercentage%22%5D%2C%22layer%22%3A%22%3FWelshSpeakerPercentage%22%7D%0ASELECT%20DISTINCT%20%0A%3Farea%20%3FareaLabel%0A%3F2011%0A%3FWelshSpeakerPercentage%0A%3Fgeoshape%20%0A%3Frgb%20%0AWHERE%20%7B%0A%20%20%20%0A%20%20%3Farea%20wdt%3AP3896%20%3Fgeoshape%20.%0A%20%20%20%0A%20%20%7B%20SELECT%20%3Farea%20%3F2011%20%3F2011_percent%20WHERE%20%7B%0A%20%20%20%20%0A%20%20BIND%28CONCAT%28%222001%3A%20%22%2C%20%3F2011_Population%2C%20%22%20%3B%20%22%2C%20%3F2011_Speakers%2C%20%22%20%22%2C%20%3F2011_Percent%29%20as%20%3F2011%29%0A%0A%20%20%7B%20SELECT%20%3Farea%20%3F2011_Population%20%3F2011_Speakers%20%3F2011_Percent%20%3F2011_percent%20WHERE%20%7B%0A%20%20%0A%20%20BIND%28CONCAT%28%22Population%3A%20%22%2C%20STR%28%3F2011_population%29%29%20as%20%3F2011_Population%29%0A%20%20BIND%28CONCAT%28%22Welsh%20speakers%3A%20%22%2C%20STR%28%3F2011_speakers%29%29%20as%20%3F2011_Speakers%29%0A%20%20BIND%28CONCAT%28%22%28%22%2C%20SUBSTR%28STR%28%3F2011_percent%29%2C0%2C5%29%2C%20%22%25%29.%22%29%20as%20%3F2011_Percent%29%0A%20%20%20%0A%20%20%7B%20SELECT%20%3Farea%20%3F2011_speakers%20%3F2011_population%20%3F2011_percent%20WHERE%20%7B%0A%20%20%0A%20%20BIND%28100%20%2a%20%3F2011_speakers%20%2F%20%3F2011_population%20AS%20%3F2011_percent%29%0A%0A%20%20%7B%20SELECT%20%3Farea%20%3F2011_speakers%20%3F2011_population%20WHERE%20%7B%20%0A%20%20%20%20%3Farea%20wdt%3AP31%20wd%3AQ15979307%20%3B%0A%20%20%20%20%20%20%20%20%20%20p%3AP2936%20%3F2011_SpeakersStatement%20%3B%0A%20%20%20%20%20%20%20%20%20%20p%3AP1082%20%3F2011_PopulationStatement%20.%0A%20%20%20%20%0A%20%20%20%20%3F2011_SpeakersStatement%20%3Fpq_qual%20%3Fpq_obj%20.%20%20%20%20%0A%20%20%20%20%3Fqual%20wikibase%3Aqualifier%20%3Fpq_qual%20.%0A%20%20%20%20%3F2011_SpeakersStatement%20pq%3AP585%20%222011-00-00T00%3A00%3A00Z%22%5E%5Exsd%3AdateTime%20%3B%0A%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20pq%3AP1098%20%3F2011_speakers%20.%0A%20%20%20%20%0A%20%20%20%20%3F2011_PopulationStatement%20%3Fpq_qual%20%3Fpq_obj%20.%20%20%20%20%0A%20%20%20%20%3Fqual%20wikibase%3Aqualifier%20%3Fpq_qual%20.%0A%20%20%20%20%3F2011_PopulationStatement%20pq%3AP585%20%222011-00-00T00%3A00%3A00Z%22%5E%5Exsd%3AdateTime%20%3B%0A%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20ps%3AP1082%20%3F2011_population%20.%0A%20%20%7D%20%7D%0A%20%20%20%20%7D%20%7D%0A%20%20%20%20%7D%20%7D%0A%20%20%20%20%7D%20%7D%0A%20%20%0A%20BIND%28%0A%20%20%20%20IF%28%3F2011_percent%20%3C%2010%2C%20%220-10%25%22%2C%0A%20%20%20%20IF%28%3F2011_percent%20%3C%2020%2C%20%2210-20%25%22%2C%0A%20%20%20%20IF%28%3F2011_percent%20%3C%2030%2C%20%2220-30%25%22%2C%0A%20%20%20%20IF%28%3F2011_percent%20%3C%2040%2C%20%2230-40%25%22%2C%0A%20%20%20%20IF%28%3F2011_percent%20%3C%2050%2C%20%2240-50%25%22%2C%0A%20%20%20%20IF%28%3F2011_percent%20%3C%2060%2C%20%2250-60%25%22%2C%0A%20%20%20%20IF%28%3F2011_percent%20%3C%2070%2C%20%2260-70%25%22%2C%20%20%20%0A%20%20%20%20%2270%25%2B%22%29%29%29%29%29%29%29%0A%20%20%20%20AS%20%3FWelshSpeakerPercentage%29.%20%0A%20%20%0A%20%20BIND%28%0A%20%20%20%20IF%28%3F2011_percent%20%3C%2010%2C%20%22ffe6e6%22%2C%0A%20%20%20%20IF%28%3F2011_percent%20%3C%2020%2C%20%22ffb3b3%22%2C%0A%20%20%20%20IF%28%3F2011_percent%20%3C%2030%2C%20%22ff8080%22%2C%0A%20%20%20%20IF%28%3F2011_percent%20%3C%2040%2C%20%22ff4d4d%22%2C%0A%20%20%20%20IF%28%3F2011_percent%20%3C%2050%2C%20%22ff1a1a%22%2C%0A%20%20%20%20IF%28%3F2011_percent%20%3C%2060%2C%20%22e60000%22%2C%0A%20%20%20%20IF%28%3F2011_percent%20%3C%2070%2C%20%22b30000%22%2C%20%20%20%0A%20%20%20%20%22800000%22%29%29%29%29%29%29%29%0A%20%20%20%20AS%20%3Frgb%29.%0A%20%0A%20%20SERVICE%20wikibase%3Alabel%20%7B%20bd%3AserviceParam%20wikibase%3Alanguage%20%22%5BAUTO_LANGUAGE%5D%2Cen%22.%20%7D%0A%20%20%7D%0AORDER%20BY%20%3FWelshSpeakerPercentage Map with percentage of Welsh speakers in the regions of Wales] (source)
 * Publication dates of works used as usage example for Swedish Lexemes (source)
 * Ratios of male/female organizers and speakers at conferences (source)
 * World map of sign languages by country (source)
 * Number of punk bands from Indonesia by region (source)
 * [https://query.wikidata.org/#%23defaultView%3AMap%0ASELECT%20DISTINCT%20%3Fsite%20%3FsiteLabel%20%3Fplace%20%3FplaceLabel%20%3FcountryLabel%20%3Fcoords%20%3Fimage%20%3FunescoUrl%20%3Flayer%20WHERE%20%7B%7B%0ASELECT%20DISTINCT%20%3Fsite%20%3Fplace%20%3Fcountry%20%3Fcoords%20%3Fimage%20%3FunescoUrl%20%3Flayer%20WHERE%20%7B%7B%0ASELECT%20%3Fcountry%20WHERE%20%7B%7B%20%3Fcountry%20wdt%3AP463%20wd%3AQ7809%3Bwdt%3AP30%20wd%3AQ46%20.%20%7D%20UNION%20%7B%20%3Fcountry%20wdt%3AP361%20wd%3AQ1191549%20%7D%20UNION%20%7B%20%3Fcountry%20wdt%3AP31%20wd%3AQ15304003%20%7D%7D%7D%0A%7BSELECT%20DISTINCT%20%3Fsite%20%3Fprotection%20%3Fcountry%20%28group_concat%28DISTINCT%20%3FsiteType%29%20AS%20%3FsiteType%29%20WHERE%20%7B%0A%3Fsite%20p%3AP1435%20%3FprotectionStatement%20%3Bwdt%3AP17%20%3Fcountry%20.%0AFILTER%20NOT%20EXISTS%20%7B%20%3Fx%20wdt%3AP527%20%3Fsite%20.%20%3Fx%20wdt%3AP1435%20%3Fprotection%20%7D%0A%3FprotectionStatement%20ps%3AP1435%20%3Fprotection%20.%0AFILTER%20NOT%20EXISTS%20%7B%20%3FprotectionStatement%20pq%3AP582%20%3Fx%20%7D%0AVALUES%20%3Fprotection%20%7B%20wd%3AQ9259%20wd%3AQ17278671%20wd%3AQ52683530%20wd%3AQ16617071%20wd%3AQ52683527%7D%0AOPTIONAL%20%7B%3Fsite%20wdt%3AP2614%20%3Fcriteria%20.%0ABIND%20%28IF%20%28%28%3Fcriteria%20in%20%28wd%3AQ23038972%2C%20wd%3AQ23038976%2C%20wd%3AQ23038977%2C%20wd%3AQ23038978%2C%20wd%3AQ23038979%2C%20wd%3AQ23038980%29%29%2C%20%22Cultural%22%2C%20%22Natural%22%29%20AS%20%3FsiteType%29%0A%7D%7D%0AGROUP%20BY%20%3Fsite%20%3Fprotection%20%3Fcountry%0A%7D%0A%7B%0A%3Fsite%20p%3AP757%20%3FidStatement%20.%0A%3FidStatement%20ps%3AP757%20%3Fid%20.%0AFILTER%20NOT%20EXISTS%20%7B%20%3FidStatement%20pq%3AP582%20%3Fx%20%7D%0Awd%3AP757%20p%3AP1630%20%3FformatterUrlStatement%20.%0A%7D%20UNION%20%7B%0A%3Fsite%20p%3AP4171%20%3FidStatement%20.%0A%3FidStatement%20ps%3AP4171%20%3Fid%20.%0AFILTER%20NOT%20EXISTS%20%7B%20%3FidStatement%20pq%3AP582%20%3Fx%20%7D%0Awd%3AP4171%20p%3AP1630%20%3FformatterUrlStatement%20.%0A%7D%0A%3FformatterUrlStatement%20ps%3AP1630%20%3Fformatterurl%20%3B%0Apq%3AP407%20wd%3AQ1860%20.%0ABIND%28IRI%28REPLACE%28%3Fid%2C%20%27%5E%28.%2B%29%24%27%2C%20%3Fformatterurl%29%29%20AS%20%3FunescoUrl%29.%0A%7B%3Fsite%20wdt%3AP527%2B%20%3Fplace%20.%0A%3Fplace%20wdt%3AP17%20%3Fcountry%20%3B%0AOPTIONAL%20%7B%20%3Fplace%20wdt%3AP625%20%3Fcoords%20%7D%0AOPTIONAL%20%7B%20%3Fplace%20wdt%3AP18%20%3Fimage%20%7D%0A%7D%20UNION%20%7B%0AFILTER%20NOT%20EXISTS%20%7B%20%3Fsite%20wdt%3AP527%20%3Fx%20%7D%0AOPTIONAL%20%7B%20%3Fsite%20wdt%3AP625%20%3Fcoords%20.%20%7D%0AOPTIONAL%20%7B%20%3Fsite%20wdt%3AP18%20%3Fimage%20%7D%0A%7D%0ABIND%20%28IF%20%28%3Fprotection%20%3D%20wd%3AQ17278671%2C%20%22Tentative%20list%22%2C%20%22World%20heritage%22%29%20AS%20%3FsiteProtection%29%0ABIND%20%28IF%20%28%3FsiteType%20%21%3D%20%22%22%2C%20CONCAT%28%3FsiteProtection%2C%20%22%20-%20%22%2C%20IF%20%28%3FsiteType%20%3D%20%22Cultural%20Natural%22%20%7C%7C%20%3FsiteType%20%3D%20%22Natural%20Cultural%22%2C%20%22Mixed%22%2C%20%3FsiteType%29%29%2C%20%3FsiteProtection%29%20AS%20%3Flayer%29%0A%7D%0A%7D%0ASERVICE%20wikibase%3Alabel%20%7B%20bd%3AserviceParam%20wikibase%3Alanguage%20%22%5BAUTO_LANGUAGE%5D%2Cen%22.%20%7D%0A%7D%0AORDER%20BY%20DESC%28%3Flayer%29%20%3FsiteLabel%20%3FplaceLabel Map of World Heritage places in Europe] (source)
 * Map of parishes of New South Wales, Australia, colour-coded by county (source)
 * Newest database reports: Glasnevin Cemetery, Mount Jerome Cemetery


 * Development
 * Enabled JSONLD support for entity data (example)
 * Fixed JSON dump issue (T226601)
 * Readded qualifier hashes in JSON output that accidentally got lost (T227207)
 * Made sure changes on Entity Schema pages by autoconfirmed users are autopatrolled (T224495)
 * Looked into issues with partial blocking on Wikidata entities (T207893)
 * Adding MediaInfo as an entity type that can be used in the "allowed entity types"-constraint (T224362)
 * Investigating an issue with the tours on Wikidata (T223999)
 * Initial work for the edit dialog that is needed for editing of Wikidata's data from Wikipedia (Wikidata Bridge)

You can see all open tickets related to Wikidata here. If you want to help, you can also have a look at the tasks needing a volunteer.


 * Monthly Tasks
 * Add labels, in your own language(s), for the new properties listed above.
 * Comment on property proposals: all open proposals
 * Suggested and open tasks!
 * Contribute to a Showcase item.
 * Help translate or proofread the interface and documentation pages, in your own language!
 * Help merge identical items across Wikimedia projects.
 * Help write the next summary!

Read the full report &middot; Unsubscribe &middot; Lea Lacroix (WMDE) 15:22, 8 July 2019 (UTC)