User:Wikiqrdl/sandbox

I'm investigating category views on Wikipedia.

Motivating question: Imagine that I am a STEM educator and want a complete list of all on-topic categories, subcategories, and page ids for just the STEM articles on Wikipedia. How would I get that?

The categorylinks table is a non-starter on its own, because the category tree is overcategorized: following subcategories quickly drifts off topic. E.g.: Botany -> Plants -> Coats of arms with plants.
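A minimal sketch of the drift problem, using a hypothetical in-memory graph rather than the real categorylinks data. Only the Botany -> Plants -> Coats of arms with plants chain comes from the example above; the other categories, the `SUBCATS` dict, and the `walk_subcats` helper are made up for illustration. It shows that a plain breadth-first walk of subcategories reaches an off-topic category within two hops, and that a depth limit alone only trades recall for precision.

```python
from collections import deque

# Hypothetical toy graph: parent category -> direct subcategories.
# Only Botany -> Plants -> Coats of arms with plants is taken from the
# note above; the remaining entries are invented for illustration.
SUBCATS = {
    "Botany": ["Plants", "Plant physiology"],
    "Plants": ["Coats of arms with plants", "Trees"],
    "Coats of arms with plants": [],
    "Plant physiology": [],
    "Trees": [],
}


def walk_subcats(root, subcats, max_depth):
    """Breadth-first walk of subcategories.

    Returns a dict mapping each reached category to the path from root,
    visiting at most max_depth levels below the root.
    """
    paths = {root: [root]}
    queue = deque([root])
    while queue:
        cat = queue.popleft()
        depth = len(paths[cat]) - 1
        if depth >= max_depth:
            continue  # do not expand categories at the depth limit
        for child in subcats.get(cat, []):
            if child not in paths:  # skip already-seen categories (cycles)
                paths[child] = paths[cat] + [child]
                queue.append(child)
    return paths


if __name__ == "__main__":
    for path in walk_subcats("Botany", SUBCATS, max_depth=2).values():
        print(" -> ".join(path))
```

With `max_depth=1` the heraldry category disappears, but so does Trees, which is why a human-curated view of on-topic categories seems more useful than a depth cutoff.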

I've proposed this in several different places: https://en.wikipedia.org/wiki/Wikipedia:Village_pump_(policy)#Category_views - Response: check out portals and Wikidata

https://www.wikidata.org/wiki/Wikidata:Project_chat#Is_there_a_way_to_get_all_STEM_pages_on_wikipedia? - Response: check out WikiProjects

https://en.wikipedia.org/wiki/Wikipedia_talk:WikiProject_Council#Get_a_complete_list_of_STEM_categories,_their_subcategories,_and_page_ids - Response: check out Wikipedia categories :)

https://en.wikipedia.org/wiki/Wikipedia_talk:WikiProject_Portals#Get_a_complete_list_of_STEM_categories,_their_subcategories,_and_page_ids - Response: none so far

https://forum.dbpedia.org/t/category-views-and-overcategorization-on-wikipedia/2448 - Response: none so far

https://en.wikipedia.org/wiki/Wikipedia:Village_pump_(technical)#Is_there_a_way_to_get_all_STEM_pages_on_wikipedia? - Response: none so far

There's actually a lot of research on this (https://scholar.google.com/scholar?cites=5172723283304070766&as_sdt=2005&sciodt=0,5&hl=en), but it's largely about automation. As an engineer, I'm a huge fan of automation, but it should all sprout from human knowledge, and categorization is the root of human knowledge.

https://en.wikipedia.org/wiki/Wikipedia:Village_pump_(idea_lab)#Portals_should_be_strongly_encouraged_to_organize_around_sparql - Proposed this idea. Bringing these efforts together seems likely to firm up the knowledge graph on Wikipedia.

Todo: Keep following up on suggestions. Ideally, this effort is already under way and I can contribute to it.