User:Paradoxsociety/Projects/Wikiproject proposal: Data management

As of June 2020 I am actively researching the best way to start a Wikiproject about Data Management, or potentially do such work underneath the umbrella of an existing Wikiproject if a suitable one exists.

UPDATE - September 2021 - The only similar Wikiproject I have found thus far is Wikiproject Databases which is only semi-active and based on the content / commentary there, I think there is only a partial overlap between that project and my intentions for this one. I would like the scope for the Data Management Wikiproject to only cover the history, technology, and theory of data management. As this is a discipline that is only really recently beginning to mature, I think this focused scope should help attract more interest from recently active editors as well.

UPDATE - July 2022 - I have begun drafting the formal proposal a bit further down this page. I will be reorganizing this page in the coming weeks to clean up the proposal.

Articles that should be in scope

 * Data masking
 * Materialized view
 * Looker (company)
 * Snowflake Inc. popular company used for modern data management
 * Sixth normal form
 * Data definition language
 * Data warehouse
 * Codd's 12 rules
 * Data anonymization
 * Operational data store
 * SQL
 * NoSQL
 * Tableau Software
 * Domo (company)
 * Data cube
 * Dimension (data warehouse)
 * Measure (data warehouse)
 * Slowly changing dimension
 * Fact table
 * Aggregate (data warehouse)
 * Ralph Kimball
 * Bill Inmon
 * Malloy (query language)
 * LookML
 * Data build tool
 * Universally unique identifier
 * Data engineering

draft of formal proposal below
The content below is from the current subst template (as of 2022-07-19) for WikiProject Proposals and will eventually be moved to WP space when I'm ready to present the proposal to the community.

Description
This is a proposal for a new "data management" WikiProject to reflect modern organizational data practices, encompassing big data, data science, data management, business intelligence, and related fields. Paradox society  21:09, 19 July 2022 (UTC)

List of important pages and categories for this proposed group
 * category name (number of pages in the category: )
 * category name (number of pages in the category: )
 * category name (number of pages in the category: )
 * category name (number of pages in the category: )
 * category name (number of pages in the category: )
 * category name (number of pages in the category: )
 * category name (number of pages in the category: )
 * category name (number of pages in the category: )
 * category name (number of pages in the category: )


 * List of WikiProjects currently on the talk pages of those articles
 * Please invite these and any other similar groups to join the discussion about this proposal. See WikiProject_Council/Directory to find similar WikiProjects.




 * Why do you want to start a new group, instead of joining one of these existing groups?
 * Data management is my current profession and I have found the content and organization lacking on Wikipedia when I am trying to learn about certain concepts. As a practitioner the current set of Wikipedia articles is missing entire articles that should exist, and existing articles have outdated descriptions of concepts that do not reflect the modern practice of data management.
 * WikiProject Databases is the closest thing I've found but it is an inactive project and its scope is too broad / general for my interests. I would look into reviving it, but it seems like a fresh start would be the least cumbersome approach. Anyone who was involved with that project will be invited to join this one.

Support
Also, specify whether or not you would join the project.
 * 1) Paradox  society  21:09, 19 July 2022 (UTC)

Discussion

 * Related old proposal: WikiProject Council/Proposals/Data Science - terminology is something I want to help clarify across Wikipedia. Data management teams often support data science / analytics teams, and so there are a lot of closely related concepts. Scope could perhaps be broadened to encompass data science or perhaps Analytical Data Management. Will come back to this