Draft:Data science education

Data science is a practical and research domain focuses on extracting value and knowledge from data. Data science education is, accordingly, a domain that focuses on data science curricula and pedagogy or, in other words, on answering the questions “What should be taught teach in data science programs?” and “What is a suitable pedagogy for teaching data science?”. . Data science education is gaining growing importance as the need for data scientists increases and the need for data literacy expands to more and more populations, especially today when generative-AI is becoming more and more pervasive.

Data science is a new discipline in the collection of STEM disciplines. While gender imbalances are a well-known challenge in STEM disciplines, and specifically in computer science, data science may promote gender balance in STEM disciplines due to its interdisciplinary nature.

Main research topics in data science education
A survey of 1048 papers regarding data science education revealed that the main research topics of data science education can be categorized into (a) data science curricula, (b) data science pedagogy, (c) STEM skills, (d) domain adaptation, and (e) social aspects of data science aspects

The data science curriculum category refers to the question “What should be taught in data science programs”. Research focuses on principles of data science curriculum design, approaches to data science education, the Introduction to Data Science course, data science programs for data science majors, and data science for K-12 pupils.

The data science pedagogy category refers to the efficient methods of teaching data science. The research focuses on teaching AI and machine learning, general teaching methods for data science, online teaching, and tools and methods for data science education.

The STEM skills category refers to the integration and interaction between statistics, computer science, and data science. Both the statistical and computer science skills that are required within the context of data science, as well as the teaching of data science within the context of statistics and computer science, are discussed.

The domain adaptation category refers to the challenges of teaching data science within the context of specific domain knowledge, such as in business, health, digital technologies, biomedicine, education and more.

The social aspects of data science category is attracting growing attention and includes the topics and methods of teaching ethics, teaching data science as a skill, enhancing student engagement, and enhancing diversity in data science