User:Datakeeper/valuabledatasets

PAGE TITLE: List of datasets for machine learning research.

This is a list of noteworthy datasets for machine learning research. This list is not exhaustive, and is limited to noteworthy, high-quality datasets.

Datasets are an integral part of the field of machine learning. Major advances in this field can result from advances in learning algorithms (such as deep learning), computer hardware, and, less-intuitively, the availability of high-quality training datasets. High-quality labeled training datasets for supervised and semi-supervised machine learning algorithms are usually difficult and expensive to produce because of the large amount of time needed to label the data. Although they do not need to be labeled, high-quality datasets for unsupervised learning can also be difficult and costly to produce.