Aislyn Rose

Resources for publicly available machine learning datasets, including speech

Published Jan 06, 2019

Purpose:

To collect links to freely available datasets for machine learning, with an emphasis on speech. Links were working as of January 2019.

Healthy Speech

Clinical Speech

Emotion in Speech

  • German: EmoDB

  • English: The Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS)

Speech Commands

General Sounds / Noise / Music

General Datasets

  • GitHub repo for machine learning datasets: Observations

  • GitHub repo comparing ConvNets and RNNs on sequence data: locuslab

  • An amazing list of open-source datasets

Text

  • Sentiment analysis datasets

  • Text comprehension: SQuAD (the Stanford Question Answering Dataset)

Images

  • Object class recognition: PASCAL VOC. Registration is required, but the download should be free.

  • Object class recognition: COCO (Common Objects in Context)

  • ImageNet: “an image database organized according to the WordNet hierarchy”.

Pre-Trained Models