Speech/Music classification of audio files using machine learning techniques.
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
Frank Blanning ecad9a1d03 added download script 6 years ago
..
.gitignore Add dataset 6 years ago
README.md Add dataset 6 years ago
downloadDataSet.sh added download script 6 years ago

README.md

Dataset

This dataset was downloaded from Marsyas website. It is the famous GTZAN dataset. A direct download link is this.

From Marsyas' website:

A dataset which was collected for the purposes of music/speech discrimination. The dataset consists of 120 tracks, each 30 seconds long. Each class (music/speech) has 60 examples. The tracks are all 22050Hz Mono 16-bit audio files in .wav format.