Downloads the Free Spoken Digits dataset and loads its metadata as
pandas dataframes. The audio samples are as

```python
import dataget

df = dataget.audio.free_spoken_digit().get()
```

The `df` dataframe has an `audio_path` column, which contains the relative path of each
sample. You can load the samples using `scipy.io.wavfile.read`.
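
As a minimal sketch of loading one sample: the helper `load_sample` and its `root_dir` parameter are hypothetical names, and the code assumes the relative paths resolve against the folder that holds the dataset. Only `scipy.io.wavfile.read` comes from the text above.

```python
from pathlib import Path

import numpy as np
from scipy.io import wavfile


def load_sample(root_dir, audio_path):
    """Read one WAV file and return (sample_rate, samples)."""
    # Assumption: audio_path is relative to root_dir, the dataset folder.
    full_path = Path(root_dir) / audio_path
    # wavfile.read returns the sample rate in Hz and the samples as an array.
    sample_rate, samples = wavfile.read(str(full_path))
    return sample_rate, np.asarray(samples)
```

With a dataframe loaded as above, a call might look like `rate, samples = load_sample(root_dir, df["audio_path"].iloc[0])`, where `root_dir` is whatever directory the paths are relative to.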
It's recommended that you split train / test based on
user instead of randomly, to avoid testing on samples similar to those found in training.
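
One way to do such a split is sketched below. The function `split_by_speaker` is a hypothetical helper, not part of dataget, and the name of the speaker column is passed in as a parameter because it is an assumption here. The idea is to hold out whole speakers, so that no speaker contributes rows to both sets.

```python
import random

import pandas as pd


def split_by_speaker(df, speaker_col, test_fraction=0.2, seed=0):
    """Split df so that no speaker appears in both train and test."""
    # Sort first so the shuffle is reproducible for a given seed.
    speakers = sorted(df[speaker_col].unique().tolist())
    random.Random(seed).shuffle(speakers)
    # Hold out at least one speaker for the test set.
    n_test = max(1, int(round(len(speakers) * test_fraction)))
    test_speakers = speakers[:n_test]
    is_test = df[speaker_col].isin(test_speakers)
    return df[~is_test], df[is_test]


# Toy data with a hypothetical "speaker" column, for illustration only.
toy = pd.DataFrame({
    "speaker": ["a", "a", "b", "b", "c", "c", "d", "d", "e", "e"],
    "label": list(range(10)),
})
toy_train, toy_test = split_by_speaker(toy, "speaker")
```

This needs at least two speakers to leave anything in the training set. Because the fraction applies to speakers rather than rows, the row counts of the two sets will only roughly match `test_fraction` when speakers have similar numbers of samples.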
||Relative path of the audio file|
||Target label in the range
||Name of the speaker|
||Repetition number for each (user, label) pair, i.e. each user repeats each digit multiple times|
- Folder name:
- Size on disk: