Have a personal or library account? Click to login
Creating DALI, a Large Dataset of Synchronized Audio, Lyrics, and Notes Cover

Creating DALI, a Large Dataset of Synchronized Audio, Lyrics, and Notes

Open Access
|Jun 2020

Abstract

The DALI dataset is a large dataset of time-aligned symbolic vocal melody notations (notes) and lyrics at four levels of granularity. DALI contains 5358 songs in its first version and 7756 for the second one. In this article, we present the dataset, explain the developed tools to work the data and detail the approach used to build it. Our method is motivated by active learning and the teacher-student paradigm. We establish a loop whereby dataset creation and model learning interact, benefiting each other. We progressively improve our model using the collected data. At the same time, we correct and enhance the collected data every time we update the model. This process creates an improved DALI dataset after each iteration. Finally, we outline the errors still present in the dataset and propose solutions to global issues. We believe that DALI can encourage other researchers to explore the interaction between model learning and dataset creation, rather than regarding them as independent tasks.
DOI: https://doi.org/10.5334/tismir.30 | Journal eISSN: 2514-3298
Language: English
Submitted on: Jan 24, 2019
Accepted on: Apr 9, 2020
Published on: Jun 11, 2020
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2020 Gabriel Meseguer-Brocal, Alice Cohen-Hadria, Geoffroy Peeters, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.