Have a personal or library account? Click to login
Improving Audio Chord Estimation by Alignment and Integration of Crowd-Sourced Symbolic Music Cover

Improving Audio Chord Estimation by Alignment and Integration of Crowd-Sourced Symbolic Music

Open Access
|Nov 2021

Abstract

Automatic Chord Estimation (ACE) is a fundamental task in Music Information Retrieval (MIR) and has applications in both music performance and MIR research. The task consists of segmenting a music recording or score and assigning a chord label to each segment. Although it has been a task in the annual benchmarking evaluation MIREX for over 10 years, ACE is not yet a solved problem, since performance has stagnated and modern systems have started to tune themselves to subjective training data. We propose DECIBEL, a new ACE system that exploits heterogeneous musical representations, specifically MIDI and tab files, to improve audio-based ACE methods. From an audio file and a set of MIDI and tab files corresponding to the same popular music song, DECIBEL first estimates chord sequences. For audio, state-of-the-art audio ACE methods are used. MIDI files are aligned to the audio, followed by a MIDI chord estimation step. Tab files are transformed into untimed chord sequences and then aligned to the audio. Next, DECIBEL uses data fusion to integrate all estimated chord sequences into one final output sequence. DECIBEL improves all tested state-of-the-art ACE methods by 0.5 to 13.6 percentage points. This result shows that the integration of crowd-sourced annotations from heterogeneous symbolic music representations using data fusion is a suitable strategy for addressing challenging MIR tasks such as ACE.

DOI: https://doi.org/10.5334/tismir.81 | Journal eISSN: 2514-3298
Language: English
Submitted on: Nov 3, 2020
Accepted on: Jul 5, 2021
Published on: Nov 9, 2021
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2021 Daphne Odekerken, Hendrik Vincent Koops, Anja Volk, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.