Have a personal or library account? Click to login
Audio-Based Music Structure Analysis: Current Trends, Open Challenges, and Applications Cover

Audio-Based Music Structure Analysis: Current Trends, Open Challenges, and Applications

Open Access
|Dec 2020

Figures & Tables

tismir-3-1-54-g1.png
Figure 1

Example of a flat structure annotation (track 10 from SALAMI). The left side displays the full track; a zoomed-in version of a segment boundary (marked with a dashed light blue rectangle on the left) is shown on the right. On top, log-mel power spectrograms of the audio signal are displayed, while at the bottom the annotations are plotted.

tismir-3-1-54-g2.png
Figure 2

SSM prototype of track 10 from SALAMI. Blocks contain homogeneous segments, diagonals represent repetitions (except the main one), and dashed-lines depict the reference annotation.

tismir-3-1-54-g3.png
Figure 3

Self similarity matrix (left) and its associated novelty curve (right) of track 10 from SALAMI. Brighter colors in the SSM indicate a greater degree of similarity. Dashed lines mark segment boundaries identified by annotator 5.

tismir-3-1-54-g4.png
Figure 4

Example of a hierarchical structure annotation from annotator 4 of track 10 in SALAMI. The functional level is plotted on top. In the middle, the coarse level is shown, with notable differences from those segmentations plotted in Figure 1 due to annotators disagreements. In the bottom, the fine level is displayed.

Table 1

Best performing evaluation metrics (percentages) for the MSA task in MIREX for the years 2012 to 2017. *: Smaller subset of SALAMI; †: 2015 submission by Grill and Schlüter (2015a); ‡: 2012 submission by Serrà et al. (2014); §: 2014 submission by Ullrich et al. (2014).

Dataset F1(PH,RH)0.5 F1(PH,RH)3 F1(PP,RP)
MIREX 200956.42 ± 17.04†70.35 ± 14.87†65.28 ± 15.11‡
MIREX 2010 (1)69.70 ± 13.59†79.34 ± 9.43†
MIREX 2010 (2)52.37 ± 17.54†73.80 ± 11.68§68.83 ± 11.91‡
SALAMI*54.09 ± 18.50†68.94 ± 17.51§58.09 ± 15.77‡
DOI: https://doi.org/10.5334/tismir.54 | Journal eISSN: 2514-3298
Language: English
Submitted on: Feb 29, 2020
Accepted on: Oct 6, 2020
Published on: Dec 11, 2020
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2020 Oriol Nieto, Gautham J. Mysore, Cheng-i Wang, Jordan B. L. Smith, Jan Schlüter, Thomas Grill, Brian McFee, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.