Have a personal or library account? Click to login
Unveiling the Hierarchical Structure of Music by Multi-Resolution Community Detection Cover

Unveiling the Hierarchical Structure of Music by Multi-Resolution Community Detection

Open Access
|Jun 2020

Figures & Tables

tismir-3-1-41-g1.png
Figure 1

Schematic overview of MSCOM with all the main steps of its workflow.

tismir-3-1-41-g2.png
Figure 2

The main steps detailed in Section 3.1 for the creation of the music graph for the track “SALAMI 676”. The recurrence graph R computed on the chroma features and its smoothed version R′ to enhance diagonal stripes are illustrated in the top quadrants. The bottom-left plot represents the proximity graph Δ with a zoomed area highlighting its upper and lower off-diagonals that ensure the linkage of nodes corresponding to temporally consecutive feature vectors. The graph Gµ in the last quadrant is a weighted sum of R′ and Δ as outlined in Equation 4.

Algorithm 1

Hierarchical community detection

Given the N × N adjacency matrix W of a graph
Given Δr, a fixed step increment for r
Let W[S] be the square sub-matrix obtained by selecting the rows and columns of W with index in S
1: l ←1
2: r2wN
3: WW + rI
4: C{C1={1,2,,N}}▷ all node indices in C1
5: While |Cl|< N do
6:         ll+1▷current level
7:         Cl ← {}
8:         for  Cj1  in  C1if|Cj1|>0do
9:             PCj{Cj,1,Cj,2,,Cj,m}  =
            optimal partition of W[Cj1]
10:           CCPCj
11:       end for
12:       rr + Δr
13:       WW + rI
14: end while
tismir-3-1-41-g3.png
Figure 3

Hierachical expansion of the first human annotation of SALAMI 1094. The two segmentation levels denoted with the upper and lower tags define the original hierarchy, whereas the coarse and the refined levels are obtained by contracting the upper level and refining the lower level respectively.

tismir-3-1-41-g4.png
Figure 4

Analysis of monotonicity in LSD’s hierarchical segmentations. Left: distribution of monotonicity for each couple of successive levels in the hierarchies estimated by LSD. Right: distribution of the level (or depth) in LSD’s hierarchies at which maximum monotonicity is no longer preserved.

Table 1

Overview of the segmentation performance - mean and standard deviation of the L-measures - of each algorithm under analysis with respect to the first reference annotation provided for each track in the SALAMI dataset. The evaluation is performed for both the original (left) and the extended (right) reference hierarchies.

Original reference hierarchiesExtended reference hierarchies
L-measureL-precisionL-recallL-measureL-precisionL-recall
LSD0.462 ± 0.1280.394 ± 0.1200.584 ± 0.1500.480 ± 0.1230.420 ± 0.1200.577 ± 0.143
LSDM0.301 ± 0.1790.377 ± 0.1580.289 ± 0.2050.309 ± 0.1790.402 ± 0.1580.282 ± 0.194
OLDA0.398 ± 0.1010.325 ± 0.0980.536 ± 0.1110.415 ± 0.0970.348 ± 0.0980.531 ± 0.104
MSCOM0.460 ± 0.1120.382 ± 0.1020.600 ± 0.1350.478 ± 0.1050.408 ± 0.0980.593 ± 0.129
DMSCOM0.480 ± 0.1110.403 ± 0.1030.611 ± 0.1330.500 ± 0.1040.430 ± 0.1000.607 ± 0.127
Table 2

Summary of the Kolmogorov-Smirnov statistical tests used to detect statistically significant differences between the algorithms’ performance on each evaluation metric. For each measure, ‘O’ denotes the evaluation performed on the original reference hierarchies, whereas ‘E’ refers to the extended counterpart. ns: not significant, p > 0.05; * p ≤ 0.05; ** p ≤ 0.01; *** p ≤ 0.001.

L-measureL-precisionL-recall
OEOEOE
MSCOM
LSDnsnsns*nsns
LSDM******************
OLDA******************
DMSCOM
LSD***************
LSDM******************
OLDA******************
MSCOM**********nsns
tismir-3-1-41-g5.png
Figure 5

Segmentation performance degradation, in terms of the L measures outlined in Section 4.2, as function of track duration. The first row reports the trend for the evaluation of OLDA, whereas the second one is related to DMSCOM. A regression line is plotted along with the data to facilitate the comparison of the graphs.

Table 3

L-measures for the quantification of inter-annotator agreement: an upper limit for the segmentation performance of the automatic methods.

Original hierarchiesExtended hierarchies
L-measure0.640 ± 0.1980.678 ± 0.168
L-precision0.641 ± 0.1970.683 ± 0.175
L-recall0.662 ± 0.20.694 ± 0.174
DOI: https://doi.org/10.5334/tismir.41 | Journal eISSN: 2514-3298
Language: English
Submitted on: Sep 6, 2019
Accepted on: May 12, 2020
Published on: Jun 24, 2020
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2020 Jacopo de Berardinis, Michail Vamvakaris, Angelo Cangelosi, Eduardo Coutinho, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.