Have a personal or library account? Click to login
Structural Segmentation of Alap in Dhrupad Vocal Concerts Cover

Figures & Tables

tismir-3-1-64-g1.png
Figure 1

Back cover of a Dhrupad concert audio CD by Pt. Nirmalya Dey.

Table 1

Dhrupad alap sections with descriptions.

SectionMusical characteristics
Alap-properRhythm free, slow and elaborate development of raga notes and phrases. A wide melodic range is spanned with focus gradually shifting from middle octave tonic to lower and then the higher octave. The melodic glide noom and mohra phrase serve as boundary cues.
JodIntroduction of regular and slow pulsations via syllable rate. Melodic development and boundary cues similar to Alap-proper.
JhalaPulsation accelerates indicating climax. Syllable articulation more regular. The melodic range spanned is relatively narrow.
tismir-3-1-64-g2.png
Figure 2

Waveform and spectrogram of a 30 s excerpt around a Jod-Jhala boundary (labeled with a vertical dashed line) showing the melodic glide noom (with its first harmonic in the box).

tismir-3-1-64-g3.png
Figure 3

Distribution of (a) section durations and (b) mean tempi of Dhrupad alap sections.

tismir-3-1-64-g4.png
Figure 4

One instance (fold) of the 20-fold cross-validation process adopted for train-test data splitting.

tismir-3-1-64-g5.png
Figure 5

Bi-phasic filter impulse response, with the discrete samples superposed, as applied to generate (a) onset detection function from sub-band energy, and (b) derivative features from short-time energy and short-time spectral centroid.

tismir-3-1-64-g6.png
Figure 6

Rhythmogram of the UB_AhirBhrv alap (dashed lines indicate labeled boundaries).

tismir-3-1-64-g7.png
Figure 7

Analysis of UB_AhirBhrv alap containing 4 sections: alap-proper, jod, jhala and jalad-jhala (a) Tempo, (b) Salience or pulse clarity, (c) Posteriors of rhythm, (d) Short-time energy difference, (e) Short-time centroid difference, and (f) MFCC C-1 coefficient. Dashed lines indicate manually labeled section boundaries.

tismir-3-1-64-g8.png
Figure 8

Mel-spectrogram of UB_AhirBhrv alap (dashed lines indicate labeled boundaries).

tismir-3-1-64-g9.png
Figure 9

Block diagram for the extraction of acoustic features.

Table 2

A key to the naming of feature subsets.

Feature subset nameFeatures
RhythmPosteriors of (tempo, salience)
MFCCFirst 13 MFCCs
TimbreMFCC
Short-time energy difference
Short-time centroid difference
AllRhythm
Timbre
tismir-3-1-64-g10.png
Figure 10

SDM for UB_AhirBhrv alap with (a) ACF, (b) rhythm features, (c) posterior features, and (d) MFCC.

tismir-3-1-64-g11.png
Figure 11

Novelty function for UB_AhirBhrv alap with (a) ACF, (b) rhythm features, (c) posterior features, and (d) MFCC. Dashed lines indicate manual boundaries.

Table 3

Performance of the unsupervised approach using different feature subsets with specified averaging window durations.

Feature subsetParametersPerformance
Window(s)PrecisionRecallF-score
Rhythm200.400.570.47
MFCC30.590.570.58
Timbre30.610.570.59
All20 or 30.720.660.69
Table 4

Performance of RF classifier using different feature subsets with an averaging window of 20 s, for values of context (C) and #trees giving the best F-scores. In parentheses are results without training data augmentation.

Feature subsetParametersPerformance
C (±s)# treesPrecisionRecallF-score
Rhythm50300.170.260.21
MFCC50100.850.740.79
Timbre20500.860.810.83
All201000.90
(0.89)
0.81
(0.75)
0.85
(0.81)
Table 5

Performance of CNN classifier with averaging window of 3 s with different context durations C.

ParametersPerformance
C (±s)PrecisionRecallF-score
200.690.770.73
500.920.810.86
Table 6

Performance comparison of different feature combinations and methods.

Segmentation approachPrecisionRecallF-score
Without rhythm features
RF0.860.810.83
CNN0.920.810.86
Unsupervised0.610.570.59
With rhythm features
RF0.900.810.85
Unsupervised0.720.660.69
Table 7

Segmentation performance of the different methods on test concerts.

Test alapUnsupervisedRF ClassifierCNN
PrecisionRecallF-scorePrecisionRecallF-scorePrecisionRecallF-score
PN_Jog0.500.500.500.900.900.900.501.00.67
PN_Maru0.500.500.500.470.700.56000
Table 8

The Dhrupad alap dataset used in this work (see Github repository mentioned in Section 7 for details).

Sl.#AlapArtistRagaDur (min)#Sections
1GB_AhirBhrvGundecha BrothersAhir Bhairav49:474
2GB_BhgGundecha BrothersBihag21:233
3GB_BhimGundecha BrothersBhimpalasi17:223
4GB_BhrvGundecha BrothersBhairav53:116
5GB_BKTGundecha BrothersBilaskhani Todi43:004
6GB_KRAGundecha BrothersKomal Rishabh Asavari36:304
7GB_MarGundecha BrothersMarwa48:345
8GB_MMalGundecha BrothersMiya Malhar45:425
9GB_YamGundecha BrothersYaman46:324
10RS_BindRitwik SanyalBindeshwari19:574
11RS_ShrRitwik SanyalShree26:903
12Sul_Man_YamSulabha – Manoj SarafYaman21:463
13UB_AhirBhrvUday BhawalkarAhir Bhairav48:004
14UB_BhgUday BhawalkarBihag51:103
15UB_BhrvUday BhawalkarBhairav50:223
16UB_JogUday BhawalkarJog25:463
17UB_MalkUday BhawalkarMalkauns61:163
18UB_MaruUday BhawalkarMaru35:353
19UB_ShrUday BhawalkarShree19:453
20WD_BhgWasifuddin DagarBihag40:223
DOI: https://doi.org/10.5334/tismir.64 | Journal eISSN: 2514-3298
Language: English
Submitted on: May 5, 2020
Accepted on: Jul 7, 2020
Published on: Sep 16, 2020
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2020 Preeti Rao, Thallam Prasad Vinutha, Mattur Ananthanarayana Rohit, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.