Have a personal or library account? Click to login
Dagstuhl ChoirSet: A Multitrack Dataset for MIR Research on Choral Singing Cover

Dagstuhl ChoirSet: A Multitrack Dataset for MIR Research on Choral Singing

Open Access
|Jul 2020

Figures & Tables

tismir-3-1-48-g1.png
Figure 1

Dagstuhl ChoirSet—an overview.

Table 1

Comparison of polyphonic singing datasets described in Section 2. The reported durations refer to the total recording duration (not counting multiple tracks per recording if available).

Name/AuthorMultitrackAnnotationsPublicly Available# RecordingsDuration (hh:mm:ss)
Su et al. (2016)NoMIDIOn Request5 excerpts00:02:11
Barbershop Quartets10YesMIDINo22 songs00:42:10
Bach Chorales11YesMIDINo26 songs00:58:20
Scherbaum et al. (2019)YesOn Request216 songs06:04:40
Erkomaishvili DatasetNoStructure, F0, Score, OnsetsYes101 songs07:05:00
(Rosenzweig et al. 2020)
Choral Singing Dataset (CSD)YesMIDI, F0, NotesYes3 songs00:07:14
(Cuesta et al., 2018)
Dagstuhl ChoirSet (DCS)YesMIDI, F0, BeatsYes2 songs, exercises00:55:30
tismir-3-1-48-g2.png
Figure 2

Anton Bruckner, Locus Iste WAB 23 (measures 1 to 11). The score was obtained from CPDL and edited by Brian Marble.13

Table 2

Overview of the audio recordings in DCS. The third column indicates the number of takes available for each piece and the last column refers to the total duration of all takes together.

PieceSetting# TakesDuration (mm:ss)
Locus IsteFull Choir307:22
Quartet A716:26
Quartet B614:02
Tebe PoemFull Choir505:27
Quartet A202:30
ExercisesFull Choir3306:00
Quartet A2503:43
Total8155:30
tismir-3-1-48-g3.png
Figure 3

Microphone setup for one singer.

tismir-3-1-48-g4.png
Figure 4

Comparison of LRX and DYN signals from a tenor singer. Excerpts correspond to the marked Locus Iste passage in Figure 2. (a) Magnitude spectrograms. CREPE F0-trajectories are plotted on top in the respective colors. (b) Smoothed CREPE confidence. (c) Binarized trajectory activations obtained by thresholding smoothed confidence (LRX threshold: 0.935, DYN threshold: 0.9).

tismir-3-1-48-g5.png
Figure 5

Screenshot (detail) of digital audio workstation (Logic Pro X) with multiple tracks.

Table 3

DCS dimensions.

DimensionShortcutMeaning
SongLILocus Iste
TPTebe Poem
SESystematic Exercises
SettingFullChoirFull Choir Setting
QuartetAQuartet A Setting
QuartetBQuartet B Setting
TakeTakeTake Number
VoiceSSoprano
AAlto
TTenor
BBass
StereoStereo Mic
StereoReverbStereo Mic Reverb
MicrophoneLRXLarynx Mic
DYNDynamic Mic
HSMHeadset Mic
STRStereo Mic R
STLStereo Mic L
STMStereo Mic L+R
Table 4

Evaluation results for pYIN trajectories averaged over two quartet recordings.

MicVRVFARPARCAOA
LRX0.99 (0.00)0.11 (0.06)0.95 (0.02)0.95 (0.01)0.93 (0.03)
HSM0.98 (0.01)0.33 (0.09)0.81 (0.10)0.91 (0.04)0.77 (0.08)
DYN0.99 (0.00)0.16 (0.11)0.93 (0.04)0.95 (0.01)0.90 (0.05)
Table 5

Evaluation results for CREPE trajectories averaged over two quartet recordings.

MicVRVFARPARCAOA
LRX0.96 (0.01)0.12 (0.02)0.96 (0.01)0.96 (0.01)0.93 (0.02)
HSM0.92 (0.02)0.32 (0.08)0.91 (0.01)0.91 (0.02)0.84 (0.02)
DYN0.93 (0.01)0.18 (0.07)0.93 (0.01)0.93 (0.01)0.90 (0.02)
tismir-3-1-48-g6.png
Figure 6

Averaged intonation cost (IC) measures for six takes of Locus Iste by Quartet A and five takes by Quartet B. The local standard deviations are indicated in light grey.

tismir-3-1-48-g7.png
Figure 7

Multiple-F0-estimation using DeepSalience (Bittner et al., 2017) with a threshold of 0.1. (a) Estimation results (excerpts) for the mix of DYN signals and the STM signal with reverb. (b) Evaluation metrics for all scenarios.

DOI: https://doi.org/10.5334/tismir.48 | Journal eISSN: 2514-3298
Language: English
Submitted on: Feb 14, 2020
Accepted on: Jun 10, 2020
Published on: Jul 29, 2020
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2020 Sebastian Rosenzweig, Helena Cuesta, Christof Weiß, Frank Scherbaum, Emilia Gómez, Meinard Müller, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.