Have a personal or library account? Click to login
BPSD: A Coherent Multi-Version Dataset for Analyzing the First Movements of Beethoven’s Piano Sonatas Cover

BPSD: A Coherent Multi-Version Dataset for Analyzing the First Movements of Beethoven’s Piano Sonatas

Open Access
|Sep 2024

Figures & Tables

tismir-7-1-196-g1.png
Figure 1

Schematic overview of the BPSD. For the first movements of all 32 piano sonatas, the dataset comprises raw data in different representations (versions) such as score images, symbolic score representations, and audio recordings of different performances. For the different versions, we provide time-aligned annotations of measure positions, beats, global and local keys, chords, and structural elements. The score and the four audio versions indicated in dark blue are in the public domain (EU).

Table 1

Overview of the folder structure of the BPSD. Score-based folders contain files named in the format Beethoven_workID.ext, while audio-based folders contain files in the format Beethoven_workID_performerID.ext.

Folder nameContent
- 0_RawDataRaw audio and symbolic data
| - audio_rippedAudio files as ripped from the CD
| - WK64
| ...
| - FG67
| - score_pdf_scanScanned score from IMSLP
| - score_pdf_repetitionsSymbolic score in PDF format with repeat signs
| - score_pdf_unfoldedSymbolic score in PDF format with unfolded repetitions
| - score_sibelius_repetitionsSymbolic score in Sibelius format with repeat signs
| - score_sibelius_unfoldedSymbolic score in Sibelius format with unfolded repetitions
| - score_xml_repetitionsSymbolic score in MusicXML format with repeat signs
| - score_xml_unfoldedSymbolic score in MusicXML format with unfolded repetitions
| - score_midiMIDI export of the symbolic score
- 1_AudioAudio files with coherent structure
- 2_AnnotationsAnnotations with musical and physical timelines
| - ann_score_noteNote events with start and end given in musical time
| - ann_score_chordHarmony annotations given in musical time
| - ann_score_localkeyLocal key annotations given in musical time
| - ann_score_globalkeyGlobal key annotations
| - ann_score_structureFineFine structure annotations given in musical time
| - ann_score_structureCoarseCoarse structure annotations given in musical time
| - ann_audio_noteNote events with start and end given in physical time
| - ann_audio_midiNote events in physical time in MIDI format
| - ann_audio_beatBeat annotations given in physical time
| - ann_audio_measureMeasure annotations given in physical time
| - ann_audio_startEndStart and end of audio recordings (for removing silence/applause) given in physical time
| - ann_audio_syncInfoAlignment tuples for converting between musical and physical timeline
| - ann_audio_modificationsAnnotations for structural modifications of recordings
| - ann_audio_chordHarmony annotations given in physical time
| - ann_audio_localkeyLocal key annotations given in physical time
| - ann_audio_structureFineFine structure annotations given in physical time
| - ann_audio_structureCoarseCoarse structure annotations given in physical time
- 3_ScriptsPython scripts to convert raw data into the structured format
Table 2

Overview of audio recordings in the BPSD. The upper four performances with identifiers WK64, FG58, FJ62, and AS35 are in the public domain and freely accessible within the BPSD. All remaining recordings are commercially available and can be identified using the EAN code. Durations are presented in the format hh:mm:ss.

IDPerformerYearLabelEAN CodeOrig. Dur.Final Dur.
WK64Wilhelm Kempff1964Deutsche Grammophon02894479662903:18:2603:45:31
FJ62Fritz Jank1962Instituto Piano Brasileiroavailable at IMSLP03:35:1303:41:26
FG58Friedrich Gulda1958Decca02894851451903:34:0003:34:00
AS35Artur Schnabel1935Warner Classics019029597505003:31:0303:33:35
MC22Muriel Chemin2022Odradek85531700361504:08:2204:05:11
MB97Malcolm Bilson et al.1997Claves761993197072103:52:2303:46:08
AB96Alfred Brendel1996Philips02894125752903:54:3403:52:28
JJ90Jeno Jando1990NAXOS73009915022403:41:0603:39:14
DB84Daniel Barenboim1984Deutsche Grammophon028941375926, 02894137662603:58:3703:58:37
VA81Vladimir Ashkenazy1981London Records02894437062103:48:1603:46:27
FG67Friedrich Gulda1967Amadeo02894768761003:25:0203:25:02
Total40:47:0841:07:45
Table 3

Overview of the first movements of Beethoven’s 32 Piano Sonatas. The table displays information including the work ID, trivial name (if applicable), global key, mean, minimum, and maximum duration of available recordings (see Table 2), number of measures, and the coarse structure. All durations are presented in the format mm:ss.

No.Work IDNameKeyMean Dur.Min. Dur.Max. DurMeas.Structure
01Op002No1-01F:min03:4703:22 (AS35)04:33 (WK64)200E-E-D-R
02Op002No2-01A:maj07:0406:23 (FG67)07:45 (MC22)452E-E-D-R
03Op002No3-01C:maj10:1509:47 (FG58)11:25 (MC22)347E-E-D-R-C
04Op007-01Grand SonataEb:maj08:1707:27 (AS35)08:58 (MC22)497E-E-D-R-C
05Op010No1-01C:min05:3304:41 (AS35)06:13 (MC22)388E-E-D-R
06Op010No2-01F:maj05:3805:03 (FG67)06:14 (VA81)268E-E-D-R
07Op010No3-01D:maj06:5906:26 (FJ62)07:53 (JJ90)467E-E-D-R-C
08Op013-01PathétiqueC:min08:5608:06 (FG58)09:57 (MC22)431I-E-E-D-R-C
09Op014No1-01E:maj06:3505:31 (VA81)07:25 (AB96)222E-E-D-R-C
10Op014No2-01G:maj07:0605:49 (AS35)07:56 (AB96)263E-E-D-R-C
11Op022-01Bb:maj07:2606:43 (AS35)08:36 (MC22)267E-E-D-R
12Op026-01Funeral MarchAb:maj08:0106:51 (FG67)10:02 (AS35)219T-V1-V2-V3-V4-V5
13Op027No1-01Son. q. u. fant.Eb:maj05:1204:36 (AB96)05:42 (FG58)106An-Al-T1
14Op027No2-01MoonlightC#:min06:0104:58 (AS35)07:28 (FG58)69P1-P2-P3-C
15Op028-01PastoralD:maj09:5808:58 (FJ62)11:39 (MC22)622E-E-D-R-C
16Op031No1-01G:maj06:2305:44 (FG58)07:19 (MC22)435E-E-D-R-C
17Op031No2-01TempestD:min08:2706:49 (FG58)09:52 (MC22)320E-E-D-R-C
18Op031No3-01The HuntEb:maj08:2907:53 (FG67)09:07 (MB97)341E-E-D-R-C
19Op049No1-01Easy SonataG:min04:3503:41 (JJ90)05:17 (MB97)143E-E-D-R-C
20Op049No2-01Easy SonataG:maj04:3704:19 (FJ62)05:10 (MC22)174E-E-D-R
21Op053-01WaldsteinC:maj10:3809:25 (FG67)11:36 (MC22)387E-E-D-R-C
22Op054-01F:maj05:3804:58 (AS35)06:13 (MC22)154M1-Tr1-M2-Tr2-M3-C
23Op057-01AppassionataF:min09:3507:35 (FG67)10:39 (DB84)262E-D-R-C
24Op078-01A ThérèseF#:maj07:0406:20 (FG58)08:18 (MC22)206I-E-E-D-R-D-R
25Op079-01CuckooG:maj04:4003:58 (AS35)05:12 (MC22)372E-E-D-R-D-R-C
26Op081a-01Les adieuxEb:maj07:0406:00 (FG67)07:50 (DB84)308I-E-E-D-R-C
27Op090-01E:min05:3504:34 (FG67)06:19 (MB97)245E-D-R-C
28Op101-01A:maj04:0003:35 (WK64)04:29 (DB84)102E-D-R-C
29Op106-01Hammer-klavierBb:maj11:0608:54 (AS35)13:04 (DB84)530E-E-D-R-C
30Op109-01E:maj03:4603:14 (WK64)04:19 (DB84)99E-D-R-C
31Op110-01Ab:maj06:3306:00 (FJ62)07:33 (DB84)116E-D-R-C
32Op111-01C:min09:0508:20 (AS35)10:04 (VA81)209I-E-E-D-R-C
Table 4

Accuracy of synchronization approaches. The table presents absolute errors between measure estimates obtained from audio-audio synchronization (based on manually annotated measure positions for WK64) and score-audio synchronization. Mean, median, and the confidence interval for all measures (left side) and for only those measures with a note onset (right side) are reported. All values are given in milliseconds.

All MeasuresMeasures With Note Onset
VersionMeanMedian95% Conf.MeanMedian95% Conf.
WK64201340141140
FJ62251960191845
FG58231641171440
AS35251554181340
MC22272063212060
MB97302060202047
AB96281846181740
JJ90241743161640
DB84291960191854
VA81241756171641
FG672594017840
All251753181640
tismir-7-1-196-g2.png
Figure 2

Overview of various annotations in the BPSD illustrated using the first measures of the Sonata Op. 14, No. 2 in G Major. Measure positions are marked with red ticks, while beat positions are indicated by red dashed lines.

tismir-7-1-196-g3.png
Figure 3

Synchronized score-audio training pair for learning pitch-class representations using a frame-wise loss function.

tismir-7-1-196-g4.png
Figure 4

Visualization of a time-diatonic representation derived from the WK64 recording of the first movement of the Piano Sonata Op. 14 No 2 in G Major. The local-key reference annotations are indicated by the overlaid red rectangles.

tismir-7-1-196-g5.png
Figure 5

Cross-version chord recognition for the initial measures of the Piano Sonata Op. 31 No. 1 in G major. The results are presented for all 11 performances, alongside the majority vote and the reference annotations.

DOI: https://doi.org/10.5334/tismir.196 | Journal eISSN: 2514-3298
Language: English
Submitted on: Mar 28, 2024
Accepted on: Aug 12, 2024
Published on: Sep 19, 2024
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2024 Johannes Zeitler, Christof Weiß, Vlora Arifi-Müller, Meinard Müller, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.