Have a personal or library account? Click to login
The GigaMIDI Dataset with Features for Expressive Music Performance Detection Cover

The GigaMIDI Dataset with Features for Expressive Music Performance Detection

Open Access
|Feb 2025

Figures & Tables

tismir-8-1-203-a1.png
Heuristic 1

Calculation of Distinctive Note Velocity/Onset Deviation Ratios

tismir-8-1-203-a2.png
Heuristic 2

Calculation of Note Onset Median Metric Level

tismir-8-1-203-g1.png
Figure 1

Four classes (NE = non‑expressive, EO = expressive onset, EV = expressive velocity, and EP = expressively performed) using heuristics in Section 4.2 for the expressive performance detection of MIDI tracks in GigaMIDI.

Table 1

Sample of symbolic datasets in multiple formats, including MIDI, ABC, MusicXML, and Guitar Pro formats.

DatasetFormatFilesHoursInstruments
GigaMIDIMIDI>1.43M>40,000Misc.
MetaMIDIMIDI436,631>20,000Misc.
Lakh MIDIMIDI174,533>9,000Misc.
DadaGPGuitar Pro22,677>1,200Misc.
ATEPPMIDI11,6771,000Piano
Essen FolksongABC9,03456.62Piano
NES MusicMIDI5,27846.1Misc.
MID‑FiLDMIDI4,422>40Misc.
MAESTROMIDI1,282201.21Piano
Groove MIDIMIDI1,15013.6Drums
JSB ChoralesMusicXML382>4Misc.

[i] ATEPP = Automatically Transcribed Expressive Piano Performances.

tismir-8-1-203-g2.png
Figure 2

Distribution of the duration in bars of the files from each subset of the GigaMIDI dataset. The x‑axis is clipped to 300 for better readability.

Table 2

Number of MIDI note events by instrument group in percentage (IGN = instrument group number, CP = chromatic percussion, and FX = effect).

IGN: 1‑8EventsIGN: 9‑16Events
Piano60.2%Reed/Pipe1.1%
CP2.4%Drums17.4%
Organ1.8%Synth Lead0.5%
Guitar6.7%Synth Pad0.6%
Bass4.2%Synth FX0.3%
String1.1%Ethnic0.3%
Ensemble2.1%Percussive FX0.3%
Brass0.7%Sound FX0.3%
tismir-8-1-203-g3.png
Figure 3

Distribution of files in GigaMIDI according to (a) MIDI notes, and (b) ticks per quarter note (TPQN).

tismir-8-1-203-g4.png
Figure 4

Musicmap style topology (Crauwels, 2016).

tismir-8-1-203-g5.png
Figure 5

Distribution of musical style in GigaMIDI.

tismir-8-1-203-g6.png
Figure 6

Example of each duple onset metric level grid in different colors using circles and dotted lines for the position of onsets, where k = 6.

Table 3

Optimal threshold selection results based on the 80% training set, showing the optimal threshold value for each heuristic where the P4 value is maximized.

HeuristicThresholdP4
Distinct velocity520.7727
Distinct onset420.7225
DNVR40.965%0.7727
DNODR4.175%0.9529
NOMMLLevel 120.9952
Table 4

Detection results (%) for expressive performance in each MIDI track class within the GigaMIDI dataset.

Class &
NE (62.5%) < 42 & < 52
EO (7.2%) ≥ 42 & < 52
EV (27.4%) < 42 & ≥ 52
EP (2.9%) ≥ 42 & ≥ 52

[i] The analysis is based on the number of distinct velocity levels (D‑V = distinct velocity) and onset‑time deviations (D‑O = distinct onset). Categories include non‑expressive (NE), expressive onset (EO), expressive velocity (EV), and expressively performed (EP).

Table 5

Results (%) of expressive performance detection for each MIDI track class in GigaMIDI based on the calculation of (DNODR), and (DNVR).

Class &
NE (52.3%) < 4.175% & < 40.965%
EO (9.1%) ≥ 4.175% & < 40.965%
EV (24.2%) < 4.175% & ≥ 40.965%
EP (14.4%) ≥ 4.175% & ≥ 40.965%
tismir-8-1-203-g7.png
Figure 7

Distribution of MIDI tracks according to (a) NOMML (level between 0 and 12, where k = 6) for MIDI tracks in GigaMIDI. The NOMML heuristic investigates duple and triplet onsets, including onsets that cannot be categorized as duple‑ or triplet‑based MIDI grids, and (b) instruments for expressively performed tracks in the GigaMIDI dataset.

Table 6

Classification accuracy of each heuristic for expressive performance detection.

Detection heuristicsClassification accuracyRanking
Distinct velocity77.9%4
Distinct onset77.9%4
DNVR83.4%3
DNODR98.2%2
NOMML100%1
Table 7

True positives (TP), true negatives (TN), false positives (FP), and false negatives (FN) based on the threshold set by P4 for heuristics, including correct negatives (CN) (in percentages).

Heuristic (%)TPTNFPFNCN
Distinct velocity35.442.521.20.998.0
Distinct onset24.853.110.611.582.2
DNVR35.448.021.20.998.2
DNODR34.563.701.7797.3
NOMML36.363.700100
DOI: https://doi.org/10.5334/tismir.203 | Journal eISSN: 2514-3298
Language: English
Submitted on: May 14, 2024
Accepted on: Nov 11, 2024
Published on: Feb 7, 2025
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2025 Keon Ju Maverick Lee, Jeff Ens, Sara Adkins, Pedro Sarmento, Mathieu Barthet, Philippe Pasquier, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.