Have a personal or library account? Click to login
STAR Drums: A Dataset for Automatic Drum Transcription Cover
Open Access
|Jul 2025

Figures & Tables

tismir-8-1-244-g1.png
Figure 1

Overview of the STAR Drums creation process. The non‑drum stem and the original drum stem are obtained from the original mix by using an MSS algorithm. An ADT algorithm creates an estimated annotation from the original drum stem, which is then used to render the re‑synthesized drum stem by using virtual drum instruments. The re‑synthesized drum stem is mixed with the non‑drum stem, resulting in the STAR mix, which forms the STAR Drums dataset together with the estimated annotation, which is now regarded as the reference annotation.

Table 1

Overview of available ADT datasets.

DatasetNon‑drum Instr.DrumsVocalsMelodic Instr.# Drum ClassesLen. [h]
RWC Music Database (Goto et al., 2002)Rec.Rec.YesYes2918.1
ENST Drums (Gillet and Richard, 2006)Rec.Rec. & Synth.NoYes201.0
MDB Drums (Southall et al., 2017)Rec.Rec.YesYes200.4
RBMA13 (Vogl et al., 2017)Rec.Rec.YesYes231.9
TMIDT (Vogl et al., 2018)Synth.Synth.NoYes18257.1
Slakh (Manilow et al., 2019)Synth.Synth.NoYes118.3
A2MD (Wei et al., 2021)Rec.Rec.YesYes334.5
ADTOF‑RGW (Zehren et al., 2021)Rec.Rec.YesYes589.2
ADTOF‑YT (Zehren et al., 2023)Rec.Rec.YesYes5202.2
Proposed STAR DrumsRec.Synth.YesYes18124.5

[i] In Slakh, no mapping from MIDI notes to drum classes is provided. Therefore, the number of supported classes depends on the mapping created by the user.

tismir-8-1-244-g2.png
Figure 2

MIDI note velocity distribution of all drum classes and all tracks.

Table 2

Input data for STAR Drums.

DatasetInstrument stems provided# Total Tracks# Used TracksLen. Used Tracks [h]
MUSDB18 (Rafii et al., 2017)Yes1501509.8
ISMIR04 (Cano et al., 2006)No2000122898.4
MTG‑Jamendo (Bogdanov et al., 2019)No550004807302.9
Table 3

Splits of STAR Drums.

SplitOrigin of dataMSS algorithm appliedFull tracksLen. [h]
TrainingISMIR04YesNo20.6
TrainingMTG‑JamendoYesNo94.1
Training (total)ISMIR04 + MTG‑JamendoYesNo114.7
ValidationMUSDB18NoYes8.3 (6.7)
TestMUSDB18NoYes1.6 (0.3)

[i] Values in brackets indicate the duration of audio files that users must create by executing a mixing script. This is necessary because some track licenses of MUSDB18 do not permit the redistribution of remixed versions.

Table 4

Drum classes used with mapping to eight‑, five‑, and three‑class vocabulary, based on Vogl et al. (2018) and Zehren et al. (2023).

Class name# Classes
18853
Bass drumBDBDBDBD
Snare drumSDSDSDSD
Side stickSS
Hand clapCLP
Closed hi‑hatCHHHHHHHH
Pedal hi‑hatPHH
Open hi‑hatOHH
TambourineTB
Low tomLTTTTT
Mid tomMT
High tomHT
Splash cymbalSPCCYCY
Chinese cymbalCHC
Crash cymbalCRC
Ride cymbalRDRD
Ride bellRBBE
CowbellCB
Clave/sticksCLCL
tismir-8-1-244-g3.png
Figure 3

Genre distribution of the STAR Drums dataset.

tismir-8-1-244-g4.png
Figure 4

Relative class frequencies of STAR Drums, MDB Drums, ENST Drums, and RBMA13.

tismir-8-1-244-g5.png
Figure 5

Total number of detected drum sounds when transcribing ideal and non‑ideal non‑drum stems and ideal and non‑ideal drum stems of MUSDB18.

Table 5

Global F‑measure when training with TMIDT, Slakh, ADTOF, or STAR Drums and testing with MDB Drums, ENST Drums, RBMA13, or STAR Drums for models transcribing 3, 5, 8, and 18 classes.

ModelTest Datasets
CLMDB DrumsENST DrumsRBMA13STAR Drums Test
3TMIDT0.780.710.620.72
Slakh0.760.770.550.73
STAR Drums0.810.780.670.85
ADTOF‑RGW0.800.800.670.77
ADTOF‑YT0.830.790.620.72
5TMIDT0.650.690.550.61
Slakh0.680.720.480.59
STAR Drums0.790.770.620.82
ADTOF‑RGW0.780.750.600.72
ADTOF‑YT0.790.760.590.66
8TMIDT0.630.660.520.63
Slakh0.660.710.470.61
STAR Drums0.750.740.610.80
18TMIDT0.580.610.410.55
Slakh0.590.630.390.58
STAR Drums0.670.660.500.78
tismir-8-1-244-g6.png
Figure 6

Global F‑measure and F‑measure per instrument on MDB Drums for five classes when training with TMIDT, Slakh, ADTOF‑RGW, ADTOF‑YT, STAR Drums, and the original mix of STAR Drums (see Section 4.2.4). Class abbreviations are explained in Table 4.

tismir-8-1-244-g7.png
Figure 7

Global F‑measure and F‑measure per instrument on MDB Drums for 18‑class vocabulary when training with TMIDT, Slakh, STAR Drums, and the original mix of STAR Drums (see Section 4.2.4). The classes hand clap, cowbell, and clave/sticks are excluded as MUSDB18 does not contain annotations for these classes. Class abbreviations are explained in Table 4.

Table 6

Global F‑measure results for training with the original mix and estimated annotations (pseudo‑labels), STAR random mix that combines re‑synthesized drum stems and non‑drum stems from different tracks, and a combination of both methods when transcribing 5 and 18 classes.

ModelTest Datasets
CLSTAR DrumsTraining DataMDB DrumsENST DrumsRBMA13STAR Drums Test
5STAR mix0.790.770.620.82
Original mix0.780.800.600.73
STAR random mix0.780.770.610.81
STAR mix + original mix0.780.780.610.77
STAR mix + STAR random mix0.770.740.600.80
Original mix + STAR random mix0.780.780.600.77
18STAR mix0.670.660.510.78
Original mix0.660.720.530.65
STAR random mix0.650.670.510.77
STAR mix + original mix0.680.700.540.73
STAR mix + STAR random mix0.670.660.510.78
Original mix + STAR random mix0.670.690.530.73

[i] For comparison, results using the STAR mix are also provided.

DOI: https://doi.org/10.5334/tismir.244 | Journal eISSN: 2514-3298
Language: English
Submitted on: Dec 11, 2024
|
Accepted on: Jun 16, 2025
|
Published on: Jul 29, 2025
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2025 Philipp Weber, Christian Uhle, Meinard Müller, Matthias Lang, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.