Skip to main content
Have a personal or library account? Click to login
PianoCoRe: Combined and Refined Piano MIDI Dataset Cover
By: Ilya Borovik  
Open Access
|Apr 2026

Figures & Tables

Table 1

Comparison of major symbolic piano performance datasets and PianoCoRe dataset with its tiers. Sources: R—recorded (Disklavier/Hardware), T—transcribed (Audio‑to‑MIDI), T‑HQ—transcribed labeled as high quality. Metadata: P—performer, S—piano solo probability, D—deduplication flag, Q—quality label. Annotations are not available for all performances. Number of unique composer names computed from raw metadata, not manually verified.

DatasetComposersPiecesPerformancesHoursSourcesScoresAlignmentsMetadata
MAESTRO431,276199RnonoP
(n)ASAP162221,06792R100%beat/noteP
GiantMIDI2,78610,85510,8551,237TnonoS
ATEPP251,59611,7421,009T43.6%noP, Q
Aria‑MIDI19,0211,186,253100,629TnonoS, P
PERiScoPe822,73846,4733,784R, T81.9%noteP
PianoCoRe‑C4835,625250,04621,763R, T75.3%noP
PianoCoRe‑B4785,591214,09218,757R, T75.0%noP, D, Q
PianoCoRe‑A1511,591157,20712,509R, T100%noteP, D, Q
PianoCoRe‑A*1371,517130,27510,330R, T‑HQ100%noteP, D, Q
Figure 1

The three‑stage data matching and annotation pipeline used to create PianoCoRe dataset.

Figure 2

Statistical overview of the PianoCoRe‑C dataset for the 50 most represented composers. Top: The total number of unique pieces per composer (blue) and the number of pieces with a musical score (light blue). Bottom: The average number of performances per piece, accumulated by the MIDI source.

Figure 3

Distribution of the number of musical pieces by the number of performances in PianoCoRe‑C.

Figure 4

MIDI performances from ASAP (orange) and ATEPP (blue) grouped by original labels and mapped as a function of performance‑to‑score note ratio Rn and adjusted alignment ratio Ra.

Table 2

Distribution of MIDI quality labels computed using the alignment‑based heuristics for the deduplicated, aligned performances in PianoCoRe‑B.

HQLQCNo Label
170,3124,54514040,597
Table 3

MIDI quality classification dataset splits.

SHQLQC
training2,5002,5002,5002,500
 real9532,5001,00086
 synth1,54701,5002,414
test20020020054
calibration6626,52589354
Table 4

Evaluation of MIDI quality classifiers using F1 scores. Best scores in bold. no synth—no synthetic training data, mean—mean pooling (no [CLS]), no TL—no transformer layer before the classifier head, no MLM—token embeddings and classifier only. The last block shows feature‑masking ablations.

ModelSHQLQCAvg.
base1.0000.8390.7770.9460.891
no synth1.0000.7590.7780.9460.871
mean1.0000.8280.7520.8810.865
mean, no TL0.9930.8020.7130.8510.840
no MLM0.9950.7730.6670.8420.819
mask Pitch1.0000.8030.7230.9130.860
mask Timing0.9900.7880.7470.8510.844
mask Velocity1.0000.8340.7760.8930.876
Table 5

PianoCoRe dataset and its source subsets labeled by the MIDI quality classifier.

SourceSHQLQC
ASAP01,06600
ATEPP010,231900433
GiantMIDI112,071525
PERiScoPe8234,596914
Aria‑MIDI1,151180,97718,35917
PianoCoRe1,244228,94119,402459
Figure 5

Real‑world alignment challenges motivating the RAScoP pipeline. Top: local timing errors (crossed links) and missing/extra notes. Bottom: large structural deviation from a missing score segment, causing incorrect links. Other performed notes remain usable. Alignments were computed with Parangonar.

Figure 6

Note‑level alignment and the RAScoP pipeline for alignment refinement. The processing steps are demonstrated using an artificial example containing all types of errors. Score notes are drawn in black and performance notes are drawn in blue and green.

Figure 7

Distribution of inter‑onset deviations and beat tempos for alignments before processing (‑), after hole processing (H), after onset cleaning (O), and after both hole and onset cleaning (H + O).

Table 6

Mean alignment recall R after different alignment refinement stages and the ratio of sequences (%) inside different recall bands.

RawAfter HAfter H+O
BandRA%RH%RH+O%
0.95–1.000.97554.30.97553.90.97342.9
0.90–0.950.92926.60.92926.70.92830.4
0.85–0.900.87910.10.87810.00.87813.3
0.80–0.850.8284.70.8284.60.8286.5
0.75–0.800.7792.10.7782.20.7773.2
0.70–0.750.7251.10.7271.00.7281.6
0.60–0.700.6600.70.6631.10.6611.5
0.00–0.600.4710.40.4640.50.4620.6
all0.935100.00.934100.00.920100.0
Figure 8

Validation loss curves for PianoFlow trained on different subsets of the data. Larger and refined training datasets reduce overfitting in the long run.

Table 7

Correlation between the features of the rendered and PianoCoRe‑A performances. First row—intra‑set correlations, other rows—models trained on different data subsets. Vel—velocity, IOI—inter‑onset‑interval, OD—relative onset deviation, Art—sustained articulation. The best scores are in bold.

VelIOIODArt
Dataset0.57±0.190.90±0.060.22±0.170.44±0.19
ASAP0.37±0.170.83±0.110.07±0.150.28±0.13
+ ATEPP0.42±0.160.85±0.110.12±0.140.35±0.15
+ PERiScoPe0.41±0.170.86±0.110.11±0.170.36±0.17
PianoCoRe‑A0.40±0.170.86±0.110.10±0.170.35±0.17
RRAScoP0.70.39±0.160.85±0.110.09±0.160.35±0.18
 w/o RAScoP0.41±0.160.85±0.110.09±0.160.36±0.18
Table 8

Conditional performance rendering (performance continuation) results across training subsets and unseen source sequences. Size denotes the training set size. Vel—Velocity (MIDI bins), TS—TimeShift (s), TD—TimeDurationSustain (s). Lower is better; best values are in bold.

ASAPATEPPPERiScoPeAria‑MIDI
DatasetSizeVelTSTDVelTSTDVelTSTDVelTSTD
ASAP1 k9.8850.0230.1879.9280.0220.2069.8930.0230.2309.9570.0270.275
+ ATEPP6 k9.1570.0170.1688.2300.0150.1918.7820.0160.2168.7210.0190.252
+ PERiScoPe25 k8.8510.0160.1547.8880.0130.1898.1170.0150.1928.1330.0170.230
PianoCoRe‑A124 k8.6130.0160.1557.9670.0140.1948.0940.0150.1947.8720.0170.205
RRAScoP0.7141 k8.6310.0160.1587.9440.0140.1968.0710.0150.1947.9210.0170.206
 w/o RAScoP124 k8.7340.0170.1598.0590.0150.1938.1990.0160.1968.0550.0180.211
DOI: https://doi.org/10.5334/tismir.333 | Journal eISSN: 2514-3298
Language: English
Submitted on: Aug 17, 2025
Accepted on: Mar 16, 2026
Published on: Apr 27, 2026
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2026 Ilya Borovik, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.