Table 1
Piano MIDI datasets. GP is the abbreviation for GiantMIDI-Piano. In the TYPE column, Seq. denotes sequenced MIDI files and Perf. denotes performance MIDI files.
| DATASET | COMPOSERS | WORKS | HOURS | TYPE |
|---|---|---|---|---|
| Piano-midi.de | 26 | 571 | 37 | Seq. |
| Classical Archives | 133 | 856 | 46 | Seq. |
| Kunstderfuge | 598 | – | – | Seq. |
| KernScores | – | – | – | Seq. |
| SUPRA | 111 | 410 | – | Perf. |
| ASAP | 16 | 222 | – | Perf. |
| MAESTRO | 62 | 529 | 84 | Perf. |
| MAPS | – | 270 | 19 | Perf. |
| GiantMIDI-Piano | 2,786 | 10,855 | 1,237 | 90% Perf. |
| Curated GP | 1,787 | 7,236 | 875 | 89% Perf. |

Figure 1
Number of solo piano works per composer in the curated GP dataset. The top 100 composers are shown.

Figure 2
Duration of solo piano works per composer in the curated GP dataset. The top 100 composers are shown.

Figure 3
Distribution of composers’ nationalities for the full GP dataset.

Figure 4
Pitch distribution of the top 100 composers in the curated GP dataset.

Figure 5
Note histogram for the curated GP dataset.

Figure 6
Note histogram for J.S. Bach, Beethoven, and Liszt from the curated GP dataset.

Figure 7
The number of notes per second of the top 100 composers in the curated GP dataset.

Figure 8
Pitch class distribution of six composers for the curated GP dataset.
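
A pitch class distribution of this kind can be sketched as follows. This is a minimal illustration, not the authors' procedure: it assumes each note is given as a MIDI pitch number, counts every note once (no weighting by duration or velocity), and uses a hypothetical helper name, `pitch_class_distribution`.

```python
from collections import Counter

PITCH_CLASS_NAMES = ["C", "C#", "D", "D#", "E", "F",
                     "F#", "G", "G#", "A", "A#", "B"]


def pitch_class_distribution(midi_pitches):
    """Return the relative frequency of each of the 12 pitch classes.

    Assumption: every note counts once, regardless of duration or velocity.
    """
    # MIDI pitch modulo 12 gives the pitch class (60 is middle C, class 0).
    counts = Counter(pitch % 12 for pitch in midi_pitches)
    total = sum(counts.values())
    if total == 0:
        return [0.0] * 12
    return [counts.get(pc, 0) / total for pc in range(12)]


# Illustrative input only: two Cs, one E, and one G.
example = pitch_class_distribution([60, 72, 64, 67])
for name, freq in zip(PITCH_CLASS_NAMES, example):
    if freq > 0:
        print(f"{name}: {freq:.2f}")
```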

Figure 9
Interval distribution of six composers for the curated GP dataset.

Figure 10
Trichord distribution of six composers for the curated GP dataset, showing relative (rel.) frequencies of the top six trichords.

Figure 11
Tetrachord distribution of six composers for the curated GP dataset showing the top six tetrachords.

Figure 12
Precision, recall, and F1 score of solo piano detection.
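
For reference, the three metrics named in this caption follow the standard definitions sketched below. The counts in the example are hypothetical and are not taken from the paper; the function name `precision_recall_f1` is likewise only illustrative.

```python
def precision_recall_f1(tp, fp, fn):
    """Standard precision, recall, and F1 from detection counts.

    tp: clips correctly detected as solo piano
    fp: clips wrongly detected as solo piano
    fn: solo piano clips that were missed
    """
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    # F1 is the harmonic mean of precision and recall.
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Hypothetical counts, for illustration only.
p, r, f1 = precision_recall_f1(tp=90, fp=10, fn=30)
print(f"precision={p:.3f} recall={r:.3f} f1={f1:.3f}")
```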

Table 2
Accuracy of retrieved music works of six composers.
| | J. S. BACH | MOZART | BEETHOVEN | CHOPIN | LISZT | DEBUSSY |
|---|---|---|---|---|---|---|
| Correct | 147 | 85 | 82 | 102 | 197 | 29 |
| Incorrect | 102 | 35 | 70 | 171 | 22 | 9 |
| Accuracy | 59% | 71% | 54% | 37% | 90% | 76% |

Table 3
Accuracy of retrieved music works of six composers, using the surname constraint.
| | J. S. BACH | MOZART | BEETHOVEN | CHOPIN | LISZT | DEBUSSY |
|---|---|---|---|---|---|---|
| Correct | 129 | 72 | 76 | 96 | 141 | 27 |
| Incorrect | 44 | 16 | 5 | 21 | 6 | 3 |
| Accuracy | 75% | 82% | 94% | 82% | 96% | 90% |
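
The accuracy rows in Tables 2 and 3 are consistent with accuracy = correct / (correct + incorrect), rounded to the nearest percent. The short check below recomputes them from the counts in the tables; the formula is inferred from the numbers rather than stated in the captions, and the variable and function names are illustrative.

```python
# Counts copied from Tables 2 and 3, in column order:
# J. S. Bach, Mozart, Beethoven, Chopin, Liszt, Debussy.
COMPOSERS = ["J. S. Bach", "Mozart", "Beethoven", "Chopin", "Liszt", "Debussy"]

TABLE_2 = {"correct": [147, 85, 82, 102, 197, 29],
           "incorrect": [102, 35, 70, 171, 22, 9]}
TABLE_3 = {"correct": [129, 72, 76, 96, 141, 27],
           "incorrect": [44, 16, 5, 21, 6, 3]}


def accuracy_percent(table):
    """Accuracy per composer as a whole-number percentage."""
    return [round(100 * c / (c + i))
            for c, i in zip(table["correct"], table["incorrect"])]


for label, table in (("Table 2", TABLE_2), ("Table 3", TABLE_3)):
    for composer, acc in zip(COMPOSERS, accuracy_percent(table)):
        print(f"{label} {composer}: {acc}%")
```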

Table 4
Piano transcription evaluation on the GiantMIDI-Piano dataset. D, I, S, and ER denote deletion, insertion, substitution, and error rate.
| | D | I | S | ER |
|---|---|---|---|---|
| MAESTRO | 0.009 | 0.024 | 0.018 | 0.061 |
| GiantMIDI-Piano | 0.015 | 0.051 | 0.069 | 0.154 |
| Relative difference | 0.006 | 0.026 | 0.047 | 0.094 |

Figure 13
From left to right: error rate (ER) of 52 solo piano works in the MAESTRO dataset; ER of 52 solo piano works in the GiantMIDI-Piano dataset; relative ER between the MAESTRO and the GiantMIDI-Piano dataset.
