Have a personal or library account? Click to login
PiJAMA: Piano Jazz with Automatic MIDI Annotations Cover
Open Access
|Sep 2023

Figures & Tables

Table 1

Overview of automatic piano transcription techniques and their performance on the datasets MAPS and MAESTRO. For Hawthorne et al. (2019), the MAPS results are from a training configuration with data augmentation, and the MAESTRO results are without augmentation. For Kong et al. (2021), the MAPS results were evaluated with the published checkpoint and the MAESTRO results are the published numbers. Note that this model was trained without data augmentation.

MAPSMAESTRO (V1)
MODELFRAME F1ONSET F1ON+OFFSET F1FRAME F1ONSET F1ON+OFFSET F1
Sigtia et al. (2016)72.2246.5818.38
Hawthorne et al. (2018)78.3082.2950.22
Hawthorne et al. (2019)84.9186.4467.4390.1595.3280.50
Kong et al (2021)82.7882.4056.5989.7196.7682.47
Hawthorne et al. (2021)88.0095.9583.46
Table 2

Evaluation on transcribed solo jazz piano performances. Due to varying quality in the transcriptions, we report metrics for both 50- and 100-millisecond note onset tolerance. The results on RWC Jazz and Jazz Web show little improvement from the increased tolerance, whereas the metrics on the human labeled evaluation sets show significant improvement, suggesting greater misalignment in these sources.

DATASET#HAWTHORNE ET AL.KONG ET AL.
NOTE F1 (50MS)NOTE F1 (100MS)NOTE F1 (50MS)NOTE F1 (100MS)
RWC Jazz40.9320.9380.9090.910
Jazz Web50.9560.9590.9260.926
Joe Bagg50.8760.9120.8060.858
Daan Schreuder80.8890.9100.8650.881
per recording average220.9080.9250.8730.891
tismir-6-1-162-g1.png
Figure 1

Diagram of the data collection process for the PiJAMA dataset. Stages with a filtering effect are represented with an arrow block symbol.

tismir-6-1-162-g2.png
Figure 2

Scatter plots depicting the relationship between transcription agreement and note onset F1 score. Each data point is computed from a performance in the MAPS test set.

tismir-6-1-162-g3.png
Figure 3

Pitch histogram of all note events in the PiJAMA dataset.

tismir-6-1-162-g4.png
Figure 4

Pitch histograms from pianists Jessica Williams (above) and Erroll Garner (below).

Table 3

Most frequently repeated compositions in the PiJAMA dataset.

FREQUENCYCOMPOSITION(S)
17Body and Soul
13All the Things You Are, Yesterdays
12Sophisticated Lady
11’Round Midnight
10Blue Monk
9Alone Together, Prelude to a Kiss, Sweet and Lovely
8Someday My Prince Will Come, Jitterbug Waltz, Night and Day, My Funny Valentine, Darn That Dream, Someone to Watch Over Me, Don’t Blame Me, Blue Bolero, I Should Care, Lush Life, Everything Happens to Me, In a Sentimental Mood, Con Alma
tismir-6-1-162-g5.png
Figure 5

Histogram grouping the number of artists by their duration of performance data, in half-hour increments. One pianist (Dick Hyman) is an outlier with over 18 hours of solo piano recordings.

tismir-6-1-162-g6.png
Figure 6

Total performance duration for each artist in the PiJAMA-30 subset.

tismir-6-1-162-g7.png
Figure 7

Bar plot of notes-per-second.

tismir-6-1-162-g8.png
Figure 8

Bar plot of mean sliding pitch class entropy.

Table 4

Accuracy of artist prediction models. Two test scores are presented for each model condition: the accuracy on the track-split (all tracks of the dataset shuffled into an 80-10-10 split) and the average accuracy across three album-splits (one random album held out for each artist, yielding roughly an 80-10-10 split). The Album Effect column is the difference between accuracies on the track-split and average album-split.

MODEL CONDITIONSPLITTEST ACCURACYALBUM EFFECT
Spectrogram CRNNTrackAlbum0.9140.2670.647
Spectrogram CRNN (Data Augmentation)TrackAlbum0.7820.3990.383
Transcription Feature CRNNTrackAlbum0.6320.4570.176
Transcription Feature CRNN (Data Augmentation)TrackAlbum0.6290.5450.085
Piano Roll CRNNTrackAlbum0.5560.5020.055
DOI: https://doi.org/10.5334/tismir.162 | Journal eISSN: 2514-3298
Language: English
Submitted on: Mar 2, 2023
|
Accepted on: Aug 4, 2023
|
Published on: Sep 15, 2023
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2023 Drew Edwards, Simon Dixon, Emmanouil Benetos, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.