Have a personal or library account? Click to login
Jazz Trio Database: Automated Annotation of Jazz Piano Trio Recordings Processed Using Audio Source Separation Cover

Jazz Trio Database: Automated Annotation of Jazz Piano Trio Recordings Processed Using Audio Source Separation

Open Access
|Aug 2024

Figures & Tables

Table 1

Comparison of existing datasets for each instrument in the jazz piano trio.

InstrumentNameMethodTracksDuration (s)AnnotationsMetadata
PianoWJDManual65823,149 onsetsBeat, chord, section
PiJAMAAutomatic2,777804,9607,108,460 MIDI notesN/A
RWC-JazzManual51,672N/A*Beat, section
JTD (ours)Automatic1,294159,668866,116 onsets, 2,174,833 MIDI notesBeat
BassWJDAutomatic42649,0105,000 beat-wise pitchesBeat, chord, section
FiloBassAutomatic + manual4817,88053,646 MIDI notesDownbeat, chord
RWC-JazzManual51,672N/A*Beat, section
JTD (ours)Automatic1,294159,668543,693 onsetsBeat
DrumsWJDAutomatic + manual676,50628,851 cymbal onsetsBeat, chord, section
RWC-JazzManual51,672N/A*Beat, section
JTD (ours)Automatic1,294159,668796,604 onsetsBeat

[i] *Note: RWC-Jazz has not been made available free and open source, meaning that it is not possible to provide full detail here.

tismir-7-1-186-g1.png
Figure 1

Diagram shows the process for constructing JTD. Arrow block symbols indicate stages where tracks/artists may be removed.

tismir-7-1-186-g2.png
Figure 2

Total streams (“scrobbles”) of all recordings made by the top 20 “trio” artists most frequently tagged as “Jazz” on Last.fm.

tismir-7-1-186-g3.png
Figure 3

Duration of piano solo excerpts by all bandleaders: bar color indicates subset (either JTD or JTD-300).

tismir-7-1-186-g4.png
Figure 4

The number of tracks featuring the 10 performers on each instrument with the most recordings in JTD.

tismir-7-1-186-g5.png
Figure 5

Histogram grouping number of tracks by year of recording: bar color indicates subset.

Table 2

Optimized results per instrument, showing F-measure, precision, and recall (mean ± SD).

PianoBassDrumsBeatsDownbeats
F0.93 ± 0.030.93 ± 0.050.95 ± 0.030.97 ± 0.050.63 ± 0.44
P0.93 ± 0.040.94 ± 0.040.96 ± 0.040.97 ± 0.050.63 ± 0.44
R0.93 ± 0.040.93 ± 0.070.94 ± 0.040.97 ± 0.050.63 ± 0.44
Table 3

Results per method, piano only (mean ± SD).

MIDI transcription (1)Spectral flux (2)CNN: no filtering (3)CNN: narrow filter (4)
F0.77 ± 0.130.84 ± 0.060.92 ± 0.030.92 ± 0.03
P0.71 ± 0.160.79 ± 0.100.90 ± 0.060.95 ± 0.03
R0.86 ± 0.090.90 ± 0.040.93 ± 0.030.89 ± 0.05
tismir-7-1-186-g6.png
Figure 6

Mean duration of solos by different JTD pianists. Error bars show standard errors.

tismir-7-1-186-g7.png
Figure 7

Distribution of tempi (in quarter note beats per-minute).

tismir-7-1-186-g8.png
Figure 8

Each panel shows the distribution of log2 beat–upbeat ratios among instruments across JTD, normalized such that the height of the largest bar in each panel is 1. Dotted vertical lines show peaks of the density estimates; straight lines correspond to the musical notation given along the top of the panel.

tismir-7-1-186-g9.png
Figure 9

Markers show mean log2 beat–upbeat ratio and tempo; solid lines show predictions (without random effects); and shaded areas show 95% confidence intervals (obtained via bootstrapping over data from different pianists, N = 10,000)

tismir-7-1-186-g10.png
Figure 10

Diagram shows kernel density estimates for the relative position of beats by each instrument, indicated by color. Density estimates are scaled such that the maximum height of the curve for each instrument is 1.

DOI: https://doi.org/10.5334/tismir.186 | Journal eISSN: 2514-3298
Language: English
Submitted on: Feb 7, 2024
Accepted on: Jul 6, 2024
Published on: Aug 27, 2024
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2024 Huw Cheston, Joshua L. Schlichting, Ian Cross, Peter M. C. Harrison, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.