Have a personal or library account? Click to login
The Potential of Unsupervised Induction of Harmonic Syntax for Jazz Cover

The Potential of Unsupervised Induction of Harmonic Syntax for Jazz

Open Access
|Jun 2025

Abstract

Hierarchical structures describing a syntax of harmony have long been studied and proposed by music theorists, but algorithms that model these structures either require costly expert annotations for training or are based on music theorists' predispositions about harmonic syntax. We build upon a line of work that models harmonic sequences with probabilistic context-free grammars (PCFGs), inspired by the well-known formalism for syntax in human language. By using neural networks for parameter sharing when estimating PCFG rule probabilities, we learn the grammar in an entirely unsupervised manner. Our model induces a harmonic syntax purely from data, with minimal bias, and with parse trees as latent variables, while simply maximizing the likelihood of training sequences. This frees us from the need, for the first time, both for expert-annotated harmonic syntax trees and for human-defined grammar rules. We propose improvements inspired by music theory, including chord symbol representations and a training objective that facilitates the inclusion of short and frequent chord progressions that are based on musical relations. Experiments show that our methods can model harmony in datasets of jazz pieces, often resulting in realistic parse trees that overlap with expert annotations, without access to these annotations during training at all. Code, models, and predictions are publicly available.1

DOI: https://doi.org/10.5334/tismir.217 | Journal eISSN: 2514-3298
Language: English
Submitted on: Sep 2, 2024
Accepted on: May 12, 2025
Published on: Jun 20, 2025
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2025 Ruben Cartuyvels, John Koslovsky, Marie-Francine Moens, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.