Improving Motif Discovery of Symbolic Polyphonic Music with Motif Note Identification

Jun-You Wang; Yu-Chia Kuo; Li Su

doi:10.5334/tismir.250

Abstract

Motif discovery in polyphonic symbolic music data is an important yet challenging task in music processing. In this paper, we propose a novel motif-discovery method created by combining the traditional rule-based repeated pattern discovery algorithms with a machine learning–based model that performs the task of motif note identification, i.e., identifying whether or not a note belongs to a motif. More specifically, the motif note identification model extracts motif notes for subsequent repeated pattern discovery. Removing non-motif notes can reduce the unwanted outputs in repeated pattern discovery and thereby improve performance. With a limited amount of training data, motif note identification can be implemented by fine-tuning a pre-trained model for symbolic music using pseudo-labels. The results demonstrate the feasibility of applying data-driven methods to assist the motif-discovery task, specifically on the occurrence and three-layer metrics, under the situation that labeled training data of the motif and repeated pattern are scarce.

References

Benammar, R., Largeron, C., Eglin, V., and Pardoen, M. (2017). Discovering motifs with variants in music databases. In 16th International Symposium, IDA (pp. 14–26). Springer.
Back to article
Björklund, O. (2022). SIATEC‑C: Computationally efficient repeated pattern discovery in polyphonic music. In Proceedings of the 23rd International Society for Music Information Retrieval Conference (ISMIR) (pp. 59–66). Bengaluru, India: ISMIR.
Back to article
Boot, P., Volk, A., and de Haas, W. B. (2016). Evaluating the role of repeated patterns in folk song classification and compression. Journal of New Music Research, 45(3), 223–238.
Back to article
Cambouropoulos, E., Crochemore, M., Iliopoulos, C., Mouchard, L., and Pinzon, Y. (2002). Algorithms for computing approximate repetitions in musical sequences. International Journal of Computer Mathematics, 79(11), 1135–1148.
Back to article
Chew, E., and Wu, X. (2004). Separating voices in polyphonic music: A contig mapping approach. In U. K. Wiil (Ed.), International Symposium on Computer Music Modeling and Retrieval (CMMR) (pp. 1–20). Springer.
Back to article
Chou, Y.‑H., Chen, I.‑C., Chang, C.‑J., Ching, J., and Yang, Y.‑H. (2021). MidiBERT‑Piano: Large‑scale pre‑training for symbolic music understanding. arXiv preprint arXiv:2107.05223
Back to article
Collins, T. (2011). Improved Methods for Pattern Discovery in Music, With Applications in Automated Stylistic Composition (Doctoral dissertation). The Open University.
Back to article
Collins, T. (2013). 2013: Discovery of Repeated Themes & Sections. https://www.music-ir.org/mirex/wiki/2013:Discovery_of_Repeated_Themes_%26_Sections.
Back to article
Collins, T., Arzt, A., Flossmann, S., and Widmer, G. (2013). SIARCT‑CFP: Improving precision and the discovery of inexact musical patterns in point‑set representations. In Proceedings of the 14th International Society for Music Information Retrieval Conference, ISMIR (pp. 549–554). Curitiba, Brazil: ISMIR.
Back to article
Collins, T., Thurlow, J., Laney, R. C., Willis, A., and Garthwaite, P. H. (2010). A comparative evaluation of algorithms for discovering translational patterns in baroque keyboard works. In Proceedings of the 11th International Society for Music Information Retrieval Conference (ISMIR) (pp. 3–8). Utrecht, Netherlands: ISMIR.
Back to article
Deutsch, D. (2013). Grouping mechanisms in music. In The Psychology of Music (pp. 183–248). Elsevier.
Back to article
Devlin, J., Chang, M., Lee, K., and Toutanova, K. (2019). BERT: Pre‑training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL‑HLT (pp. 4171–4186). Minneapolis, MN, USA: Association for Computational Linguistics.
Back to article
Drabkin, W. (2001). Motif. https://www.oxfordmusiconline.com/grovemusic/view/10.1093/gmo/9781561592630.001.0001/omo-9781561592630-e-0000019221.
Back to article
Finkensiep, C., Déguernel, K., Neuwirth, M., and Rohrmeier, M. (2020). Voice‑leading schema recognition using rhythm and pitch features. In Proceedings of the 21st International Society for Music Information Retrieval Conference (ISMIR) (pp. 520–526). Montreal, Canada: ISMIR.
Back to article
Forth, J., and Wiggins, G. A. (2009). An approach for identifying salient repetition in multidimensional representations of polyphonic music. In London Algorithmics 2008: Theory and Practice (pp. 44–58).
Back to article
Foscarin, F., Karystinaios, E., Nakamura, E., and Widmer, G. (2024). Cluster and separate: A GNN approach to voice and staff prediction for score engraving. In Proceedings of the 25th International Society for Music Information Retrieval Conference (ISMIR) (pp. 503–510). San Francisco, California, USA: ISMIR.
Back to article
Frieler, K., Höger, F., Pfleiderer, M., and Dixon, S. (2018). Two web applications for exploring melodic patterns in jazz solos. In Proceedings of the 19th International Society for Music Information Retrieval Conference (ISMIR) (pp. 777–783). Paris, France: ISMIR.
Back to article
Guiomard‑Kagan, N., Giraud, M., Groult, R., and Levé, F. (2016). Improving voice separation by better connecting contigs. In Proceedings of the 17th International Society for Music Information Retrieval Conference (ISMIR) (pp. 164–170). New York City, USA: ISMIR.
Back to article
Hsiao, W., Liu, J., Yeh, Y., and Yang, Y. (2021). Compound word transformer: Learning to compose full‑song music over dynamic directed hypergraphs. In Thirty‑Fifth AAAI Conference on Artificial Intelligence, AAAI (pp. 178–186). Virtual Event: AAAI Press.
Back to article
Hsiao, Y., and Su, L. (2021). Learning note‑to‑note affinity for voice segregation and melody line identification of symbolic music data. In Proceedings of the 22nd International Society for Music Information Retrieval Conference (ISMIR) (pp. 285–292). Online: ISMIR.
Back to article
Hsiao, Y., Hung, T., Chen, T., and Su, L. (2023). BPS‑motif: A dataset for repeated pattern discovery of polyphonic symbolic music. In Proceedings of the 24th International Society for Music Information Retrieval Conference (ISMIR) (pp. 281–288). Milan, Italy: ISMIR.
Back to article
Hsu, J.‑L., Chen, A. L., and Liu, C.‑C. (1998). Efficient repeating pattern finding in music databases. In Proceedings of the 7th International Conference on Information and Knowledge Management (pp. 281–288). Bethesda, Maryland, USA: ACM.
Back to article
Hu, Z., Ma, X., Liu, Y., Chen, G., and Liu, Y. (2022). The beauty of repetition in machine composition scenarios. In Proceedings of the 30th ACM International Conference on Multimedia (pp. 1223–1231). Lisboa, Portugal: ACM.
Back to article
Huron, D. (2001). Tone and voice: A derivation of the rules of voice‑leading from perceptual principles. Music Perception, 19(1), 1–64.
Back to article
Janssen, B., De Haas, W. B., Volk, A., and Van Kranenburg, P. (2014). Finding repeated patterns in music: State of knowledge, challenges, perspectives. In Sound, Music, and Motion: 10th International Symposium on Computer Music Multidisciplinary Research (CMMR) (pp. 277–297). Springer.
Back to article
Janssen, B., van Kranenburg, P., and Volk, A. (2017). Finding occurrences of melodic segments in folk songs employing symbolic similarity measures. Journal of New Music Research, 46(2), 118–134.
Back to article
Karystinaios, E., Foscarin, F., and Widmer, G. (2023). Musical voice separation as link prediction: Modeling a musical perception task as a multi‑trajectory tracking problem. In Proceedings of the Thirty‑Second International Joint Conference on Artificial Intelligence (IJCAI) (pp. 3866–3874). Macao, China: IJCAI.
Back to article
Kilian, J., and Hoos, H. H. (2002). Voice separation: A local optimization approach. In Proceedings of the 3rd International Conference on Music Information Retrieval (ISMIR). Paris, France: ISMIR.
Back to article
Kosta, K., Lu, W. T., Medeot, G., and Chanquion, P. (2022). A deep learning method for melody extraction from a polyphonic symbolic music representation. In Proceedings of the 23rd International Society for Music Information Retrieval Conference (ISMIR) (pp. 757–763). Bengaluru, India: ISMIR.
Back to article
Krause, M., Zalkow, F., Zalkow, J., Weiß, C., and Müller, M. (2020). Classifying leitmotifs in recordings of operas by Richard Wagner. In Proceedings of the 21st International Society for Music Information Retrieval Conference (ISMIR) (pp. 473–480). Montreal, Canada: ISMIR.
Back to article
Lartillot, O. (2014). In‑depth motivic analysis based on multiparametric closed pattern and cyclic sequence mining. In Proceedings of the 15th International Society for Music Information Retrieval Conference (ISMIR) (pp. 361–366). Taipei, Taiwan: ISMIR.
Back to article
Loshchilov, I., and Hutter, F. (2019). Decoupled weight decay regularization. In 7th International Conference on Learning Representations (ICLR). New Orleans, LA, USA: ICLR.
Back to article
Louboutin, C., and Meredith, D. (2016). Using general‑purpose compression algorithms for music analysis. Journal of New Music Research, 45(1), 1–16.
Back to article
Lu, W. T., and Su, L. (2018). Deep learning models for melody perception: An investigation on symbolic music data. In Asia‑Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) (pp. 1620–1625). Honolulu, HI, USA: IEEE.
Back to article
Meredith, D. (2006). Point‑set algorithms for pattern discovery and pattern matching in music. In Dagstuhl Seminar Proceedings on Content‑Based Retrieval. Schloss Dagstuhl, Germany: Dagstuhl.
Back to article
Meredith, D. (2013). COSIATEC and SIATECCompress: Pattern discovery by geometric compression. In Music Information Retrieval Evaluation eXchange (MIREX).
Back to article
Meredith, D. (2016). Analysing music with point‑set compression algorithms. In Computational Music Analysis (pp. 335–366). Springer.
Back to article
Meredith, D. (2019). RECURSIA‑RRT: Recursive translatable point‑set pattern discovery with removal of redundant translators. In International Workshops of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD) (pp. 485–493). Würzburg, Germany: Springer.
Back to article
Meredith, D., Lemström, K., and Wiggins, G. A. (2002). Algorithms for discovering repeated patterns in multidimensional representations of polyphonic music. Journal of New Music Research, 31(4), 321–345.
Back to article
Pinto, A. (2010). Relational motif discovery via graph spectral ranking. In Proceedings of the Eighth Workshop on Mining and Learning with Graphs, MLG (pp. 102–109). Washington, D.C., USA: ACM.
Back to article
Shen, Z., Yang, L., Yang, Z., and Lin, H. (2023). More than simply masking: Exploring pre‑training strategies for symbolic music understanding. In Proceedings of the 2023 ACM International Conference on Multimedia Retrieval (pp. 540–544). Thessaloniki, Greece: ACM.
Back to article
Shih, Y.‑J., Wu, S.‑L., Zalkow, F., Müller, M., and Yang, Y.‑H. (2023). Theme transformer: Symbolic music generation with theme‑conditioned transformer. IEEE Transactions on Multimedia, 25, 3495–3508.
Back to article
Simonetta, F., Cancino‑Chacón, C. E., Ntalampiras, S., and Widmer, G. (2019). A convolutional approach to melody line identification in symbolic scores. In Proceedings of the 20th International Society for Music Information Retrieval Conference (ISMIR) (pp. 924–931). Delft, The Netherlands: ISMIR.
Back to article
Srinivasamurthy, A., Gulati, S., Caro Repetto, R., and Serra, X. (2021). Saraga: Open datasets for research on indian art music. Empirical Musicology Review, 16(1), 85–98.
Back to article
Uitdenbogerd, A. L., and Zobel, J. (1999). Melodic matching techniques for large music databases. In Proceedings of the 7th ACM International Conference on Multimedia (pp. 57–66). Orlando, FL, USA: ACM.
Back to article
van Kranenburg, P., Janssen, B., and Volk, A. (2016). The meertens tune collections: The annotated corpus (mtc‑ann) versions 1.1 and 2.0.1. Meertens Online Reports. 2016‑1
Back to article
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, L., and Polosukhin, I. (2017). Attention is all you need. In Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems (pp. 5998–6008). Long Beach, CA, USA: NIPS.
Back to article
Velarde, G., Meredith, D., and Weyde, T. (2016). A wavelet‑based approach to pattern discovery in melodies. In D. Meredith (Ed.), Computational Music Analysis (pp. 303–333). Springer International Publishing.
Back to article
Whittall, A. (2011). Motif. In A. Latham (Ed.), The Oxford Companion to Music. Oxford University Press.
Back to article
Wu, Y., Dannenberg, R. B., and Xia, G. (2023). Motif‑centric representation learning for symbolic music. CoRR, abs/2309.10597.
Back to article
Zhao, J., Taniar, D., Adhinugraha, K., Baskaran, V. M., and Wong, K. (2023). Multi‑MMLG: A novel framework of extracting multiple main melodies from MIDI files. Neural Computing and Applications, 35(30), 22687–22704.
Back to article
Zhao, Z. (2024). Adversarial‑MidiBERT: Symbolic music understanding model based on unbias pre‑training and mask fine‑tuning. CoRR, abs/2407.08306.
Back to article

Improving Motif Discovery of Symbolic Polyphonic Music with Motif Note Identification

Abstract

Paradigm

My account