On Evaluation of Inter- and Intra-Rater Agreement in Music Recommendation

Open Access | Nov 2021

Abstract

Our work is concerned with the subjective perception of music similarity in the context of music recommendation. We present two user studies that explore inter- and intra-rater agreement in the quantification of general similarity between pieces of recommended music. In contrast to previous efforts, our test participants are of more uniform age and share a comparable musical background, in order to lower variation within the participant group. The first study uses carefully curated song material from five distinct genres, while the second uses songs from a single genre only; almost all songs in both studies were previously unknown to the test participants. Repeating the listening tests with a two-week lag shows that intra-rater agreement is higher than inter-rater agreement in both studies. Agreement in the single-genre study is lower, since the genre of songs seems to be a major factor in judging similarity between songs. The mood of raters at test time is found to influence intra-rater agreement. We discuss the implications of our results for the evaluation of music recommenders and question the validity of experiments on general music similarity.

DOI: https://doi.org/10.5334/tismir.107 | Journal eISSN: 2514-3298
Language: English
Submitted on: Mar 26, 2021
Accepted on: Oct 12, 2021
Published on: Nov 24, 2021
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2021 Arthur Flexer, Taric Lallai, Katja Rašl, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.