Have a personal or library account? Click to login
Environment Recognition for Digital Audio Forensics Using MPEG-7 and MEL Cepstral Features Cover

Environment Recognition for Digital Audio Forensics Using MPEG-7 and MEL Cepstral Features

Open Access
|Aug 2011

Abstract

Environment recognition from digital audio for forensics application is a growing area of interest. However, compared to other branches of audio forensics, it is a less researched one. Especially less attention has been given to detect environment from files where foreground speech is present, which is a forensics scenario. In this paper, we perform several experiments focusing on the problems of environment recognition from audio particularly for forensics application. Experimental results show that the task is easier when audio files contain only environmental sound than when they contain both foreground speech and background environment. We propose a full set of MPEG-7 audio features combined with mel frequency cepstral coefficients (MFCCs) to improve the accuracy. In the experiments, the proposed approach significantly increases the recognition accuracy of environment sound even in the presence of high amount of foreground human speech.

DOI: https://doi.org/10.2478/v10187-011-0032-0 | Journal eISSN: 1339-309X | Journal ISSN: 1335-3632
Language: English
Page range: 199 - 205
Published on: Aug 26, 2011
In partnership with: Paradigm Publishing Services
Publication frequency: 6 issues per year

© 2011 Ghulam Muhammad, Khalid Alghathbar, published by Slovak University of Technology in Bratislava
This work is licensed under the Creative Commons License.

Volume 62 (2011): Issue 4 (July 2011)