Have a personal or library account? Click to login
Telephone Speech Endpoint Detection using Mean-Delta Feature Cover

Telephone Speech Endpoint Detection using Mean-Delta Feature

By: Atanas Ouzounov  
Open Access
|Jul 2014

Abstract

In the study the efficiency of three features for trajectory-based endpoint detection is experimentally evaluated in the fixed-text Dynamic Time Warping (DTW) - a based speaker verification task with short phrases of telephone speech. The employed features are Modified Teager Energy (MTE), Energy-Entropy (EE) feature and Mean-Delta (MD) feature. The utterance boundaries in the endpoint detector are provided by means of state automaton and a set of thresholds based only on trajectory characteristics. The training and testing have been done with noisy telephone speech (short phrases in Bulgarian language with length of about 2 s) selected from BG-SRDat corpus. The results of the experiments have shown that the MD feature demonstrates the best performance in the endpoint detection tests in terms of the verification rate.

DOI: https://doi.org/10.2478/cait-2014-0025 | Journal eISSN: 1314-4081 | Journal ISSN: 1311-9702
Language: English
Page range: 127 - 139
Published on: Jul 15, 2014
Published by: Bulgarian Academy of Sciences, Institute of Information and Communication Technologies
In partnership with: Paradigm Publishing Services
Publication frequency: 4 issues per year

© 2014 Atanas Ouzounov, published by Bulgarian Academy of Sciences, Institute of Information and Communication Technologies
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 License.