Automatic speech signal segmentation based on the innovation adaptive filter

Makowski, Ryszard; Hossa, Robert

Automatic speech signal segmentation based on the innovation adaptive filter

International Journal of Applied Mathematics and Computer Science

Volume 24 (2014): Issue 2 (June 2014)

By:

Ryszard Makowski and Robert Hossa

Open Access

|Jun 2014

References

Almpanidis, G. and Kotropoulos, C. (2007). Phonetic segmentation using the generalized Gamma distribution and small sample Bayesian information criterion, Speech Communication50(1): 38–55.10.1016/j.specom.2007.06.005
Search in Google Scholar
Almpanidis, G., Kotti, M. and Kotropoulos, C. (2009). Robust detection of phone boundaries using model selection criteria with few observations, IEEE Transactions on Audio, Speech, and Signal Processing17(2): 287–298.10.1109/TASL.2008.2009162
Search in Google Scholar
Barkat, M. (1991). Signal Detection and Estimation, Artech House, Boston, MA.
Search in Google Scholar
Brandt, A.V. (1983). Detecting and estimating the parameters jumps using ladder algorithms and likelihood ratio test, Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, Boston, MA, USA, pp. 1017–1020.
Search in Google Scholar
Brugnara, F., Falavinga, D. and Omolongo, M. (1993). Automatic segmentation and labeling of speech based on hidden Markov models, Speech Communication12(4): 357–370.10.1016/0167-6393(93)90083-W
Search in Google Scholar
Delacourt, P. and Wellekens, C.J. (2000). DISTBIC: A speaker-based segmentation for audio data indexing, Speech Communication32(1–2): 111–126.10.1016/S0167-6393(00)00027-3
Search in Google Scholar
Gomez, J.A. and Calvo, M. (2011). Improvements on automatic speech segmentation at the phonetic level, in C. San Martin and S.-W. Kim (Eds.), CIARP 2011, Lecture Notes in Computer Science, Vol. 7042, Springer-Verlag, Berlin/Heidelberg, pp. 557–564.10.1007/978-3-642-25085-9_66
Search in Google Scholar
Haykin, S. (1996). Adaptive Filter Theory, Prentice-Hall, Englewood Cliffs, NJ.
Search in Google Scholar
Jamouli, H., Al Hail, M.A. and Sauter, D. (2012). A mixed active and passive GLR test for a fault tolerant control system, International Journal of Applied Mathematics and Computer Science22(1): 9–23, DOI: 10.2478/v10006-012-0001-1.10.2478/v10006-012-0001-1
Search in Google Scholar
Kay, S.M. (1988). Modern Spectral Estimation, Prentice-Hall, Englewood Cliffs, NJ.
Search in Google Scholar
Kay, S.M. (1998). Fundamentals of Statistical Signal Processing, Vol. II: Detection Theory, Prentice-Hall, Englewood Clifft, NJ.
Search in Google Scholar
Kroon, P. and Deprettere, E.F. (1988). A class of analysis-by-synthesis predictive coders for high quality speech coding at rates between 4.8 and 16 kbits/s, IEEE Journal on Selected Areas in Communications6(2): 353–363.
Search in Google Scholar
Lee, D.T.L., Morf, M. and Friedlander, B. (1981). Recursive least squares ladder estimation algorithms, IEEE Transactions on Circuits and Systems28(6): 627–641.10.1109/TASSP.1981.1163587
Search in Google Scholar
Lopatka, M., Adam, O., Laplanche, C., Zarzycki, J. and Motsch, J-F. (2005). Effective analysis of non-stationary short-time signals based on the adaptive Schur filter, IEEE/SP 13th Workshop on Statistical Signal Processing, Bordeaux, France, pp. 251–256.
Search in Google Scholar
Lopatka, M., Adam, O., Laplanche, C., Motsch, J-F. and Zarzycki, J. (2006). Sperm whale click analysis using a recursive time-variant lattice filter, Applied Acoustics67(11–12): 1118–1133.10.1016/j.apacoust.2006.05.011
Search in Google Scholar
Makowski, R. and Zimroz, R. (2013). A procedure for weighted summation of the derivatives of reflection coefficients in adaptive Schur filter with application to fault detection in rolling element bearings, Mechanical Systems and Signal Processing38(1): 65–77.10.1016/j.ymssp.2012.05.005
Search in Google Scholar
Mporas, I., Ganchev, T. and Fakotakis, N. (2008). Phonetic segmentation using multiple speech features, International Journal of Speech Technology11(1): 73–85.10.1007/s10772-009-9038-4
Search in Google Scholar
Park, S.S. and Kim, N.S. (2007). On using multiple models for automatic speech segmentation, IEEE Transactions on Audio, Speech, and Language Processing15(8): 2202–2212.10.1109/TASL.2007.903933
Search in Google Scholar
Prasad, V.K., Nagarajan, T. and Murthy, H.A. (2004). Automatic segmentation of continuous speech using minimum phase delay functions, Speech Communication42(3–4): 429–446.10.1016/j.specom.2003.12.002
Search in Google Scholar
Puig, V. (2010). Fault diagnosis and fault tolerant control using set-membership approaches: Application to real case studies, International Journal of Applied Mathematics and Computer Science20(4): 619–635, DOI: 10.2478/v10006-010-0046-y.10.2478/v10006-010-0046-y
Search in Google Scholar
Rabiner, L. and Gold, B. (1975). Theory and Application of Digital Signal Processing, Prentice-Hall, Englewood Cliffs, NJ.
Search in Google Scholar
Rabiner, L. and Juang, B-H. (1993). Fundamentals of Speech Recognition, Prentice-Hall, Englewood Cliffs, NJ.
Search in Google Scholar
Rudoy, D., Quatieri, T.F. and Wolfe, P.J. (2011). Time-varying autoregressions in speech: Detection theory and applications, IEEE Transaction on Audio, Speech, and Language Processing19(4): 977–989.10.1109/TASL.2010.2073704
Search in Google Scholar
Scharenborg, O., Wan, V. and Ernestus, M. (2010). Unsupervised speech segmentation: An analysis of the hypothesized phone boundaries, Journal of Acoustical Society of America127(2): 1084–1095.10.1121/1.327719420136229
Search in Google Scholar
Schwarz, P., Matejka, P. and Cernocky, J. (2006). Hierarchical structures of neural networks for phoneme recognition, IEEE International Conference on Acoustics, Speech, and Signal Processing, Toulouse, France, Vol. 1, pp. 325–328.
Search in Google Scholar
Sharma, M. and Mammone, R. (1996). Blind speech segmentation: Automatic segmentation of speech without linguistic knowledge, Proceedings of the International Conference on Spoken Language Processing, Philadelphia, PA, USA, pp. 1237–1240.
Search in Google Scholar
Toledano, D.T., Hernandez Gomez, L.A. and Villarrubia Grande, L. (2003) Automatic phonetic segmentation, IEEE Transactions on Speech and Audio Processing11(6): 617–625.10.1109/TSA.2003.813579
Search in Google Scholar
Tyagi, V., Bourlard, H. and Wellekens, C. (2006). On variable-scale piecewise stationary analysis of speech signals for ASR, Speech Communication48(9): 1182–1191.10.1016/j.specom.2006.04.002
Search in Google Scholar

DOI: https://doi.org/10.2478/amcs-2014-0019 | Journal eISSN: 2083-8492 | Journal ISSN: 1641-876X

Journal RSS Feed

Language: English

Page range: 259 - 270

Submitted on: Jan 21, 2013

Published on: Jun 26, 2014

Published by: University of Zielona Góra

In partnership with: Paradigm Publishing Services

Publication frequency: 4 times per year

Keywords:

automatic speech segmentation,

inter-phoneme boundaries,

Schur adaptive ﬁltering,

detection threshold determination

Related subjects:

Mathematics,

Applied mathematics

© 2014 Ryszard Makowski, Robert Hossa, published by University of Zielona Góra
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 License.

Previous article Volume 24 (2014): Issue 2 (June 2014)Next article