Have a personal or library account? Click to login
Statistical Analysis of Spectral Properties and Prosodic Parameters of Emotional Speech Cover

Statistical Analysis of Spectral Properties and Prosodic Parameters of Emotional Speech

By: J. Přibil and  A. Přibilová  
Open Access
|Sep 2009

References

  1. Iriondo, I., et al. (2009). Automatic refinement of an expressive speech corpus assembling subjective perception and automatic classification. Speech Communication, 51 (9), 744-758.10.1016/j.specom.2008.12.001
  2. Gobl, C., Ní Chasaide, A. (2003). The role of voice quality in communicating emotion, mood and attitude. Speech Communication, 40 (1-2), 189-212.10.1016/S0167-6393(02)00082-1
  3. d'Alessandro, C., et al. (1998). Effectiveness of a periodic and aperiodic decomposition method for analysis of voice sources. IEEE Transactions on Speech and Audio Processing, 6, 12-23.10.1109/89.650305
  4. Schoentgen, J. (2003). Decomposition of vocal cycle length perturbations into vocal jitter and vocal microtremor, and comparison of their size in normophonic speakers. Journal of Voice, 17, 114-125.10.1016/S0892-1997(03)00014-6
  5. Shahnaz, C., et al. (2006). A new technique for the estimation of jitter and shimmer of voiced speech signal. In Proceedings of the Canadian Conference on Electrical and Computer Engineering, CCECE 2006. IEEE, 2112-2115.10.1109/CCECE.2006.277799
  6. Farrús, M., et al. (2007). Jitter and shimmer measurements for speaker recognition. In Proceedings of the International Conference Interspeech 2007. Curran Associates, 778-781.10.21437/Interspeech.2007-147
  7. Perrot, P., et al. (2007). Voice disguise and automatic detection: review and perspectives. In Stylianou, Y., Faundez-Zanuy, M., Esposito, A. (eds.) Progress in Nonlinear Speech Processing (Lecture Notes in Computer Science / Image Processing, Computer Vision, Pattern Recognition, and Graphics). Springer, 101-117.10.1007/978-3-540-71505-4_7
  8. Murphy, P. (2008). Source-filter comparison of measurements of fundamental frequency perturbation and amplitude perturbation for synthesized voice signals. Journal of Voice, 22, 125-137.10.1016/j.jvoice.2006.09.00717147983
  9. Juslin, P.N., Laukka, P. (2003). Communication of emotions in vocal expression and music performance: different channels, same code? Psychological Bulletin, 129, 770-814.10.1037/0033-2909.129.5.77012956543
  10. Tao, J., et al. (2009). Realistic visual speech synthesis based on hybrid concatenation method. IEEE Transactions on Audio, Speech, and Language Processing, 17, 469-477.10.1109/TASL.2008.2011538
  11. Přibilová, A., Přibil, J. (2006). Non-linear frequency scale mapping for voice conversion in text-to-speech system with cepstral description. Speech Communication, 48, 1691-1703.10.1016/j.specom.2006.08.001
  12. Přibilová, A., Přibil, J. (2009). Spectrum modification for emotional speech synthesis. In Esposito, A., Hussain, A., Marinaro, M., Martone, R. (eds.) Multimodal Signals: Cognitive and Algorithmic Issues (Lecture Notes in Artificial Intelligence). Springer, 232-241.10.1007/978-3-642-00525-1_23
  13. Vích, R. (2000). Cepstral speech model, Padé approximation, excitation, and gain matching in cepstral speech synthesis. In Proceedings of the 15th Biennial EURASIP Conference Biosignal 2000. Brno: University of Technology, 77-82.
  14. Gray, A.H., Jr., Markel, J.D. (1974). A spectral-flatness measure for studying the autocorrelation method of linear prediction of speech analysis. IEEE Transactions on Acoustics, Speech, and Signal Processing, ASSP-22, 207-217.10.1109/TASSP.1974.1162572
  15. Ito, T., et al. (2005). Analysis and recognition of whispered speech. Speech Communication, 45, 139-152.10.1016/j.specom.2003.10.005
  16. Přibil, J., Přibilová, A. (2006). Voicing transition frequency determination for harmonic speech model. In Proceedings of the 13th International Conference on Systems, Signals and Image Processing, 25-28.
  17. Scherer, K.R. (2003). Vocal communication of emotion: a review of research paradigms. Speech Communication, 40, 227-256.10.1016/S0167-6393(02)00084-5
  18. Iida, A., et al. (2003). A corpus-based speech synthesis system with emotion. Speech Communication, 40, 161-187.10.1016/S0167-6393(02)00081-X
  19. Oppenheim, A.V., Schafer, R.W. (1989). Digital Signal Processing. New Jersey: Prentice Hall.
  20. Suhov, Y., Kelbert, M. (2005). Probability and Statistics by Example: Volume I, Basic Probability and Statistics. Cambridge University Press.
  21. Boersma, P., Weenink, D. (2008). Praat: doing phonetics by computer (Version 5.0.32) [Computer Program]. Retrieved August 12, 2008, from http://www.praat.org/
  22. Boersma, P., Weenink, D. (2007). Praat — tutorial. Intro 4. Pitch analysis. Retrieved September 5, 2007, from http://www.fon.hum.uva.nl/praat/manual/Intro_4___Pitch_analysis.html
  23. Vich, R., Nouza, J., Vondra, M. (2008). Automatic speech recognition used for intelligibility assessment of text-to-speech systems In Esposito, A., et al. (eds.) Verbal and Nonverbal Features of Human-Human and Human-Machine Interactions (Lecture Notes in Artificial Intelligence). Springer, 136-148.10.1007/978-3-540-70872-8_10
Language: English
Page range: 95 - 104
Published on: Sep 3, 2009
Published by: Slovak Academy of Sciences, Institute of Measurement Science
In partnership with: Paradigm Publishing Services
Publication frequency: Volume open

© 2009 J. Přibil, A. Přibilová, published by Slovak Academy of Sciences, Institute of Measurement Science
This work is licensed under the Creative Commons License.

Volume 9 (2009): Issue 4 (August 2009)