Modified Method for Fundamental Frequency Detection of Voiced/Unvoiced Speech Signal in Noisy Environment

Md. Arifur Rahman, Md. Mahfuz Alam, Md. Firoz Ahmed, M. A. F. M. Rashidul Hasan

Abstract


An efficient fundamental frequency detection method is introduced in this paper. The method operates in the time domain. Instead of the original speech signal, the proposed method uses its center-clipped version to compute a modified autocorrelation function, which is then weighted by the reciprocal of the average magnitude difference function (AMDF) for fundamental frequency detection. Its performance is compared with that of other related methods in terms of gross pitch error and fine pitch error. A comprehensive evaluation of the fundamental frequency estimates on female and male voices in white noise shows the superiority of the proposed method over three related methods at low signal-to-noise ratios (SNR).
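The processing chain described in the abstract (center clipping, autocorrelation of the clipped signal, weighting by the reciprocal of the AMDF, peak picking) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The function names `center_clip` and `estimate_f0`, the clipping ratio of 0.3, the 50-400 Hz search range, the stabilizing constant `eps`, and the choice to compute the AMDF on the clipped signal are all assumptions made for illustration, since the abstract does not specify them.

```python
import numpy as np


def center_clip(frame, ratio=0.3):
    """Center-clip a frame: zero samples inside +/-c, shift the rest toward zero.

    The clipping level c = ratio * peak amplitude is an assumed convention.
    """
    x = np.asarray(frame, dtype=float)
    c = ratio * np.max(np.abs(x))
    return np.where(x > c, x - c, np.where(x < -c, x + c, 0.0))


def estimate_f0(frame, fs, fmin=50.0, fmax=400.0, clip_ratio=0.3, eps=1e-3):
    """Estimate F0 (Hz) of one frame with an AMDF-weighted autocorrelation.

    Returns 0.0 when no lag has a positive score (a crude, assumed stand-in
    for an unvoiced decision).
    """
    y = center_clip(frame, clip_ratio)
    n = len(y)
    lag_min = max(1, int(fs / fmax))
    lag_max = min(int(fs / fmin), n - 1)

    best_lag, best_score = 0, 0.0
    for lag in range(lag_min, lag_max + 1):
        a, b = y[:n - lag], y[lag:]
        acf = np.dot(a, b)              # autocorrelation of the clipped signal
        amdf = np.mean(np.abs(a - b))   # average magnitude difference
        score = acf / (amdf + eps)      # weight ACF by the reciprocal of AMDF
        if score > best_score:
            best_lag, best_score = lag, score

    return fs / best_lag if best_lag > 0 else 0.0
```

At the true pitch period the autocorrelation has a peak while the AMDF has a valley, so dividing one by the other sharpens the peak relative to spurious lags. A frame would typically span a few pitch periods, for example 40 ms at an 8 kHz sampling rate.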

Keywords


Fundamental frequency, Pitch, Center Clipping, White Noise.

DOI: http://dx.doi.org/10.52155/ijpsat.v45.1.6244

Copyright (c) 2024 M. Firoz Ahmed

This work is licensed under a Creative Commons Attribution 4.0 International License.