A Speech Enhancement Method Based on Kalman Filtering

Authors: Chaogang Wu, Bo Li, Jin Zheng

Journal: International Journal of Wireless and Microwave Technologies (IJWMT)

Issue: Vol. 1, No. 2, 2011.

Open access

Enhancing speech degraded by non-stationary interferers is an important and difficult task in many signal processing applications. In this study, we present a monaural speech enhancement method based on spectral subtraction and Kalman filtering (KF). The method extracts the Liljencrants–Fant (LF) excitation during voiced speech, so that the nature of the glottal flow is maintained. The approach can therefore preserve the natural characteristics of the glottal pulse in Kalman filtering and thus achieve significant improvements in objective quality. The quality of the enhanced speech was evaluated using the perceptual evaluation of speech quality (PESQ) score. The results indicate that the proposed algorithm improves output speech quality compared with the conventional KF algorithm and sub-band spectral subtraction.
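To make the Kalman filtering step concrete, the following is a minimal sketch of a conventional scalar Kalman filter applied to a noisy signal. It is not the authors' method: it does not include spectral subtraction or the LF excitation, and it assumes a first-order autoregressive signal model with known parameters. The names `kalman_denoise_ar1`, `mse`, and `demo`, and all parameter values, are hypothetical and chosen only for illustration.

```python
import random


def kalman_denoise_ar1(noisy, a, process_var, noise_var):
    """Scalar Kalman filter for the assumed model
    x[k] = a * x[k-1] + w[k],  y[k] = x[k] + v[k],
    where w and v are zero-mean white noise with the given variances."""
    # Start from the stationary prior of the AR(1) process (requires |a| < 1).
    x_est = 0.0
    p_est = process_var / (1.0 - a * a)
    enhanced = []
    for y in noisy:
        # Prediction step: propagate the state estimate and its variance.
        x_pred = a * x_est
        p_pred = a * a * p_est + process_var
        # Update step: blend prediction and observation using the Kalman gain.
        gain = p_pred / (p_pred + noise_var)
        x_est = x_pred + gain * (y - x_pred)
        p_est = (1.0 - gain) * p_pred
        enhanced.append(x_est)
    return enhanced


def mse(u, v):
    """Mean squared error between two equal-length sequences."""
    return sum((p - q) ** 2 for p, q in zip(u, v)) / len(u)


def demo(n=2000, a=0.95, process_var=1.0, noise_var=4.0, seed=0):
    """Synthesize an AR(1) signal, add white noise, and filter it.
    Returns (MSE of noisy input, MSE of filtered output)."""
    rng = random.Random(seed)
    clean, x = [], 0.0
    for _ in range(n):
        x = a * x + rng.gauss(0.0, process_var ** 0.5)
        clean.append(x)
    noisy = [c + rng.gauss(0.0, noise_var ** 0.5) for c in clean]
    enhanced = kalman_denoise_ar1(noisy, a, process_var, noise_var)
    return mse(noisy, clean), mse(enhanced, clean)


if __name__ == "__main__":
    noisy_mse, enhanced_mse = demo()
    print("noisy MSE:", noisy_mse, "enhanced MSE:", enhanced_mse)
```

Real speech is usually modeled with a higher-order autoregressive filter whose parameters are estimated frame by frame. The scalar case is used here only because it shows the predict and update steps without any matrix algebra.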


Keywords: Speech enhancement, LF glottal flow, source separation

Short URL: https://sciup.org/15012732

IDR: 15012732

Research article