Adaptive Thresholding of Wavelet Coefficients for Denoising Noisy Speech Signals

Subject areas: Electrical and Computer Engineering

Fatemeh Sheikhalishahi ¹, Hamidreza Abutalebi ², Mohammadreza Taban ³

1 - Yazd University
2 - Yazd University
3 - Yazd University

Received: 1386/06/06 Accepted: 1387/04/14 Published: 1388/01/01 (Solar Hijri calendar)

Keywords: speech enhancement, wavelet transform, adaptive thresholding

Abstract:

This paper addresses speech enhancement in the wavelet domain. In the proposed method, the noisy signal is first decomposed into wavelet sub-bands, and an adaptive thresholding function is then applied to the wavelet coefficients. In sub-bands that carry very high speech energy, a smaller threshold and the hard thresholding function are used; conversely, in sub-bands with negligible speech energy, a larger threshold and the soft thresholding function are used. In regions that fall between these two cases, the thresholding function is set adaptively, between the two limiting cases of hard and soft thresholding. The parameter that determines the thresholding function and the threshold value in each wavelet sub-band is related to the ratio of speech power to noise power in that sub-band. Experiments comparing the method with earlier approaches show that this technique removes noise well and reduces distortion in the output speech. In addition, simulation results indicate that growing the wavelet tree further contributes to improving the output of the enhancement system, and that the suitable wavelet type depends on the type of noise present in the environment.

English abstract:

This paper addresses the problem of speech enhancement in the wavelet domain. After the noisy signal is decomposed into wavelet sub-bands, an adaptive thresholding process is applied to the wavelet coefficients. In the proposed technique, a small threshold value and the hard thresholding function are used in sub-bands with high speech energy; conversely, in sub-bands with low speech energy, a large threshold value and the soft thresholding function are employed. For the remaining sub-bands, whose speech energy lies between these two extremes, we use an adaptive thresholding function that lies between the soft and hard thresholding functions. The threshold value and the thresholding function are determined by a parameter related to the ratio of speech power to noise power in each sub-band. Our extensive experiments show that the proposed method is superior in removing background noise and in reducing speech distortion. The experiments also show that both the wavelet tree structure and the wavelet type affect the performance of the speech denoising system.
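
The thresholding rule described in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the actual formulas, so everything specific here is an assumption made for illustration: the blending parameter `alpha`, the mapping `alpha = 1 / (1 + SNR)`, the threshold scaling `base_threshold * (0.5 + alpha)`, and all function names are hypothetical and are not the authors' definitions. The sketch only reproduces the qualitative behaviour: high speech-to-noise ratio gives a smaller threshold and near-hard thresholding, low ratio gives a larger threshold and near-soft thresholding.

```python
import math


def adaptive_threshold(coeff, threshold, alpha):
    """Shrink one wavelet coefficient.

    alpha = 0 reproduces hard thresholding, alpha = 1 reproduces soft
    thresholding, and values in between blend the two (assumed form).
    """
    magnitude = abs(coeff)
    if magnitude <= threshold:
        return 0.0  # coefficients at or below the threshold are zeroed
    # Surviving coefficients are shrunk toward zero by alpha * threshold.
    return math.copysign(magnitude - alpha * threshold, coeff)


def subband_parameters(speech_power, noise_power, base_threshold):
    """Map a sub-band's speech-to-noise power ratio to (threshold, alpha).

    Hypothetical mapping, not taken from the paper.
    """
    snr = speech_power / noise_power if noise_power > 0 else float("inf")
    alpha = 1.0 / (1.0 + snr)  # high SNR -> near 0 (hard), low SNR -> near 1 (soft)
    threshold = base_threshold * (0.5 + alpha)  # weaker speech -> larger threshold
    return threshold, alpha


def denoise_subband(coeffs, speech_power, noise_power, base_threshold):
    """Apply the adaptive rule to every coefficient of one sub-band."""
    threshold, alpha = subband_parameters(speech_power, noise_power, base_threshold)
    return [adaptive_threshold(c, threshold, alpha) for c in coeffs]
```

For example, `denoise_subband([4.0, -0.5], 3.0, 1.0, 2.0)` uses a power ratio of 3, which under the assumed mapping gives `alpha = 0.25` and a threshold of 1.5; the small coefficient is zeroed and the large one is only lightly shrunk. In a full system the sub-band powers would come from a noise estimator and the coefficients from a wavelet (packet) decomposition, neither of which is sketched here.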

