Estimation of late reverberation spectrum: optimization of parameters

Arkadiy M. Prodeus; Valeriy P. Ovsianyk

doi:10.3103/S0735272715070043

Authors

Arkadiy M. Prodeus National Technical University of Ukraine "Kyiv Polytechnic Institute", Ukraine https://orcid.org/0000-0001-7640-0850
Valeriy P. Ovsianyk National Technical University of Ukraine "Kyiv Polytechnic Institute", Ukraine

DOI:

https://doi.org/10.3103/S0735272715070043

Keywords:

late reverberation suppression, late reverberation spectrum, automatic speech recognition, speech signal quality, Acc%, PESQ, BSD, SRR, LSD

Abstract

Correction of speech signals distorted by reverberation is topical in building communications systems, automatic speech recognition systems, and hearing aids. The late reverberation suppression by the spectral subtraction method or the frequency correction method involves the need of estimating the late reverberation spectrum. Though the procedure of such estimation is generally developed, a number of uncertain items related to its optimization still exist. Recommendations elaborated in this study make it possible to optimize the estimation of late reverberation spectrum in terms of such criteria as the speech signal quality and the accuracy of automatic speech recognition by using computer simulation methods.

References

COHEN, ISRAEL; HUANG, YITENG; CHEN, JINGDONG; BENESTY, JACOB. Noise Reduction in Speech Processing. Berlin, Heidelberg: Springer-Verlag, 2009, DOI: http://dx.doi.org/10.1007/978-3-642-00296-0.
NAYLOR, PATRIK A.; GAUBITCH, NIKOLAY D. (eds.), Speech Dereverberation. London: Springer, 2010, ISBN-10: 1849960550.
EPHRAIM, Y.; MALAH, D. Speech enhancement using a minimum mean square-square error log-spectral amplitude estimator. IEEE Trans. Acoust., Speech, Signal Process., Apr. 1985, v.33, n.2, p.443-445, DOI: http://dx.doi.org/10.1109/TASSP.1985.1164550.
YOSHIOKA, T.; SEHR, A.; DELCROIX, M.; KINOSHITA, K.; MAAS, R.; NAKATANI, T.; KELLERMANN, W. Making machines understand us in reverberant rooms: robustness against reverberation for automatic speech recognition. IEEE SP Magazine (Signal Processing), Nov. 2012, v.29, n.6, p.114-126, DOI: http://dx.doi.org/10.1109/MSP.2012.2205029.
LEBART, K.; BOUCHER, J.M.; DENBIGH, P.N. A new method based on spectral subtraction for speech dereverberation. Acta Acust. United Ac., 2001, v.87, n.3, p.359-366, http://www.ingentaconnect.com/content/dav/aaua/2001/00000087/00000003/art00008.
HABETS, EMANUEL. Single- and multi-microphone speech dereverberation using spectral enhancement. Dissert. … Doctor of Philosophy Electrical Engineering. Technische Universiteit Eindhoven, 2007, 241 p.
PRODEUS, A.; LADOSHKO, O. On existence of optimal boundary value between early reflections and late reverberation. Proc. of IEEE 34th Int. Sci. Conf. on Electronics and Nanotechnology, ELNANO, 15-18 April 2014, Kyiv, Ukraine. IEEE, 2014, p.442-446, DOI: http://dx.doi.org/10.1109/ELNANO.2014.6873443.
BROOKS, M. VOICEBOX: Speech Processing Toolbox for MATLAB. Imperial College London, Electrical Engineering Department, http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html.
YOUNG, S.; EVERMANN, G.; GALES, M.; ET AL. The HTK Book. Cambridge: University Engineering Department, 2009, 375 p.
LOIZOU, P. Speech Enhancement: Theory and Practice. Boca Raton: CRC Press, 2007, 648 p.
BEERENDS, JOHN G.; LARSEN, ERIK; IYER, NANDINI; VAN VUGT, JEROEN M. Measurement of speech intelligibility based on the PESQ approach. Proc. of Int. Conf. on Measurement of Speech and Audio Quality in Networks, MESAQIN, 2 June 2004, Prague, Czech Republic. Prague, 2004.