DOI: https://doi.org/10.3103/S0735272715070043
Open Access Open Access  Restricted Access Subscription Access
Relationship Acc%(Tl) for different values of averaging parameter

Estimation of late reverberation spectrum: optimization of parameters

Arkadiy M. Prodeus, Valeriy P. Ovsianyk

Abstract


Correction of speech signals distorted by reverberation is topical in building communications systems, automatic speech recognition systems, and hearing aids. The late reverberation suppression by the spectral subtraction method or the frequency correction method involves the need of estimating the late reverberation spectrum. Though the procedure of such estimation is generally developed, a number of uncertain items related to its optimization still exist. Recommendations elaborated in this study make it possible to optimize the estimation of late reverberation spectrum in terms of such criteria as the speech signal quality and the accuracy of automatic speech recognition by using computer simulation methods.

Keywords


late reverberation suppression; late reverberation spectrum; automatic speech recognition; speech signal quality; Acc%; PESQ; BSD; SRR; LSD

Full Text:

PDF

References


COHEN, ISRAEL; HUANG, YITENG; CHEN, JINGDONG; BENESTY, JACOB. Noise Reduction in Speech Processing. Berlin, Heidelberg: Springer-Verlag, 2009, DOI: http://dx.doi.org/10.1007/978-3-642-00296-0.

NAYLOR, PATRIK A.; GAUBITCH, NIKOLAY D. (eds.), Speech Dereverberation. London: Springer, 2010, ISBN-10: 1849960550.

EPHRAIM, Y.; MALAH, D. Speech enhancement using a minimum mean square-square error log-spectral amplitude estimator. IEEE Trans. Acoust., Speech, Signal Process., Apr. 1985, v.33, n.2, p.443-445, DOI: http://dx.doi.org/10.1109/TASSP.1985.1164550.

YOSHIOKA, T.; SEHR, A.; DELCROIX, M.; KINOSHITA, K.; MAAS, R.; NAKATANI, T.; KELLERMANN, W. Making machines understand us in reverberant rooms: robustness against reverberation for automatic speech recognition. IEEE SP Magazine (Signal Processing), Nov. 2012, v.29, n.6, p.114-126, DOI: http://dx.doi.org/10.1109/MSP.2012.2205029.

LEBART, K.; BOUCHER, J.M.; DENBIGH, P.N. A new method based on spectral subtraction for speech dereverberation. Acta Acust. United Ac., 2001, v.87, n.3, p.359-366, http://www.ingentaconnect.com/content/dav/aaua/2001/00000087/00000003/art00008.

HABETS, EMANUEL. Single- and multi-microphone speech dereverberation using spectral enhancement. Dissert. … Doctor of Philosophy Electrical Engineering. Technische Universiteit Eindhoven, 2007, 241 p.

PRODEUS, A.; LADOSHKO, O. On existence of optimal boundary value between early reflections and late reverberation. Proc. of IEEE 34th Int. Sci. Conf. on Electronics and Nanotechnology, ELNANO, 15-18 April 2014, Kyiv, Ukraine. IEEE, 2014, p.442-446, DOI: http://dx.doi.org/10.1109/ELNANO.2014.6873443.

BROOKS, M. VOICEBOX: Speech Processing Toolbox for MATLAB. Imperial College London, Electrical Engineering Department, http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html.

YOUNG, S.; EVERMANN, G.; GALES, M.; ET AL. The HTK Book. Cambridge: University Engineering Department, 2009, 375 p.

LOIZOU, P. Speech Enhancement: Theory and Practice. Boca Raton: CRC Press, 2007, 648 p.

BEERENDS, JOHN G.; LARSEN, ERIK; IYER, NANDINI; VAN VUGT, JEROEN M. Measurement of speech intelligibility based on the PESQ approach. Proc. of Int. Conf. on Measurement of Speech and Audio Quality in Networks, MESAQIN, 2 June 2004, Prague, Czech Republic. Prague, 2004.







© Radioelectronics and Communications Systems, 2004–2020
When you copy an active link to the material is required
ISSN 1934-8061 (Online), ISSN 0735-2727 (Print)
tel./fax +38044 204-82-31, 204-90-41