Research on potentialities of audio information recovery from video without audio track

Authors

DOI:

https://doi.org/10.3103/S0735272719060050

Keywords:

speech recovery from video, speech intelligibility, signal-to-noise ratio, SNR, filtration

Abstract

The article analyzed the possibility of the appearance of an acoustic information leakage channel that is formed by analyzing the video stream on the video record. The authors investigated the possibilities of speech recovery from a low quality recording, determined by the signal-to-noise ratio (SNR), sampling frequency, number of quantization levels, and clipping level, taking into account the features of the leakage channel under study. As a result, the required frame rate of the video image, the minimum SNR, the number of quantization levels, and a sufficient dynamic displacement range of the oscillating object are determined. The authors also investigated the requirements for the leakage channel parameters and possible ways for an attacker to improve its quality. The requirements for the displacement of an object oscillating under the action of acoustic waves in a video were calculated. The article justified the potential of reducing the requirements for the displacement of an object by applying averaging of a large number of different points on the object. The authors performed an assessment of the existing noise reduction software for sound recordings, which is used to increase the intelligibility of the message that is intercepted by the attacker in the considered information leakage channel. Obtained results revealed that there are potential causes for the leakage of acoustic information by analyzing the video stream on the video. The conditions for the emergence of such a channel are not excessive. Therefore, the possibility of its appearance is a security risk and it is necessary to provide the means to protect the object of information activity from it.

References

OLEYNIKOV, A.N.; SHIROKIY, A.N. “Mathematical modeling of the acoustic channel of voice information leakage,” Radiotekhnika (Kharkiv), n.177, p.161, 2014. URI: http://nure.ua/wp-content/uploads/2014/Scientific_editions/177/26.pdf.

GRANZOTTO, N.; BETTARELLO, F.; FERLUGA, A.; MARSICH, L.; SCHMID, C.; FAUSTI, P.; CANIATO, M. “Energy and acoustic performances of windows and their correlation,” Energy Buildings, v.136, p.189, 2017. DOI: https://doi.org/10.1016/j.enbuild.2016.12.024.

TETER, A.; GAWRYLUK, Ja. “Experimental modal analysis of a rotor with active composite blades,” Composite Structures, v.153, p.451, 2016. DOI: https://doi.org/10.1016/j.compstruct.2016.06.013.

ROTHBERG, S.J.; ALLEN, M.S.; CASTELLINI, P.; DI MAIO, D.; DIRCKX, J.J.J.; EWINS, D.J.; HALKON, B.J.; MUYSHONDT, P.; PAONE, N.; RYAN, T.; STEGER, H.; TOMASINI, E.P.; VANLANDUIT, S.; VIGNOLA, J.F. “An international review of laser Doppler vibrometry: Making light work of vibration measurement,” Optics Lasers Engineering, v.99, p.11, 2017. DOI: https://doi.org/10.1016/j.optlaseng.2016.10.023.

ZALEVSKY, Z.; BEIDERMAN, Y.; MARGALIT, I.; GINGOLD, S.; TEICHER, M.; MICO, V.; GARCIA, J. “Simultaneous remote extraction of multiple speech sources and heart beats from secondary speckles pattern,” Opt. Express, v.17, n.24, p.21566, 2009. DOI: https://doi.org/10.1364/OE.17.021566.

WADHWA, N.; RUBINSTEIN, M.; DURAND, F.; FREEMAN, W.T. “Riesz pyramids for fast phase-based video magnification,” Proc. of IEEE Int. Conf. on Computational Photography, ICCP, 2-4 May 2014, Santa Clara, USA. IEEE, 2014. DOI: https://doi.org/10.1109/ICCPHOT.2014.6831820.

DAVIS, Abe; RUBINSTEIN, Michael; WADHWA, Neal; MYSORE, Gautham J.; DURAND, Fredo; FREEMAN, William T. “The visual microphone: passive recovery of sound from video,” MIT, Microsoft Res., Adobe Res. URI: https://people.csail.mit.edu/mrub/papers/VisualMic_SIGGRAPH2014.pdf.

LOIZOU, P.C. “Speech enhancement based on perceptually motivated bayesian estimators of the magnitude spectrum,” IEEE Trans. Speech Audio Processing, v.13, n.5, p.857, 2005. DOI: https://doi.org/10.1109/TSA.2005.851929.

BOLL, S. “Suppression of acoustic noise in speech using spectral subtraction,” IEEE Trans. Acoustics, Speech, Signal Processing, v.27, n.2, p.113, 1979. DOI: https://doi.org/10.1109/TASSP.1979.1163209.

PEETERS, J.; LOUARROUDI, E.; DE GREEF, D.; VANLANDUIT, S.; DIRCKX, J.J.J.; STEENACKERS, G. “Time calibration of thermal rolling shutter infrared cameras,” Infrared Physics & Technology, v.80, p.145, 2017. DOI: https://doi.org/10.1016/j.infrared.2016.12.001.

LYKOV, Yu.V.; MOROZOVA, A.D.; KUKUSH, V.D. “Influence of features of information leakage channels on intelligibility of eavesdropped voice messages,” Technol. Audit Production Reserves, v.1, n.2, p.4, 2017. DOI: https://doi.org/10.15587/2312-8372.2017.90571.

LYKOV, Yu.V.; MOROZOVA, H.D. “Researching the possibility of restoring audio information from a video without audio track,” Proc. of 21st Int. Forum of Young Scientists on Radio Electronics and Youth in the XXI Century, 25-27 Apr. 2017, Kharkiv, Ukraine. KNURE, Kharkiv, 2017, v.3.

SAPOZHKOV, M.A. Electroacoustics. Textbook for Universities [in Russian]. Svyaz’, Moscow, 1978.

Published

2019-06-29

Issue

Section

Research Articles