Archives of Acoustics, 46, 2, pp. 213–228, 2021

Analysis and Experiment on the Limitations of Static and Dynamic Transaural Reproduction with Two Frontal Loudspeakers

Lulu LIU
South China University of Technology

Bosun XIE
South China University of Technology

By duplicating the binaural pressures of an actual source, transaural reproduction with two frontal loudspeakers is expected to recreate a virtual source in arbitrary direction. However, experiments indicated that in static transaural reproduction, the perceived virtual source is usually limited to the frontalhorizontal plane. The reasons for this limitation, as guessed, are that, in static reproduction, the dynamic cues for front-back and vertical localisation are incorrect, and the high-frequency spectral cues are unstable with head movement. To validate this hypothesis, the variations of ITD (interaural time difference) caused by head turning in both static and dynamic transaural reproductions are analysed. The results indicate that dynamic reproduction is able to create appropriate low-frequency ITD variations, and the static transaural reproduction is unable to do so. Psychoacoustic experiments are conducted to compare virtual source localisation in static and dynamic reproductions. The results indicate that dynamic reproduction is able to recreate the front, back, and vertical virtual source for low-pass stimuli below 3 kHz, while for full audible bandwidth stimulus, appropriate low-frequency dynamic cue and unstable high-frequency spectral cues in dynamic reproduction result in two splitting virtual sources.
Keywords: transaural reproduction; vertical localisation; dynamic cue
Full Text: PDF


Bauck J., Cooper D.H. (1996), Generalized transaural stereo and applications, Journal of the Audio Engineering Society, 44(9): 683–705.

Blauert J. (1997), Spatial hearing, [in:] The Psychophysics of Human Sound Localization, MIT Press, Cambridge.

Cooper D.H., Bauck J.L. (1989), Prospects for transaural recording, Journal of the Audio Engineering Society, 37(1/2): 3–19.

Damaske P. (1971), Head‐related two‐channel stereophony with loudspeaker reproduction, Journal of the Acoustical Society of America, 50(4B): 1109–1115, doi: 10.1121/1.1912742.

Gardner W.G. (1997), 3-D audio using loudspeakers, Ph.D. Thesis, University of Massachusetts Institute of Technology.

Gumerov N.A., Duraiswami R. (2009), A broadband fast multipole accelerated boundary element method for the three dimensional Helmholtz equation, Journal of the Acoustical Society of America, 125(1): 191–205, doi: 10.1121/1.3021297.

Hawksford M.J. (2002), Scalable multichannel coding with HRTF enhancement for DVD and virtual sound systems, Journal of the Audio Engineering Society, 50(11): 894–913.

Herre J., Hilpert J., Kuntz A., Plogsties J. (2015), MPEG-H audio – The new standard for coding of immersive spatial audio, IEEE Journal of Selected Topics in Signal Processing, 9(5): 770–779, doi: 10.1109/JSTSP.2015.2411578.

ITU-R Report BS.2159-7 (2015), Multichannel sound technology in home and broadcasting applications, International Telecommunication Union, Geneva.

Jiang J.L., Xie B.S., Mai H.M., Liu L.L., Yi K.L., Zhang C.Y. (2019), The role of dynamic cue in auditory vertical localisation, Applied Acoustics, 146: 398–408, doi: 10.1016/j.apacoust.2018.12.002.

Kawano S., Taira M., Matsudaira M., Abe Y. (1998), Development of the virtual sound algorithm, IEEE Transactions on Consumer Electronics, 44(3): 1189–1194, doi: 10.1109/30.713254.

Kirkeby O., Nelson P.A., Hamada H. (1998), The 'stereo dipole': A virtual source imaging system using two closely spaced loudspeakers, Journal of the Audio Engineering Society, 46(5): 387–395.

Kirkeby O., Nelson P.A. (1999), Digital filter design for inversion problems in sound reproduction, Journal of the Audio Engineering Society, 47(7/8): 583–595.

Kurabayashi H., Otani M., Itoh K., Hashimoto M., Kayama M. (2014), Sound image localization using dynamic transaural reproduction with non-contact head tracking, IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, 97(9): 1849–1858, doi: 10.1587/transfun.E97.A.1849.

Lopez J.J., Gonzalez A. (2001), Experimental evaluation of cross-talk cancellation regarding loudspeakers’ angle of listening, IEEE Signal Processing Letters, 8(1): 13–15, doi: 10.1109/97.889637.

Nelson P.A., Orduna-Bustamante F., Engler D., Hamada H. (1996), Experiments on a system for the synthesis of virtual acoustic sources, Journal of the Audio Engineering Society, 44(11): 990–1007.

Papadopoulos T., Nelson P.A. (2010), Choice of inverse filter design parameters in virtual acoustic imaging systems, Journal of the Audio Engineering Society, 58(1/2): 22–35.

Perrett S., Noble W. (1997), The effect of head rotations on vertical plane sound localization, Journal of the Acoustical Society of America, 102(4): 2325–2332, doi: 10.1121/1.419642.

Rui Y.Q., Yu G.Z., Xie B.S., Liu Y. (2013), Calculation of individualized near-field head-related transfer function database using boundary element method, Audio Engineering Society 134th Convention, Paper No. 8901, Rome.

Sakamoto N., Gotoh T., Kogure T., Shimbo M., Clegg A.H. (1981), Controlling sound-image localization in stereophonic reproduction, Journal of the Audio Engineering Society, 29(11): 794–799.

Sandvad J. (1996), Dynamic aspects of auditory virtual environments, Audio Engineering Society 100th Convention, Preprint No. 4226, Copenhagen.

Schroeder M.R., Atal B.S. (1963), Computer simulation of sound transmission in rooms, Proceedings of the IEEE, 51(3): 536–537, doi: 10.1109/PROC.1963.2180.

Schroeder M.R. (1970), Digital simulation of sound transmission in reverberant spaces, Journal of the Acoustical Society of America, 47(2A): 424–431, doi: 10.1121/1.1911541.

Takeuchi T., Nelson P.A., Kirkeby O., Hamada H. (1998), Influence of individual head-related transfer function on the performance of virtual acoustic imaging systems, Audio Engineering Society 104th Convention, Tokyo.

Toh C.W., Gan W.S. (1999), A real-time virtual surround sound system with bass enhancement, Audio Engineering Society 107th Convention, Preprint No. 5052, New York.

Wallach H. (1940), The role of head movement and vestibular and visual cue in sound localization, Journal of Experimental Psychology, 27(4): 339–368, doi: 10.1037/h0054629.

Ward D.B., Elko G.W. (1999), Effect of loudspeaker position on the robustness of acoustic crosstalk cancellation, IEEE Signal Processing Letters, 6(5): 106–108, doi: 10.1109/97.755428.

Xie B.S. (2013), Head-Related Transfer Function and Virtual Auditory Display, J. Ross Publishing,USA.

Xie B.S., Shi Y., Xie Z.W., Guan S. (2005), Virtual reproducing system for 5.1 channel surround sound, Journal of South China University of Technology, 24(1): 76–88.

DOI: 10.24425/aoa.2021.136577

Copyright © Polish Academy of Sciences & Institute of Fundamental Technological Research (IPPT PAN)