Steered Response Power-Based Direction-of-Arrival Estimation Exploiting an Auxiliary Microphone
Abstract: Accurately estimating the direction-of-arrival (DOA) of a speech source using a compact microphone array (CMA) is often complicated by background noise and reverberation. A commonly used DOA estimation method is the steered response power with phase transform (SRP-PHAT) function, which has been shown to work reliably in moderate levels of noise and reverberation. Since for closely spaced microphones the spatial coherence of noise and reverberation may be high over an extended frequency range, this may negatively affect the SRP-PHAT spectra, resulting in DOA estimation errors. Assuming the availability of an auxiliary microphone at an unknown position which is spatially separated from the CMA, in this paper we propose to compute the SRP-PHAT spectra between the microphones of the CMA based on the SRP-PHAT spectra between the auxiliary microphone and the microphones of the CMA. For different levels of noise and reverberation, we show how far the auxiliary microphone needs to be spatially separated from the CMA for the auxiliary microphone-based SRP-PHAT spectra to be more reliable than the SRP-PHAT spectra without the auxiliary microphone. These findings are validated based on simulated microphone signals for several auxiliary microphone positions and two different noise and reverberation conditions.
- M. Omologo and P. Svaizer, “Use of the crosspower-spectrum phase in acoustic event location,” IEEE Trans. on Audio, Speech, Language Processing, no. 3, pp. 288–292, 1997.
- J. P. Dmochowski and J. Benesty, “Steered beamforming approaches for acoustic source localization,” in Speech Processing in Modern Communication: Challenges and Perspectives. Springer, 2010, pp. 307–337.
- P. Pertilä, A. Brutti, P. Svaizer, and M. Omologo, “Multichannel source activity detection, localization, and tracking,” in Audio source separation and speech enhancement. Wiley, 2018, pp. 47–64.
- T. Dietzen, E. De Sena, and T. van Waterschoot, “Low-complexity steered response power mapping based on Nyquist-Shannon sampling,” in Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, NY, USA, 2021, pp. 206–210.
- D. Salvati, C. Drioli, and G. L. Foresti, “Incoherent frequency fusion for broadband steered response power algorithms in noisy environments,” IEEE Signal Processing Letters, vol. 21, no. 5, pp. 581–585, 2014.
- J.-M. Valin, F. Michaud, and J. Rouat, “Robust localization and tracking of simultaneous moving sound sources using beamforming and particle filtering,” Robotics and Autonomous Systems, vol. 55, no. 3, pp. 216–228, 2007.
- H. R. Abutalebi and H. Momenzadeh, “Performance improvement of TDOA-based speaker localization in joint noisy and reverberant conditions,” EURASIP Journal on Advances in Signal Processing, vol. 2011, pp. 1–13, 2011.
- G. Athanasopoulos, “Contributions to acoustic localization for robotic audition,” Ph.D. dissertation, Vrije Universiteit Brussel, 2016.
- S. Braun, W. Zhou, and E. A. Habets, “Narrowband direction-of-arrival estimation for binaural hearing aids using relative transfer functions,” in Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, NY, USA, 2015, pp. 1–5.
- C. Knapp and G. Carter, “The generalized correlation method for estimation of time delay,” IEEE Trans. on Acoustics, Speech, and Signal Processing, vol. 24, no. 4, pp. 320–327, 1976.
- J. Chen, J. Benesty, and Y. Huang, “Time delay estimation in room acoustic environments: An overview,” EURASIP Journal on Applied Signal Processing, pp. 1–19, 2006.
- C. Zhang, D. Florêncio, and Z. Zhang, “Why does PHAT work well in low noise, reverberative environments?” in Proc. IEEE International Conference of Acoustics, Speech and Signal Processing (ICASSP), Las Vegas, NV, USA, 2008, pp. 2565–2568.
- J. Velasco, C. J. Martin-Arguedas, J. Macias-Guarasa, D. Pizarro, and M. Mazo, “Proposal and validation of an analytical generative model of SRP-PHAT power maps in reverberant scenarios,” ELSEVIER Signal Processing, vol. 119, pp. 209–228, 2016.
- M. Farmani, M. S. Pedersen, Z.-H. Tan, and J. Jensen, “Informed sound source localization using relative transfer functions for hearing aid applications,” IEEE/ACM Trans. on Audio, Speech, Language Processing, vol. 25, no. 3, pp. 611–623, 2017.
- M. Farmani, M. S. Pedersen, Z.-H. Tan, and J. Jensen, “Bias-compensated informed sound source localization using relative transfer functions,” IEEE/ACM Trans. on Audio, Speech, Language Processing, vol. 26, no. 7, pp. 1275–1289, 2018.
- U. Kowalk, S. Doclo, and J. Bitzer, “Signal-informed DNN-based DOA estimation combining an external microphone and GCC-PHAT features,” in Proc. International Workshop on Acoustic Echo and Noise Control (IWAENC), Bamberg, Germany, 2022, pp. 1–5.
- D. Fejgin and S. Doclo, “Comparison of binaural RTF-vector-based direction of arrival estimation methods exploiting an external microphone,” in Proc. European Signal Processing Conference (EUSIPCO), Online, 2021, pp. 241–245.
- D. Fejgin and S. Doclo, “Exploiting an external microphone for binaural RTF-vector-based direction of arrival estimation for multiple speakers,” in Proc. Forum Acusticum, Turin, Italy, 2023.
- K. Brümann and S. Doclo, “Exploiting an external microphone to improve time-difference-of-arrival estimates for Euclidean distance matrix-based source localization,” in Proc. ITG Conference on Speech Communication, Aachen, Germany, 2023, pp. 16–20.
- G. W. Elko, “Spatial coherence functions for differential microphones in isotropic noise fields,” in Microphone Arrays: Signal Processing Techniques and Applications. Springer, 2001, pp. 61–85.
- J. B. Allen and D. A. Berkley, “Image method for efficiently simulating small-room acoustics,” Journal of the Acoustical Society of America, vol. 65, no. 4, pp. 943–950, 1979.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.