核心概念
Utilizing RTF and LOCA for robust acoustic scene mapping in reverberant environments.
摘要
The article introduces an unsupervised data-driven approach for acoustic scene mapping that overcomes the limitations of traditional methods sensitive to reverberation. By leveraging the Relative Transfer Function (RTF) as a feature vector, the proposed scheme learns an isometric representation of microphone spatial locations. The Local Conformal Autoencoder (LOCA) is adapted to extract standardized data coordinates, enabling extrapolation over new regions. Experimental results demonstrate superior performance compared to classical approaches and other dimensionality reduction schemes. The method shows robustness against reverberation and offers efficient inference capabilities.
統計資料
"We define the acoustic transfer functions Ai(k) as the Fourier transform of the RIRs ai(n)."
"For remote points, however, the Euclidean distance is meaningless."
"We eventually end up with a data tensor of shape [N, M, D] = [3136, 7, 760]."
"It turns out that picking a portion of the RTF bins is preferable."
"Our learned embedding demonstrates a high correlation between the main directions of the embedding and the true x − y axes."
引述
"Our method outperforms existing kernel-based schemes in terms of mapping accuracy and time efficiency."
"LOCA presents a fast and simple inference process based on DNN’s forward pass."
"LOCA leads to the best results in terms of MAE for all reverberation levels."