The article introduces an unsupervised data-driven approach for acoustic scene mapping that overcomes the limitations of traditional methods sensitive to reverberation. By leveraging the Relative Transfer Function (RTF) as a feature vector, the proposed scheme learns an isometric representation of microphone spatial locations. The Local Conformal Autoencoder (LOCA) is adapted to extract standardized data coordinates, enabling extrapolation over new regions. Experimental results demonstrate superior performance compared to classical approaches and other dimensionality reduction schemes. The method shows robustness against reverberation and offers efficient inference capabilities.
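As a rough illustration of the feature the summary refers to, the RTF between two microphones is commonly defined as the ratio of their acoustic transfer functions. The notation below (A_1, A_2, s, x_1, x_2, the bin index k, and the bin selection k_1, ..., k_D) is assumed here for illustration and is not taken from the source; the estimator shown holds only in the idealized noise-free case.

```latex
% Assumed signal model at frequency bin k (noise-free sketch):
%   x_i(k) = A_i(k) s(k),  i = 1, 2
\begin{align}
  H(k) &= \frac{A_2(k)}{A_1(k)}, \\
  \hat{H}(k) &= \frac{\hat{S}_{x_2 x_1}(k)}{\hat{S}_{x_1 x_1}(k)}
             \approx \frac{A_2(k)\,A_1^{*}(k)\,S_{ss}(k)}{|A_1(k)|^{2}\,S_{ss}(k)}
             = \frac{A_2(k)}{A_1(k)}, \\
  \mathbf{h} &= \bigl[\,H(k_1),\, H(k_2),\, \dots,\, H(k_D)\,\bigr]^{\top}.
\end{align}
```

Here \hat{S}_{x_2 x_1} is the estimated cross-spectrum between the two microphone signals and \hat{S}_{x_1 x_1} is the estimated power spectrum of the reference microphone. Because the source spectrum S_{ss} cancels in the ratio, a stacked vector such as \mathbf{h} depends on the acoustic paths rather than on the source signal, which is what makes it a plausible input for a dimensionality reduction scheme.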
Key Insights Distilled From
by Idan Cohen,O... : arxiv.org 03-14-2024
https://arxiv.org/pdf/2301.00448.pdf
Deeper Questions