The article introduces an unsupervised data-driven approach for acoustic scene mapping that overcomes the limitations of traditional methods sensitive to reverberation. By leveraging the Relative Transfer Function (RTF) as a feature vector, the proposed scheme learns an isometric representation of microphone spatial locations. The Local Conformal Autoencoder (LOCA) is adapted to extract standardized data coordinates, enabling extrapolation over new regions. Experimental results demonstrate superior performance compared to classical approaches and other dimensionality reduction schemes. The method shows robustness against reverberation and offers efficient inference capabilities.
إلى لغة أخرى
من محتوى المصدر
arxiv.org
الرؤى الأساسية المستخلصة من
by Idan Cohen,O... في arxiv.org 03-14-2024
https://arxiv.org/pdf/2301.00448.pdfاستفسارات أعمق