The article introduces an unsupervised data-driven approach for acoustic scene mapping that overcomes the limitations of traditional methods sensitive to reverberation. By leveraging the Relative Transfer Function (RTF) as a feature vector, the proposed scheme learns an isometric representation of microphone spatial locations. The Local Conformal Autoencoder (LOCA) is adapted to extract standardized data coordinates, enabling extrapolation over new regions. Experimental results demonstrate superior performance compared to classical approaches and other dimensionality reduction schemes. The method shows robustness against reverberation and offers efficient inference capabilities.
Para Outro Idioma
do conteúdo original
arxiv.org
Principais Insights Extraídos De
by Idan Cohen,O... às arxiv.org 03-14-2024
https://arxiv.org/pdf/2301.00448.pdfPerguntas Mais Profundas