The article introduces an unsupervised, data-driven approach to acoustic scene mapping that avoids the sensitivity to reverberation that limits traditional methods. Using the Relative Transfer Function (RTF) as a feature vector, the proposed scheme learns a representation that is isometric to the spatial locations of the microphones. The Local Conformal Autoencoder (LOCA) is adapted to extract standardized data coordinates, which enables extrapolation to new regions. Experimental results show that the method outperforms classical approaches and other dimensionality reduction schemes. The method is also robust to reverberation and supports efficient inference.
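To make the RTF feature vector concrete, here is a minimal sketch of one common way to estimate an RTF between a reference microphone and a second microphone: the averaged cross-spectrum divided by the reference power spectrum. The function names `estimate_rtf` and `rtf_feature`, the frame and hop sizes, and the choice to stack real and imaginary parts into one vector are all assumptions made for illustration; the summary does not say how the authors compute or arrange their features.

```python
import numpy as np


def estimate_rtf(ref, mic, frame_len=512, hop=256):
    """Estimate the RTF of `mic` relative to `ref`, one complex value per frequency bin.

    Hypothetical helper: averages the cross-spectrum conj(R) * M over frames
    and divides by the averaged reference power |R|^2.
    """
    window = np.hanning(frame_len)
    n_bins = frame_len // 2 + 1
    cross = np.zeros(n_bins, dtype=complex)
    power = np.zeros(n_bins)
    for start in range(0, len(ref) - frame_len + 1, hop):
        R = np.fft.rfft(window * ref[start:start + frame_len])
        M = np.fft.rfft(window * mic[start:start + frame_len])
        cross += np.conj(R) * M
        power += np.abs(R) ** 2
    # Small constant guards against division by zero in empty bins.
    return cross / (power + 1e-12)


def rtf_feature(rtf):
    """Stack real and imaginary parts into a real-valued feature vector (an assumed layout)."""
    return np.concatenate([rtf.real, rtf.imag])


# Toy check: the second channel is the reference scaled by 0.5,
# so the estimated RTF should be close to 0.5 in every bin.
rng = np.random.default_rng(0)
ref = rng.standard_normal(8000)
rtf = estimate_rtf(ref, 0.5 * ref)
feature = rtf_feature(rtf)
```

In a real room the second channel would be a filtered, reverberant version of the source rather than a scaled copy, and feature vectors like this one, collected at many microphone positions, would be the input to a dimensionality reduction model such as LOCA.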
by Idan Cohen, O... at arxiv.org, 03-14-2024
https://arxiv.org/pdf/2301.00448.pdf