Kuzmin, N., Luong, H. T., Yao, J., Xie, L., Lee, K. A., & Chng, E. S. (2024). NTU-NPU System for Voice Privacy 2024 Challenge. arXiv preprint arXiv:2410.02371.
This paper describes the NTU-NPU team's submissions to the Voice Privacy Challenge 2024, aiming to improve upon provided baseline systems for voice anonymization. The main objective is to enhance speaker anonymization while maintaining the emotional and content integrity of the speech.
The researchers focused on modifying two baseline systems, B3 and B5, provided by the challenge organizers. For B3, they incorporated emotion embeddings, experimented with different speaker embedders (WavLM and ECAPA2), and explored various anonymization strategies like random speaker selection and cross-gender anonymization. For B5, they introduced a Mean Reversion method and added white Gaussian noise to the prosody for enhanced privacy. Additionally, they explored disentanglement-based models like ß-VAE and NaturalSpeech3 FACodec.
The authors successfully modified the baseline systems, achieving improved speaker anonymization while preserving emotional and content information to varying degrees. Their experiments highlighted the trade-off between privacy and utility in voice anonymization, with techniques like AWGN and prosody manipulation enhancing privacy at the cost of reduced ASR and emotion recognition performance.
This research contributes to the field of voice privacy by exploring and refining techniques for speaker anonymization. The findings provide valuable insights for developing anonymization systems that balance privacy with the usability of anonymized speech data.
The authors acknowledge the volatility of EER results and the need for further investigation into the convergence of attacker ASV models. Future research could explore more robust anonymization techniques, particularly for disentanglement-based models, and investigate methods for mitigating the privacy-utility trade-off.
Sang ngôn ngữ khác
từ nội dung nguồn
arxiv.org
Thông tin chi tiết chính được chắt lọc từ
by Nikita Kuzmi... lúc arxiv.org 10-04-2024
https://arxiv.org/pdf/2410.02371.pdfYêu cầu sâu hơn