Khái niệm cốt lõi
The authors propose improvements to the discriminative feature extraction approach for anomalous sound detection in unlabeled conditions, including enhanced feature extractors and effective pseudo-labeling methods.
Tóm tắt
The paper focuses on improving the performance of discriminative methods for anomalous sound detection (ASD) in unlabeled conditions.
Key highlights:
- The authors enhance the feature extractor by using multi-resolution spectrograms and a subspace loss function, which improves performance with and without labels.
- They propose several pseudo-labeling methods to effectively train the feature extractor in the absence of labels, including using classification of available labels, external pre-trained models, and triplet learning.
- Experimental results show that the enhanced feature extractor and pseudo-labeling methods significantly improve ASD performance under unlabeled conditions.
- The external pre-trained models generally achieve the best performance, while triplet learning is more effective in noisy conditions.
- The authors analyze the differences in effectiveness among the proposed pseudo-labeling methods and the importance of constructing a noise-robust feature space.
Thống kê
The dataset used in the experiments is the DCASE 2023 and 2024 Task 2 Challenge dataset (ToyADMOS2 and MIMII DG), which consists of normal and anomalous machine sounds.
Trích dẫn
"The experimental results demonstrate that 1) the enhanced feature extractors improve the performance with and without the labels, and 2) the pseudo-labeling methods significantly improves the performance in the unlabeled conditions."
"We observed that the external pre-trained models tended to form clusters based on the noise differences, which explains why the pseudo-labels degrade performance. Even in such cases, Triplet generates effective pseudo-labels by constructing a noise-robust feature space from scratch."