Preserving Historical Knowledge in Class Incremental Audio-Visual Video Recognition
This work introduces a novel Hierarchical Augmentation and Distillation (HAD) framework for Class Incremental Audio-Visual Video Recognition (CIAVVR) that preserves knowledge of previously learned classes while new classes are learned.