Integrating pre-trained AV-HuBERT with a Mask-And-Recover strategy enhances target speech extraction performance.