T-DEED: A Temporal-Discriminability Enhancer Encoder-Decoder for Precise Event Spotting in Sports Videos
T-DEED addresses several challenges in Precise Event Spotting, including the need for frame representations that are discriminable from one another, for high output temporal resolution, and for information captured at different temporal scales. Its architecture is designed around these requirements: an encoder-decoder leverages multiple temporal scales while restoring high output temporal resolution, and dedicated temporal modules increase token discriminability.
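To make the encoder-decoder idea concrete, the following is a minimal sketch of a generic temporal encoder-decoder with skip connections, written in plain Python. It is not the T-DEED implementation: the average-pooling downsampler, the nearest-neighbour upsampler, the additive skip connections, and all function names (`downsample`, `upsample`, `encoder_decoder`) are assumptions chosen only to illustrate how processing at several temporal scales can still yield one output per input frame.

```python
from typing import List

# A clip is represented as T frames, each a feature vector of size D.
Sequence = List[List[float]]


def downsample(x: Sequence) -> Sequence:
    """Halve the temporal length by averaging adjacent frame pairs."""
    return [
        [(a + b) / 2.0 for a, b in zip(x[t], x[t + 1])]
        for t in range(0, len(x) - 1, 2)
    ]


def upsample(x: Sequence) -> Sequence:
    """Double the temporal length by repeating each frame."""
    return [list(frame) for frame in x for _ in range(2)]


def add(x: Sequence, y: Sequence) -> Sequence:
    """Element-wise sum of two sequences of equal shape."""
    return [[a + b for a, b in zip(fx, fy)] for fx, fy in zip(x, y)]


def encoder_decoder(x: Sequence, num_scales: int = 2) -> Sequence:
    """Hypothetical multi-scale pass: encode to coarser scales, then decode.

    The encoder stores the features of each scale; the decoder upsamples
    and merges them back, so the output has the input's temporal length.
    """
    if len(x) % (2 ** num_scales) != 0:
        raise ValueError("sequence length must be divisible by 2**num_scales")

    skips = []
    for _ in range(num_scales):      # encoder: T -> T/2 -> T/4 ...
        skips.append(x)
        x = downsample(x)

    for skip in reversed(skips):     # decoder: ... T/4 -> T/2 -> T
        x = add(upsample(x), skip)
    return x


# Usage: an 8-frame clip with 2-dimensional features per frame.
frames = [[float(t), 1.0] for t in range(8)]
output = encoder_decoder(frames, num_scales=2)
assert len(output) == len(frames)    # one output vector per input frame
```

The skip connections are what let the coarse-scale context be combined with per-frame detail. In a learned model, the fixed pooling and summation used here would be replaced by trainable layers, including whatever temporal modules are used to increase token discriminability.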