Core Concepts
This technical report details a novel approach to improve replay grounding in soccer videos by transforming it into a temporal action detection problem and utilizing a unified network called Faster-TAD with enhanced feature engineering techniques.
Stats
With features of positive samples from “3s style1”, “6s” and “3s+6s”, the model achieved 90.84, 91.45, 92.19 in AUC, 67.69, 66.07, 70.54 in AR@1, 86.56, 86.08, 88.34 in AR@5 on the validation set.
The proposed method achieved a tight mAP of 52.31% on the test set, a 26.76% mAP improvement over the previous best.
Quotes
"In order to make full use of video information, we transform the replay grounding problem into a temporal action detection problem."
"We apply a Faster-RCNN like network in temporal action detection, Faster-TAD."
"By jointing temporal proposal generation and action classification with multi-task loss and shared features, Faster-TAD simplifies the pipeline of TAD."