Dual-Level Query-Based Framework for Precise Multi-Label Temporal Action Detection
DualDETR, a novel dual-level query-based framework, integrates instance-level and boundary-level modeling to achieve precise localization and recognition of temporal action instances in untrimmed videos.