Conceitos essenciais
Introducing two methods, surround dense sampling and Online Temporally Aware Label Cleaning (O-TALC), to improve the performance of online temporal action segmentation by addressing the issues of inaccurate segment boundaries and oversegmentation.
Resumo
The paper introduces two methods to improve online temporal action segmentation (AS):
-
Surround Dense Sampling:
- Addresses the issues with traditional dense sampling during training, where the initial frame of training clips is constrained within the labeled segment boundaries.
- Allows the densely sampled training clips to extend beyond the segment boundaries, matching the online sliding window inference clips.
- This helps improve segment boundary predictions and prevents missing short atomic action segments.
-
Online Temporally Aware Label Cleaning (O-TALC):
- Explicitly removes short erroneous segments that fall below a predefined cutoff value during online inference.
- Operates in real-time with a small segmentation delay (typically less than 1 second for short actions).
- Adopts both static and class-based cutoff values to handle the large variation in action lengths.
The authors show that their methods, which are backbone-invariant, can be deployed with computationally efficient spatio-temporal action recognition models to achieve strong online AS performance, rivaling offline approaches on challenging fine-grained datasets like CBAA, 50 Salads, and Assembly-101.
Estatísticas
The paper does not contain any key metrics or important figures to support the author's key logics.
Citações
The paper does not contain any striking quotes supporting the author's key logics.