Core Concepts
Video sequences can be leveraged to improve the robustness and performance of morphing attack detection systems compared to traditional single-image or differential image-based approaches.
Abstract
This paper introduces and explores the potential of Video-based Morphing Attack Detection (V-MAD) systems in real-world operational scenarios. While current morphing attack detection methods primarily focus on a single or a pair of images, V-MAD is based on video sequences, exploiting the video streams often acquired by face verification tools available at airport gates.
The key highlights and insights are:
Incorporating information from multiple frames can lead to substantial improvements in overall morphing attack detection performance compared to traditional differential image-based approaches. Even simple score fusion strategies applied to the individual frame-level detection scores proved to be effective.
Face image quality can further contribute to the development of robust V-MAD systems. Unified quality scores as well as single quality components (e.g., illumination, focus, pose) can be leveraged to improve performance, especially when combined with machine learning models.
V-MAD represents a significant evolution from traditional morphing attack detection approaches, offering increased effectiveness and robustness in detecting face morphing attacks. However, the results achieved are still quite far from the theoretical upper bound, confirming the need for new and more robust V-MAD systems that can effectively work directly on video sequences.
The study establishes a foundation and guidelines for future V-MAD research, highlighting the potential advantages of leveraging video information in the context of the morphing attack detection task.
Stats
The database contains 205 bona fide document images, 612 gate images, and 1142 morphed document images.
There are 2187 bona fide attempts and 34698 morphed attempts for the D-MAD task, and 125 bona fide attempts and 1145 morphed attempts for the V-MAD task.
Quotes
"Video sequences represent valuable information for increasing the robustness and performance of morphing attack detection systems."
"The use of multiple frames could be advantageous from the MAD task perspective and must be considered."