Gaze-Guided Graph Neural Network for Predicting Household Activities and Atomic Actions
Our method utilizes human gaze fixations to construct a visual-semantic graph, which is then processed by a Graph Neural Network to recognize the overall household activity and predict the sequence of atomic actions necessary to complete the activity.