GazePointAR: A Context-Aware Multimodal Voice Assistant for Pronoun Disambiguation in Wearable Augmented Reality
GazePointAR is a context-aware multimodal voice assistant for wearable augmented reality that leverages eye gaze, pointing gestures, and conversation history to disambiguate speech queries containing pronouns.
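To make the idea of pronoun disambiguation concrete, the following is a minimal illustrative sketch, not the GazePointAR implementation. It assumes that upstream components have already reduced the gaze target and the pointing target to text labels, and that conversation history is a list of previously mentioned referents. The function name `resolve_pronouns`, the pronoun list, and the priority order (pointing, then gaze, then history) are all hypothetical choices made for illustration.

```python
import re
from typing import Optional, Sequence

# Hypothetical pronoun list, chosen for illustration only.
PRONOUNS = ("this", "that", "it", "these", "those")
_PRONOUN_RE = re.compile(r"\b(" + "|".join(PRONOUNS) + r")\b", re.IGNORECASE)


def resolve_pronouns(
    query: str,
    gaze_label: Optional[str] = None,
    pointing_label: Optional[str] = None,
    history: Sequence[str] = (),
) -> str:
    """Replace the first pronoun in `query` with a contextual referent.

    Assumed priority (not taken from the source): pointing target,
    then gaze target, then the most recent referent in the history.
    """
    referent = pointing_label or gaze_label or (history[-1] if history else None)
    if referent is None:
        # No context available, so the query is returned unchanged.
        return query
    # A function replacement avoids backslash escapes in the label being
    # interpreted by re.sub.
    return _PRONOUN_RE.sub(lambda _m: f"the {referent}", query, count=1)


if __name__ == "__main__":
    print(resolve_pronouns("What is this?", gaze_label="coffee mug"))
    print(resolve_pronouns("How much does it cost?", history=["laptop"]))
```

Under these assumptions, a spoken query such as "What is this?" becomes a self-contained text query once the gazed-at object has been labeled. A real system would also need to handle plural agreement, multiple pronouns, and uncertain or conflicting gaze and pointing estimates.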