Comprehensive Panoptic Scene Understanding Dataset with Multiple Viewpoints and Modalities
The 360+x dataset provides a comprehensive and authentic representation of real-world scenes by capturing multiple viewpoints (360°panoramic, third-person front, and egocentric) and diverse data modalities (video, audio, directional binaural delay, location, and textual descriptions) to enable holistic scene understanding.