Utilizing spatio-temporal proximity and a dual-path architecture enhances panoramic activity recognition.