Multimodal Transformers for Real-Time Surgical Activity Prediction Study
The study introduces a multimodal transformer model for real-time surgical gesture and trajectory prediction. By efficiently fusing kinematic and video data, the model outperforms state-of-the-art models.
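As a rough illustration of what fusing kinematic and video data in a transformer can look like, the sketch below concatenates time-aligned per-frame feature vectors from the two streams and passes them through one causal self-attention step, so each frame attends only to itself and earlier frames, as a real-time setting requires. This is not the study's architecture. The function names (`fuse`, `causal_self_attention`), the feature sizes, the concatenation-based fusion, and the identity query/key/value projections are all assumptions made for illustration.

```python
import math


def fuse(kin, vid):
    """Concatenate time-aligned per-frame kinematic and video features.

    Concatenation is an assumed fusion strategy, chosen for simplicity.
    """
    if len(kin) != len(vid):
        raise ValueError("kinematic and video streams must have the same length")
    return [k + v for k, v in zip(kin, vid)]


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def causal_self_attention(tokens):
    """Single-head scaled dot-product self-attention with a causal mask.

    Identity projections are used instead of learned weights (an assumption),
    so frame t is replaced by a weighted average of frames 0..t.
    """
    d = len(tokens[0])
    out = []
    for t, query in enumerate(tokens):
        past = tokens[: t + 1]  # causal mask: no access to future frames
        scores = [
            sum(q * k for q, k in zip(query, key)) / math.sqrt(d) for key in past
        ]
        weights = softmax(scores)
        out.append(
            [sum(w * tok[j] for w, tok in zip(weights, past)) for j in range(d)]
        )
    return out


# Hypothetical toy inputs: 3 frames, 2 kinematic and 3 video features per frame.
kin = [[0.1, 0.2], [0.3, 0.4], [0.5, 0.6]]
vid = [[1.0, 0.0, 0.5], [0.0, 1.0, 0.5], [0.5, 0.5, 0.5]]

fused = fuse(kin, vid)
context = causal_self_attention(fused)
```

Here `context` holds one fused, history-aware vector per frame. A full model would add learned projections, multiple heads and layers, and task-specific output heads for gesture and trajectory prediction.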