Streaming Dense Video Captioning: Efficient Processing and Detailed Descriptions of Long Untrimmed Videos
A streaming model for dense video captioning that can handle long input videos, generate detailed textual descriptions, and produce outputs before processing the entire video.