UniEdit is a tuning-free framework that supports both video motion editing (e.g., changing the action from playing guitar to waving) and various video appearance editing scenarios (e.g., stylization, object replacement, background modification) by harnessing the power of a pre-trained text-to-video generator within an inversion-then-generation pipeline.
CCEdit is a versatile generative video editing framework that decouples structure and appearance control, enabling precise and creative editing capabilities through a novel trident network architecture.
MaskINT, an efficient prompt-based video editing framework, disentangles the task into keyframes joint editing and structure-aware frame interpolation, eliminating the need for paired text-video datasets and significantly accelerating the processing time compared to diffusion-based methods.
VidEdit is a novel method for zero-shot text-based video editing that guarantees robust temporal and spatial consistency by combining an atlas-based video representation with a pre-trained text-to-image diffusion model.
DragVideo proposes a novel framework that enables intuitive and accurate drag-style editing of videos while preserving spatio-temporal consistency.
FastVideoEdit proposes an efficient zero-shot video editing approach inspired by Consistency Models, reducing editing time while maintaining high quality.
RealCraft ermöglicht formgerechtes, konsistentes Video-Editing ohne zusätzliche Parameter.
RealCraft proposes an attention-control-based method for zero-shot real-world video editing, achieving shape-wise edits and enhanced temporal consistency without additional information.
Learning visual representations of editing components is crucial for video creation tasks, supported by a novel dataset and method.
VIDEOSHOP enables precise video editing by propagating semantic changes across frames.