MVD-Fusion: Generating Depth-Consistent Multi-View Images from a Single Input
MVD-Fusion casts 3D inference as directly generating multiple mutually consistent views and uses depth estimation to enforce that consistency, yielding more accurate synthesis than prior state-of-the-art methods.
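
To make the role of depth concrete, here is a minimal sketch of the standard geometric fact that a per-pixel depth, together with the relative camera pose, determines where a pixel in one view lands in another. This is not the MVD-Fusion implementation: the summary above does not specify the mechanism, and the pinhole camera model, the function names, and the parameters (`f`, `cx`, `cy`, `R`, `t`) are all assumptions chosen for illustration.

```python
# Illustrative sketch only: a simple pinhole camera with hypothetical helper names.
# It shows how a depth value links a pixel in view A to a pixel in view B.

def unproject(u, v, depth, f, cx, cy):
    """Lift pixel (u, v) with known depth to a 3D point in the camera frame."""
    return ((u - cx) * depth / f, (v - cy) * depth / f, depth)

def transform(point, R, t):
    """Apply the rigid transform x' = R x + t (R is a 3x3 nested list)."""
    return tuple(
        sum(R[i][j] * point[j] for j in range(3)) + t[i] for i in range(3)
    )

def project(point, f, cx, cy):
    """Project a 3D camera-frame point to pixel coordinates plus its depth."""
    x, y, z = point
    return (f * x / z + cx, f * y / z + cy, z)

def reproject(u, v, depth, f, cx, cy, R, t):
    """Map a pixel from view A into view B using A's depth and the A-to-B pose."""
    point_a = unproject(u, v, depth, f, cx, cy)
    point_b = transform(point_a, R, t)
    return project(point_b, f, cx, cy)

IDENTITY = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]

if __name__ == "__main__":
    # Assumed toy setup: view B is view A shifted 0.5 units along the x-axis.
    u_b, v_b, z_b = reproject(10, 20, 2.0, f=100.0, cx=32.0, cy=32.0,
                              R=IDENTITY, t=(0.5, 0.0, 0.0))
    print(u_b, v_b, z_b)
```

Under these assumptions, two generated views agree at a pixel when the depth that view B predicts at the reprojected location matches the returned `z` and the colors match. A check of that kind is one way depth can serve as a consistency signal across views.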