MaGRITTE: Manipulative and Generative 3D Scene Realization from Image, Topview, and Text
The proposed method generates 3D scenes by conditioning jointly on three complementary inputs: partial images, layout information represented as a top view, and text prompts. This addresses the limitations of existing methods that rely on a single condition.