Generating Realistic 3D Human-Object Interactions from Text Descriptions in a Zero-Shot Setting
This paper introduces a novel framework, InterDreamer, that can generate realistic and coherent 3D human-object interaction sequences from text descriptions without requiring text-interaction paired data for training.