Główne pojęcia
TextField3D introduces Noisy Text Fields to enhance open-vocabulary 3D generation by injecting dynamic noise into the latent space of text prompts.
Streszczenie
TextField3D proposes a conditional 3D generative model that utilizes Noisy Text Fields (NTFs) to expand the vocabulary scale and improve text control in 3D generation. By introducing NTFGen and NTFBind modules, TextField3D enhances the mapping of limited 3D data to textual latent space. Multi-modal discrimination is employed for geometry and texture guidance. Extensive experiments demonstrate the potential open-vocabulary capability of TextField3D.
Statystyki
"Extensive experiments demonstrate that our method achieves a potential open-vocabulary 3D generation capability."
"The total training time is around 3 days and 1 day with 8 V100 GPUs, respectively."
"We sample 8,192 points for each mesh object."
"The image resolution is 512 × 512 for both rendering and generation."
Cytaty
"Generative models have shown remarkable progress in 3D aspect."
"To tackle this issue, we introduce a conditional 3D generative model, namely TextField3D."
"Compared to previous methods, TextField3D includes three merits: large vocabulary, text consistency, and low latency."