An optimization framework is proposed to enforce multi-view consistency for texturing 3D meshes using pre-trained text-to-image models.