An automated cross-modal testing method, ACTesting, is proposed to detect defects in text-to-image software and to evaluate the robustness of its generation.