A method with a mixture-of-expert (MOE) controllers to align the text-guided capacity of diffusion models with different kinds of human instructions, enabling the model to handle various open-domain image manipulation tasks with natural language instructions.
CycleNet introduces cycle consistency into diffusion models for superior image manipulation.