Efficient Text-Conditioned Image-to-Animation Generation with Tuning-Free LLM-Driven Attention Control
The proposed LASER framework integrates large language models (LLMs) with pre-trained text-to-image models to enable high-quality, smooth text-conditioned image-to-animation translation without any fine-tuning.