DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation
DriveDreamer-2 introduces a Large Language Model (LLM) to generate user-defined driving videos, enhancing diversity and quality. The Unified Multi-View Model improves temporal and spatial coherence in video generation.