Core Concepts
TD-MPC2 presents significant improvements over baselines in online RL tasks, achieving strong results with a single set of hyperparameters and demonstrating scalability.
Stats
"We further demonstrate the scalability of TD-MPC2 by training a single 317M parameter agent to perform 80 tasks across multiple domains, embodiments, and action spaces."