Core Concepts
A non-intrusive approach is used to port and optimize the Community Earth System Model (CESM) 2.2 for kilometer-level coupled climate modeling on a 40-million-core Sunway supercomputer, achieving a simulation speed of 222 simulated days per day.
Abstract
The authors present their eight-year journey of enabling high-resolution and ultra-high-resolution climate modeling on Sunway supercomputers, focusing on porting and optimizing the Community Earth System Model (CESM) 2.2.
Key highlights:
A hierarchical grid system with different spatial resolutions is adopted, enabling a progressive development of the final, ultra-high resolution coupled model.
A non-intrusive proxy toolkit called O2ATH is developed to facilitate OpenMP offloading to the heterogeneous Sunway manycore architecture, without requiring significant manual code modifications.
The O2ATH toolkit is widely adopted across major component models in CESM, including the atmosphere (CAM), ocean (POP), and sea ice (CICE) components, achieving significant performance improvements.
The initialization stage of the model is optimized through techniques like reforming the MPI communication pattern, reducing time complexity of mapping algorithms, and balancing I/O and communication overheads.
A suite of tools is developed to support the porting and optimization process, including profiling, hardware fault detection, bit-accurate result validation, and binary static call map analysis.
The final coupled model, with a 5-km atmosphere and 3-km ocean resolution, scales to 101,800 nodes (39,702,000 cores) and achieves a simulation speed of 222 simulated days per day, enabling multi-year or even multi-decadal ultra-high-resolution climate modeling.
Stats
The final coupled model, with a 5-km atmosphere and 3-km ocean resolution, scales to 101,800 nodes (39,702,000 cores) and achieves a simulation speed of 222 simulated days per day.
The atmosphere-only (CAM) component achieves a simulation speed of 340 simulated days per day at 5-km resolution.
The ocean-only (POP) component achieves a simulation speed of 265 simulated days per day at 3-km resolution.
Quotes
"We form a non-intrusive yet efficient workflow to port CESM 2.2 to a 40-million-core heterogeneous supercomputer, in around three weeks. Maintaining the consistency of the code, we improve from simulating 1.79 days to 222 days per day (enabling multi-year or even multi-decadal ultra-high-resolution climate modeling)."