Scaling the AlphaFold Training to 2080 GPUs and Reducing Initial Training Time to 10 Hours
A systematic training method called ScaleFold that incorporates optimizations to address the key factors preventing the AlphaFold training from scaling to more compute resources, enabling it to be completed in 10 hours.