The author presents Smart-Infinity as a solution to the storage bandwidth bottleneck in large language model training, using near-storage processing devices together with accompanying optimization techniques.
Optimizing ZeRO for efficient large language model training.