The author proposes a novel approach, Batch Size Invariant Adam, to address the limitations of standard Adam optimization in large-scale distributed settings. By modifying the update rules, batch size invariance is achieved without relying on strong assumptions.
Optimizing Adam for batch size invariance in large-scale distributed settings.