Seesaw: Accelerating Training by Balancing Learning Rate and Batch Size Scheduling
Publication information:
A. Meterez, D. Morwani, J. Wu, C.-A. Oncescu, C. Pehlevan, and S. Kakade,
“Seesaw: Accelerating Training by Balancing Learning Rate and Batch Size Scheduling”, International Conference on Learning Representations (ICLR), 2026.