Hyperparameter Transfer with Mixture-of-Expert Layers

Publication information:

T. Jiang, B. Bordelon, C. Pehlevan, and B. Hanin,
“Hyperparameter Transfer with Mixture-of-Expert Layers”, International Conference on Machine Learning (ICML), 2026.