Don't be lazy: CompleteP enables compute-efficient deep transformers

Publication information:

N. Dey et al.,
“Don’t be lazy: CompleteP enables compute-efficient deep transformers”, Advances in Neural Information Processing Systems (NeurIPS), 2025, 2025.