Loss Dynamics of Temporal Difference Reinforcement Learning

Publication information:

B. Bordelon, P. Masset, H. Kuo, and C. Pehlevan,
“Loss Dynamics of Temporal Difference Reinforcement Learning”, in Advances in Neural Information Processing Systems (NeurIPS), 2023.