https://github.com/berkeleydeeprlcourse/homework_fall2020/blob/b5d34989d30c72b353acc2a64e691d17ddbea81f/hw3/cs285/critics/dqn_critic.py#L91 Should we use `clip_grad_norm_` here?
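
For context, PyTorch ships both `torch.nn.utils.clip_grad_value_` and `torch.nn.utils.clip_grad_norm_`, and they behave differently. Below is a minimal sketch of that difference in plain Python, using lists instead of tensors. The helper names `clip_by_value` and `clip_by_norm` and the `eps` term are assumptions made for illustration; they are not the homework's code or PyTorch's implementation.

```python
import math

# Hypothetical helpers for illustration only; not PyTorch's implementation.

def clip_by_value(grads, clip_value):
    """Clamp each gradient entry to [-clip_value, clip_value] independently."""
    return [max(-clip_value, min(clip_value, g)) for g in grads]

def clip_by_norm(grads, max_norm, eps=1e-6):
    """Rescale the whole gradient vector if its L2 norm exceeds max_norm.

    eps is a small stabiliser assumed for this sketch.
    """
    total_norm = math.sqrt(sum(g * g for g in grads))
    scale = min(1.0, max_norm / (total_norm + eps))
    return [g * scale for g in grads]

grads = [3.0, 4.0]  # L2 norm is 5.0

by_value = clip_by_value(grads, 1.0)  # each entry clamped on its own
by_norm = clip_by_norm(grads, 1.0)    # vector shrunk, direction kept

print(by_value)
print(by_norm)
```

Value clipping can change the direction of the gradient, since each entry is clamped separately. Norm clipping only shrinks its length. If the threshold being passed is meant as a norm bound, the two calls are not interchangeable.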