Hi, I run the train_coco_word2_1_pool.m, but after more than 10 epochs, I found the train result is still bad(as below). I didn't change any hyper parameters but I don't know why it doesn't work.

Can you tell me what make this result happen? And I found the learning rate in your code is 0.1, but it's reported as 0.001 in the paper. Which lr is correct and better in this task?