In your paper, you mention the evaluation metric Forward Transfer (FWT). However, the formula provided in your paper appears to differ from the one in the original paper you cited (Lopez-Paz and Ranzato, 2017). Specifically, your formula is:

Whereas the original paper presents the formula as:

It seems that these two formulas are not equivalent. Could you provide clarification on this discrepancy and explain why your formula differs from the original?
References: David Lopez-Paz and Marc’Aurelio Ranzato. 2017. Gradient episodic memory for continual learning. Advances in neural information processing systems, 30.
Additionally, I noticed another discrepancy regarding the Backward Transfer (BWT) metric in your paper. The formula given in the paper is:

In this formula, the denominator is T-1. However, in your code (score.py), BWT is computed as follows:
Bwt=sum([scores_array[-1][i] - scores_array[i][i] for i in range(task_num)])/task_num
Here, the denominator is T. Could you please explain this difference between the formula in the paper and the implementation in the code?
I greatly appreciate your paper. Your work has been incredibly valuable to my research, and I look forward to your clarification. Thanks in advance for your help!
In your paper, you mention the evaluation metric Forward Transfer (FWT). However, the formula provided in your paper appears to differ from the one in the original paper you cited (Lopez-Paz and Ranzato, 2017). Specifically, your formula is:


Whereas the original paper presents the formula as:
It seems that these two formulas are not equivalent. Could you provide clarification on this discrepancy and explain why your formula differs from the original?
References: David Lopez-Paz and Marc’Aurelio Ranzato. 2017. Gradient episodic memory for continual learning. Advances in neural information processing systems, 30.
Additionally, I noticed another discrepancy regarding the Backward Transfer (BWT) metric in your paper. The formula given in the paper is:

In this formula, the denominator is
T-1. However, in your code (score.py), BWT is computed as follows:Here, the denominator is
T. Could you please explain this difference between the formula in the paper and the implementation in the code?I greatly appreciate your paper. Your work has been incredibly valuable to my research, and I look forward to your clarification. Thanks in advance for your help!