Hyperoptimization metrics - implementation of $`\varphi^{2}`$ estimator

We are interested in adding a metric to `hyperopt` that is sensitive to higher moments of the probability distribution, $`\varphi^{2}`$; see Eq. (4.6) of the [NNPDF3.0 paper](https://link.springer.com/article/10.1007/JHEP04(2015)040). As defined therein and extended by @RoyStegeman and @juanrojochacon to the context of hyperoptimization, $`\varphi^{2}`$ can be calculated for each $k$-fold as

$$\Huge \varphi_{\chi^2_k}^2 = \langle \chi^2_k  [ \mathcal{T}[f_{\rm fit}], \mathcal{D} ] \rangle_{\rm rep} - \chi^2_k [ \langle \mathcal{T}[f_{\rm fit}] \rangle_{\rm rep}, \mathcal{D} ] $$

where the first term is our usual hyper loss averaged over replicas, $`\chi^2_k`$, calculated from the dataset used in the fit ($`\mathcal{D}`$) and the theory predictions from each fitted PDF ($f_{\rm fit}$) replica. The second term is the same hyper loss, but evaluated with the theory predictions from the central PDF (the PDF averaged over replicas, if I understood correctly).
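As a minimal sketch of the two terms in the equation above, the snippet below computes $`\varphi^{2}`$ for a single fold with plain NumPy. Everything in it is an assumption made for illustration: the function names `chi2` and `phi2_fold` are hypothetical, the $`\chi^2`$ is normalized per data point, and the second term uses the replica-averaged predictions, as the equation is written. It does not use any `n3fit` or `validphys` code.

```python
import numpy as np


def chi2(predictions, data, inv_covmat):
    """Chi2 per data point for one prediction vector (normalization is an assumption)."""
    diff = predictions - data
    return float(diff @ inv_covmat @ diff) / data.size


def phi2_fold(replica_predictions, data, covmat):
    """phi^2 for one fold; replica_predictions has shape (n_replicas, n_data)."""
    inv_covmat = np.linalg.inv(covmat)
    # First term: chi2 of each replica, then averaged over replicas.
    mean_chi2 = np.mean([chi2(rep, data, inv_covmat) for rep in replica_predictions])
    # Second term: chi2 of the replica-averaged predictions.
    central_chi2 = chi2(replica_predictions.mean(axis=0), data, inv_covmat)
    return mean_chi2 - central_chi2


# Toy usage: two replicas scattered symmetrically around the data.
data = np.array([0.0, 0.0])
covmat = np.eye(2)
replicas = np.array([[1.0, 0.0], [-1.0, 0.0]])
phi2 = phi2_fold(replicas, data, covmat)
```

With this definition the result measures the spread of the replica predictions around their mean in units of the experimental covariance, so it vanishes when all replicas agree.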

The idea would be to implement this new metric as an additional `@staticmethod` of the [`HyperLoss` class](https://github.com/NNPDF/nnpdf/blob/hyperopt_loss/n3fit/src/n3fit/hyper_optimization/rewards.py#L36C7-L36C16).
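To make the proposal concrete, here is a rough sketch of what such a static method could look like. The class `HyperLossSketch`, the method name `phi2`, and its arguments are all hypothetical; this is not the existing `HyperLoss` interface, and it assumes the per-replica and central $`\chi^2_k`$ values have already been computed elsewhere.

```python
import numpy as np


class HyperLossSketch:
    """Hypothetical stand-in, not the real n3fit HyperLoss class."""

    @staticmethod
    def phi2(replica_chi2, central_chi2):
        """Return phi^2 for each fold.

        replica_chi2: per-replica losses, shape (n_folds, n_replicas).
        central_chi2: losses of the central predictions, shape (n_folds,).
        """
        replica_chi2 = np.asarray(replica_chi2, dtype=float)
        central_chi2 = np.asarray(central_chi2, dtype=float)
        # Average over replicas (axis 1), then subtract the central loss per fold.
        return replica_chi2.mean(axis=1) - central_chi2


# Toy usage: two folds with two replicas each.
phi2_per_fold = HyperLossSketch.phi2([[1.2, 1.4], [2.0, 2.2]], [1.1, 1.9])
```

Keeping the method free of any state means it can be called on arrays produced by whatever code ends up providing the central-PDF loss.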

I noticed that `validphys` already has an implementation of $\varphi$ (probably following the [NNPDF3.0 paper](https://link.springer.com/article/10.1007/JHEP04(2015)040)) in the [`phi_data` function](https://github.com/NNPDF/nnpdf/blob/hyperopt_loss/validphys2/src/validphys/results.py#L542C1-L542C13). This function depends on the [`abs_chi2_data` function](https://github.com/NNPDF/nnpdf/blob/hyperopt_loss/validphys2/src/validphys/results.py#L525), which in turn depends on [`results`](https://github.com/NNPDF/nnpdf/blob/hyperopt_loss/validphys2/src/validphys/results.py#L463).

To avoid code duplication, I think it would be best to reuse these functions, probably via [`n3fit/vpinterface.py`](https://github.com/NNPDF/nnpdf/blob/hyperopt_loss/n3fit/src/n3fit/vpinterface.py).

The problem is that I do not know how to call these functions from `validphys`, especially [`results`](https://github.com/NNPDF/nnpdf/blob/hyperopt_loss/validphys2/src/validphys/results.py#L463), which depends on the `covariance_matrix` and `sqrt_covmat` arguments.

Could anybody help me with this, or suggest an alternative way to do it? I would appreciate your help very much.

Hyperoptimization metrics - implementation of $`\varphi^{2}`$ estimator #1849
