Skip to content

[WIP] Added NLO and NNLO candidate runcards for NNPDF40#675

Merged
Zaharid merged 77 commits into
masterfrom
NNPDF40_runcrads
May 12, 2021
Merged

[WIP] Added NLO and NNLO candidate runcards for NNPDF40#675
Zaharid merged 77 commits into
masterfrom
NNPDF40_runcrads

Conversation

@enocera
Copy link
Copy Markdown
Contributor

@enocera enocera commented Mar 13, 2020

This PR contains two runcards (NLO and NNLO) for a candidate NNPDF40 fit with the currently available data set. @scarlehoff : you might consider to use these as a baseline for the n3fit requested by @juanrojochacon at the PC today.

@voisey
Copy link
Copy Markdown
Contributor

voisey commented Mar 13, 2020

Hi @enocera. One very minor comment: I noticed that this includes the 8 TeV single top data (for which I am currently working on implementing the correlation info) rather than the 7 TeV data (for which we have the full correlation info already implemented). Is there a reason for this?

@enocera
Copy link
Copy Markdown
Contributor Author

enocera commented Mar 13, 2020

Thanks for noticing @voisey . That was a typo. This is now corrected. (BTW I was not aware of the fact that we have correlations for the 8 TeV data now?)

@voisey
Copy link
Copy Markdown
Contributor

voisey commented Mar 13, 2020

No problem! Yes, we do. It's available here: https://atlas.web.cern.ch/Atlas/GROUPS/PHYSICS/PAPERS/TOPQ-2015-05/ and it has both the statistical correlation matrices and the systematic breakdowns. We got an email from Wolfgang Wagner on 8 Jan 2020 pointing us to the info. He also said he would put some of the tables into a machine readable format for us by the end of January but I've heard nothing from him since, so I decided to do it myself. There isn't that much info to convert so it shouldn't take me too long

@juanrojochacon
Copy link
Copy Markdown

Thanks, let me look at the candidate NNPDF4.0 runcard and I will let you know if I have any comments.

@scarlehoff
Copy link
Copy Markdown
Member

Great, I'll start running the NLO one now. Let me know if the runcard changes!

One question, is there by any chance a nnfit baseline fit I can compare with?

@enocera
Copy link
Copy Markdown
Contributor Author

enocera commented Mar 16, 2020

@scarlehoff Please wait one second - I've to add the off-peak and forward W/Z data in the runcards. There's no nnfit baseline, but I can easily produce it if need be.

@juanrojochacon
Copy link
Copy Markdown

wait a sec, let me check the runcard first.

There is no nnfit baseline, so if required we would need to run it ex-profeso. I guess probably we want this baseline, right @enocera ? But first let us get the n3fit running, which is much faster and would allow us to identify any potential problem.

@scarlehoff
Copy link
Copy Markdown
Member

Ok! I'll be in stand-by !

wrt the baseline, no problem, I was just wondering whether it existed.

@enocera
Copy link
Copy Markdown
Contributor Author

enocera commented Mar 16, 2020

Please note that I've updated the wiki correspondingly:
https://www.wiki.ed.ac.uk/display/nnpdfwiki/NNPDF4.0+dataset
and that the K-folding partitions suggested by @juanrojochacon a while ago MUST be updated with the file provided there.

@juanrojochacon
Copy link
Copy Markdown

Hi @enocera the run cards look good to me with a couple of small modifications:

  • I guess we also want to include the CMS 8 TeV dilepton 2D distributions right? So we should uncomment for example CMS_TTBAR_2D_DIFF_MTT_TRAP_NORM

  • Are we sure we want to fit the top pT from ATLAS lepton+jets 8 TeV? I understood that in this case the theory does not provide a good description of the data?

Other than that the runcards seem ready to go to me. @scarlehoff before running any fit, could you please send around a list of the chi2s at NLO and NNLO to try to identify if there is any obvious problem?

@juanrojochacon
Copy link
Copy Markdown

Also @scarlehoff for the k-foldings, I am not sure in which format do you need the info, but I guess that it should be possible to adapt my list and proposed partitions so that it is in one-to-one correspondence with the runcard that @enocera has produced?
Let me know if I can provide any further information!

@enocera
Copy link
Copy Markdown
Contributor Author

enocera commented Mar 16, 2020

I guess we also want to include the CMS 8 TeV dilepton 2D distributions right?
So we should uncomment for example CMS_TTBAR_2D_DIFF_MTT_TRAP_NORM

There's in principle double counting with the single differential CMS top pair at 8 TeV (CMSTOPDIFF8TEVTTRAPNORM) which is not commented. It's up to us to decide whether we want to replace it with one of the 2D differential distributions.

Are we sure we want to fit the top pT from ATLAS lepton+jets 8 TeV? I understood that in this
case the theory does not provide a good description of the data?

As we have shown in our LH proceedings, including or not the pT distribution is inconsequential. Maybe @stefanoforte has a strong opinion on this? I don't.

Other than that the runcards seem ready to go to me. @scarlehoff before running any fit,
could you please send around a list of the chi2s at NLO and NNLO to try to identify if there is
any obvious problem?

I of course checked all these numbers before opening the PR. Please have a look at these

  • NLO (with 200310-ern-dijets-nlo)
  • NNLO (with 200310-ern-dijets-nnlo)

which were computed with the iterated fit to dijet data (no PDF uncertainty included).

Also @scarlehoff for the k-foldings, I am not sure in which format do you need the info, but I guess that it should be possible to adapt my list and proposed partitions so that it is in one-to-one correspondence with the runcard that @enocera has produced?

I've already adapted the list at this link.

@juanrojochacon
Copy link
Copy Markdown

There's in principle double counting with the single differential CMS top pair at 8 TeV (CMSTOPDIFF8TEVTTRAPNORM) which is not commented. It's up to us to decide whether we want to replace it with one of the 2D differential distributions.

I seem to recall that the 2D distributions correspond to the dilepton dataset while the single-differential ones are for lepton+jets, so to the best of my understanding there is no double counting in such a case?

Are we sure we want to fit the top pT from ATLAS lepton+jets 8 TeV? I understood that in this
case the theory does not provide a good description of the data?

As we have shown in our LH proceedings, including or not the pT distribution is inconsequential. Maybe @stefanoforte has a strong opinion on this? I don't.

Me neither, but since the chi2 is not optimal perhaps we can facilitate the life of the optimiser? Again, this is a minute thing.

Other than that the runcards seem ready to go to me. @scarlehoff before running any fit,
could you please send around a list of the chi2s at NLO and NNLO to try to identify if there is
any obvious problem?

I of course checked all these numbers before opening the PR. Please have a look at these

  • NLO (with 200310-ern-dijets-nlo)
  • NNLO (with 200310-ern-dijets-nnlo)

which were computed with the iterated fit to dijet data.

Ok thanks I will check this and comment back.

Also @scarlehoff for the k-foldings, I am not sure in which format do you need the info, but I guess that it should be possible to adapt my list and proposed partitions so that it is in one-to-one correspondence with the runcard that @enocera has produced?

I've already adapted the list at this link.

Amazing, many thanks ;)

@enocera
Copy link
Copy Markdown
Contributor Author

enocera commented Mar 16, 2020

I seem to recall that the 2D distributions correspond to the dilepton dataset while the single-differential ones are for lepton+jets, so to the best of my understanding there is no double counting in such a case?

I had the same recollection, but that's not the case. I've checked the papers earlier today while preparing the table at https://www.wiki.ed.ac.uk/display/nnpdfwiki/NNPDF4.0+dataset. But please feel free to double check.

@juanrojochacon
Copy link
Copy Markdown

Well in

https://arxiv.org/abs/1703.01630

they say "The measurement is performed in the dilepton e±µ∓ final state", so indeed there is no double counting? And then they quote the older paper

https://arxiv.org/abs/1505.04480

which is lepton+jets. Or I am missing something obvious?

@enocera
Copy link
Copy Markdown
Contributor Author

enocera commented Mar 16, 2020

@juanrojochacon I detest to say that you're right. I've corrected the runcards and all files/numbers linked to https://www.wiki.ed.ac.uk/display/nnpdfwiki/NNPDF4.0+dataset.

@juanrojochacon
Copy link
Copy Markdown

Thanks, now it looks good to me

juanrojochacon
juanrojochacon previously approved these changes Mar 17, 2020
Copy link
Copy Markdown

@juanrojochacon juanrojochacon left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

good to go

@scarlehoff
Copy link
Copy Markdown
Member

scarlehoff commented Mar 17, 2020

Great, I'll start running with these runcards.
I've already run a few fits with the iterated runcards of this PR and seems to work ok. If there are no unexpected problems I should be able to produce reports by this afternoon.

@enocera
Copy link
Copy Markdown
Contributor Author

enocera commented Mar 17, 2020

@scarlehoff Please note that there was still a bug in the NNLO runcard: the EWK K-factors were missing for the ATLASPHT12 and ATLASPHT15 data sets (they were not propagated to theory 53). You have to download theory 53 again to get these.

@enocera enocera changed the title Added NLO and NNLO candidate runcards for NNPDF40 [WIP] Added NLO and NNLO candidate runcards for NNPDF40 Mar 18, 2020
@siranipour
Copy link
Copy Markdown
Contributor

Just out of curiosity, is the DISonly runcard going to remain as the one that's in

#
# Configuration file for NNPDF++
#
############################################################
description: NNPDF3.1 NNLO fitted charm global dataset
############################################################
# frac: training fraction
# ewk: apply ewk k-factors
# sys: systematics treatment (see systypes)
experiments:
# Fixed target DIS
- experiment: NMC
datasets:
- { dataset: NMCPD, frac: 0.5 }
- { dataset: NMC, frac: 0.5 }
- experiment: SLAC
datasets:
- { dataset: SLACP, frac: 0.5}
- { dataset: SLACD, frac: 0.5}
- experiment: BCDMS
datasets:
- { dataset: BCDMSP, frac: 0.5}
- { dataset: BCDMSD, frac: 0.5}
- experiment: CHORUS
datasets:
- { dataset: CHORUSNU, frac: 0.5}
- { dataset: CHORUSNB, frac: 0.5}
- experiment: NTVDMN
datasets:
- { dataset: NTVNUDMN, frac: 0.5}
- { dataset: NTVNBDMN, frac: 0.5}
# EMC F2C data
# - experiment: EMCF2C
# datasets:
# - { dataset: EMCF2C, frac: 1.0}
# HERA data
- experiment: HERACOMB
datasets:
- { dataset: HERACOMBNCEM , frac: 0.5}
- { dataset: HERACOMBNCEP460, frac: 0.5}
- { dataset: HERACOMBNCEP575, frac: 0.5}
- { dataset: HERACOMBNCEP820, frac: 0.5}
- { dataset: HERACOMBNCEP920, frac: 0.5}
- { dataset: HERACOMBCCEM , frac: 0.5}
- { dataset: HERACOMBCCEP , frac: 0.5}
# Combined HERA charm production cross-sections
- experiment: HERAF2CHARM
datasets:
- { dataset: HERAF2CHARM, frac: 0.5}
# F2bottom data
- experiment: F2BOTTOM
datasets:
- { dataset: H1HERAF2B, frac: 1.0}
- { dataset: ZEUSHERAF2B, frac: 1.0}

@juanrojochacon
Copy link
Copy Markdown

no this is an old card, for NNPDF3.1-like fits

@juanrojochacon
Copy link
Copy Markdown

@siranipour you should take the NNPDF4.0 card and remove all the non-DIS experiments

@enocera enocera mentioned this pull request Apr 5, 2020
@scarrazza scarrazza marked this pull request as draft April 22, 2020 08:25
@scarlehoff
Copy link
Copy Markdown
Member

scarlehoff commented May 27, 2020

Hi @enocera (cc @scarrazza @juanrojochacon @stefanoforte) I've tried doing a global fit, current is the fit after hyperoptimization and reference before.

nnpdf40-fit
nnpdf40-fit extra flexibility

As you can see the fits after some hyperoptimization "work better" (they are more stable from one run to the next) but the chi2 of the central replica is quite bad.

I'm looking through https://www.wiki.ed.ac.uk/download/attachments/431524064/NNLO_chi2.txt?version=2&modificationDate=1584395434672&api=v2 and it seems that the experiments these fits fail to describe well (such as ATLAS TPTNORM, ATLAS_1JET_8TEV_R06, and even D0WEASY or ATLASTTBARTOT) were already more problematic in that document.

I was wondering whether there could be some problem with these datasets.
Before expanding more the network to add flexibility to force lower chi2 (which seems to improve the chi2 but in exchange of some overfitting) I've tried doing a fit without these "problematic" datasets just in case:

fit with some datasets removed

which seems to do much better (note that I also removed here positivity, this was my first test but didn't really make a difference, I just forgot to put it back, example here )

Any insights? Let me know if you'd like any other fits to complete the info (bear in mind they take 3-6 hours to be ready in the best case).

Some extra info:

Before fitting I removed the theory_53 folder to force a redownload in case there was a problem with my local copy of the data. For the set of data (and the training validation splits) I am using the latest runcards form this branch.

All in all, this is my runcard:
250520_global_bestworst.txt

@enocera
Copy link
Copy Markdown
Contributor Author

enocera commented May 27, 2020

@scarlehoff I think that the somewhat high numbers you find are expected, and consistent with our recent top study (https://inspirehep.net/literature/1783782) for ATLASTPTNORM and ATLASTTBARTOT, with our recent jet study (https://inspirehep.net/literature/1797633) for ATLAS_1JET_8TEV_R06 and with NNPDF3.1 for D0WEASY. We have indication that the experimental covariance matrix is problematic in these cases. Our claim (to be checked) is that the chi2 can be improved by regularizing/decorrelating the covariance matrix, but that this leaves the PDFs unchanged.

@enocera
Copy link
Copy Markdown
Contributor Author

enocera commented Apr 23, 2021

Thanks. Then I have a collateral question (for the record): why the data sets on F2c and F2b were not removed from the LO NNPDF3.1 fit?

@juanrojochacon
Copy link
Copy Markdown

I think technically F2c is not zero at LO but rather reduced to the massless calculation. But this is terrible for low Q. So it is a bit ambiguous but I think it is safer to remove them

@enocera
Copy link
Copy Markdown
Contributor Author

enocera commented Apr 23, 2021

@juanrojochacon Thanks for the clarification. I apologise for having overlooked W+jet.
@RoyStegeman I think that there's now sufficient consensus on the correctness of the LO runcards, and therefore room for additional investigations of the corresponding fits.

@RoyStegeman
Copy link
Copy Markdown
Member

Perfect! Thanks all for your quick replies.

@juanrojochacon
Copy link
Copy Markdown

agreed, let's see what happens now. If the problem persist, we should store the chi2 logs dataset per dataset to pin down the problem @scarlehoff

@wilsonmr
Copy link
Copy Markdown
Contributor

That's easier said than done - the training validation split is done at the level of the experiments and we never calculate per dataset chi2 at present.

I wonder if we should open a new discussion of this study, the conversation is rather long here and nothing to do with the title (but of course very important).

@wilsonmr
Copy link
Copy Markdown
Contributor

I'm also opening a separate issue wrt the PDG plots for similar reason - the discussion will likely be forgotten about or duplicated as is.

@RoyStegeman
Copy link
Copy Markdown
Member

There is already an issue for the PDG plots: https://github.com/NNPDF/papers/issues/27

@wilsonmr
Copy link
Copy Markdown
Contributor

Cheers @RoyStegeman I still opened the issue incase we want that code in this repo - if not then somebody can have the endorphin rush of closing the issue without the usual associated pain..

I also opened an issue about fitting at LO, if the new runcard solves the issues then perhaps this will also be closed relatively soon.

I wonder if at some point we should merge this PR? It seems somehow useful to have the current runcards on the main branch even if a PR in the future must be made to update them. I guess at some point this was discussed before but rejected for some reason, but if not then I'll re/propose the idea.

@Zaharid
Copy link
Copy Markdown
Contributor

Zaharid commented Apr 23, 2021

@wilsonmr I guess we didn't believe we had anything reasonably close to final runcards until today (with the LO one checked), but it probably makes sense now. There are various modifications in the pipeline and I think it is up to @enocera to decide if he prefers managing them from here or master.

@enocera
Copy link
Copy Markdown
Contributor Author

enocera commented Apr 23, 2021

There are various modifications in the pipeline and I think it is up to @enocera to decide if he prefers managing them from here or master.

It's immaterial, insofar as I'm concerned.

@RoyStegeman
Copy link
Copy Markdown
Member

CMS_WCHARM_DIFF_UNNORM_13TEV does not have any frac or cfac set. Is that intentional or should it be changed?

@enocera
Copy link
Copy Markdown
Contributor Author

enocera commented Apr 23, 2021

@RoyStegeman Good spot! The missing training fraction is a mistake - fixed now through all the LO and NLO runcards; the missing cfac should be fine, W+c data are excluded from the computation of similarity cuts.

@RoyStegeman
Copy link
Copy Markdown
Member

I noticed you also removed the posf2c constraint from the regular (non pch) LO runcard. Is there a reason for this?

@enocera
Copy link
Copy Markdown
Contributor Author

enocera commented Apr 24, 2021

@RoyStegeman It's the same reason for which Juan suggested to remove F2c for HERA.

@RoyStegeman
Copy link
Copy Markdown
Member

@enocera I think the LO runcards can be replaced by the ones below (make sure to change the extension to .yml). LO pch is iterated wrt the current version in the repo while the fitted charm is slightly changed to be positive definite in the flavour basis.

Further things to note are:

  • Both runcards are without HERA F2c
  • Both runcards include ATLAS_WMU_8TEV (not the t0 pdfs though)
  • I noticed you didn't add NNLO K-factors for DYE886R to the NNLODatasets in the LO runcards. I did not include them here either since I don't know if they should, but just wanted to point it out since it seems inconsistent to me.

NNPDF40_lo_as_0118_pch.txt
NNPDF40_lo_as_0118.txt

@enocera
Copy link
Copy Markdown
Contributor Author

enocera commented May 10, 2021

@RoyStegeman Thanks a lot. The LO runcards should now read like the ones you pasted above and they should alos include the missing NNLO K-factors. TOmorrow I'll update them with the missing Seaquest dataset.

@scarlehoff scarlehoff marked this pull request as ready for review May 12, 2021 14:11
Copy link
Copy Markdown
Member

@scarlehoff scarlehoff left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Approving by explicit request from @Zaharid

@Zaharid Zaharid merged commit c97c8cb into master May 12, 2021
@Zaharid Zaharid deleted the NNPDF40_runcrads branch May 12, 2021 14:13
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

9 participants