[WIP] Added NLO and NNLO candidate runcards for NNPDF40 by enocera · Pull Request #675 · NNPDF/nnpdf

enocera · 2020-03-13T18:52:38Z

This PR contains two runcards (NLO and NNLO) for a candidate NNPDF40 fit with the currently available data set. @scarlehoff : you might consider to use these as a baseline for the n3fit requested by @juanrojochacon at the PC today.

voisey · 2020-03-13T18:56:33Z

Hi @enocera. One very minor comment: I noticed that this includes the 8 TeV single top data (for which I am currently working on implementing the correlation info) rather than the 7 TeV data (for which we have the full correlation info already implemented). Is there a reason for this?

enocera · 2020-03-13T19:02:47Z

Thanks for noticing @voisey . That was a typo. This is now corrected. (BTW I was not aware of the fact that we have correlations for the 8 TeV data now?)

voisey · 2020-03-13T19:08:17Z

No problem! Yes, we do. It's available here: https://atlas.web.cern.ch/Atlas/GROUPS/PHYSICS/PAPERS/TOPQ-2015-05/ and it has both the statistical correlation matrices and the systematic breakdowns. We got an email from Wolfgang Wagner on 8 Jan 2020 pointing us to the info. He also said he would put some of the tables into a machine readable format for us by the end of January but I've heard nothing from him since, so I decided to do it myself. There isn't that much info to convert so it shouldn't take me too long

juanrojochacon · 2020-03-16T08:11:15Z

Thanks, let me look at the candidate NNPDF4.0 runcard and I will let you know if I have any comments.

scarlehoff · 2020-03-16T09:07:43Z

Great, I'll start running the NLO one now. Let me know if the runcard changes!

One question, is there by any chance a nnfit baseline fit I can compare with?

enocera · 2020-03-16T09:09:59Z

@scarlehoff Please wait one second - I've to add the off-peak and forward W/Z data in the runcards. There's no nnfit baseline, but I can easily produce it if need be.

juanrojochacon · 2020-03-16T09:10:56Z

wait a sec, let me check the runcard first.

There is no nnfit baseline, so if required we would need to run it ex-profeso. I guess probably we want this baseline, right @enocera ? But first let us get the n3fit running, which is much faster and would allow us to identify any potential problem.

scarlehoff · 2020-03-16T09:12:15Z

Ok! I'll be in stand-by !

wrt the baseline, no problem, I was just wondering whether it existed.

enocera · 2020-03-16T10:34:22Z

Please note that I've updated the wiki correspondingly:
https://www.wiki.ed.ac.uk/display/nnpdfwiki/NNPDF4.0+dataset
and that the K-folding partitions suggested by @juanrojochacon a while ago MUST be updated with the file provided there.

juanrojochacon · 2020-03-16T17:46:24Z

Hi @enocera the run cards look good to me with a couple of small modifications:

I guess we also want to include the CMS 8 TeV dilepton 2D distributions right? So we should uncomment for example CMS_TTBAR_2D_DIFF_MTT_TRAP_NORM
Are we sure we want to fit the top pT from ATLAS lepton+jets 8 TeV? I understood that in this case the theory does not provide a good description of the data?

Other than that the runcards seem ready to go to me. @scarlehoff before running any fit, could you please send around a list of the chi2s at NLO and NNLO to try to identify if there is any obvious problem?

juanrojochacon · 2020-03-16T17:47:51Z

Also @scarlehoff for the k-foldings, I am not sure in which format do you need the info, but I guess that it should be possible to adapt my list and proposed partitions so that it is in one-to-one correspondence with the runcard that @enocera has produced?
Let me know if I can provide any further information!

enocera · 2020-03-16T18:07:19Z

I guess we also want to include the CMS 8 TeV dilepton 2D distributions right?
So we should uncomment for example CMS_TTBAR_2D_DIFF_MTT_TRAP_NORM

There's in principle double counting with the single differential CMS top pair at 8 TeV (CMSTOPDIFF8TEVTTRAPNORM) which is not commented. It's up to us to decide whether we want to replace it with one of the 2D differential distributions.

Are we sure we want to fit the top pT from ATLAS lepton+jets 8 TeV? I understood that in this
case the theory does not provide a good description of the data?

As we have shown in our LH proceedings, including or not the pT distribution is inconsequential. Maybe @stefanoforte has a strong opinion on this? I don't.

Other than that the runcards seem ready to go to me. @scarlehoff before running any fit,
could you please send around a list of the chi2s at NLO and NNLO to try to identify if there is
any obvious problem?

I of course checked all these numbers before opening the PR. Please have a look at these

NLO (with 200310-ern-dijets-nlo)
NNLO (with 200310-ern-dijets-nnlo)

which were computed with the iterated fit to dijet data (no PDF uncertainty included).

Also @scarlehoff for the k-foldings, I am not sure in which format do you need the info, but I guess that it should be possible to adapt my list and proposed partitions so that it is in one-to-one correspondence with the runcard that @enocera has produced?

I've already adapted the list at this link.

juanrojochacon · 2020-03-16T18:13:49Z

There's in principle double counting with the single differential CMS top pair at 8 TeV (CMSTOPDIFF8TEVTTRAPNORM) which is not commented. It's up to us to decide whether we want to replace it with one of the 2D differential distributions.

I seem to recall that the 2D distributions correspond to the dilepton dataset while the single-differential ones are for lepton+jets, so to the best of my understanding there is no double counting in such a case?

Are we sure we want to fit the top pT from ATLAS lepton+jets 8 TeV? I understood that in this
case the theory does not provide a good description of the data?

As we have shown in our LH proceedings, including or not the pT distribution is inconsequential. Maybe @stefanoforte has a strong opinion on this? I don't.

Me neither, but since the chi2 is not optimal perhaps we can facilitate the life of the optimiser? Again, this is a minute thing.

Other than that the runcards seem ready to go to me. @scarlehoff before running any fit,
could you please send around a list of the chi2s at NLO and NNLO to try to identify if there is
any obvious problem?

I of course checked all these numbers before opening the PR. Please have a look at these

NLO (with 200310-ern-dijets-nlo)

NNLO (with 200310-ern-dijets-nnlo)

which were computed with the iterated fit to dijet data.

Ok thanks I will check this and comment back.

Also @scarlehoff for the k-foldings, I am not sure in which format do you need the info, but I guess that it should be possible to adapt my list and proposed partitions so that it is in one-to-one correspondence with the runcard that @enocera has produced?

I've already adapted the list at this link.

Amazing, many thanks ;)

enocera · 2020-03-16T18:17:21Z

I seem to recall that the 2D distributions correspond to the dilepton dataset while the single-differential ones are for lepton+jets, so to the best of my understanding there is no double counting in such a case?

I had the same recollection, but that's not the case. I've checked the papers earlier today while preparing the table at https://www.wiki.ed.ac.uk/display/nnpdfwiki/NNPDF4.0+dataset. But please feel free to double check.

juanrojochacon · 2020-03-16T18:22:31Z

Well in

https://arxiv.org/abs/1703.01630

they say "The measurement is performed in the dilepton e±µ∓ final state", so indeed there is no double counting? And then they quote the older paper

https://arxiv.org/abs/1505.04480

which is lepton+jets. Or I am missing something obvious?

enocera · 2020-03-16T21:55:43Z

@juanrojochacon I detest to say that you're right. I've corrected the runcards and all files/numbers linked to https://www.wiki.ed.ac.uk/display/nnpdfwiki/NNPDF4.0+dataset.

juanrojochacon · 2020-03-17T09:07:17Z

Thanks, now it looks good to me

juanrojochacon

good to go

scarlehoff · 2020-03-17T09:20:48Z

Great, I'll start running with these runcards.
I've already run a few fits with the iterated runcards of this PR and seems to work ok. If there are no unexpected problems I should be able to produce reports by this afternoon.

enocera · 2020-03-17T22:57:47Z

@scarlehoff Please note that there was still a bug in the NNLO runcard: the EWK K-factors were missing for the ATLASPHT12 and ATLASPHT15 data sets (they were not propagated to theory 53). You have to download theory 53 again to get these.

siranipour · 2020-04-01T15:09:40Z

Just out of curiosity, is the DISonly runcard going to remain as the one that's in

nnpdf/n3fit/runcards/PN3_DIS_example.yml

Lines 1 to 56 in f9bbe6f

    
           # 
        
           # Configuration file for NNPDF++ 
        
           # 
        
           ############################################################ 
        
           description: NNPDF3.1 NNLO fitted charm global dataset 
        
           ############################################################ 
        
           # frac: training fraction 
        
           # ewk: apply ewk k-factors 
        
           # sys: systematics treatment (see systypes) 
        
           experiments: 
        
           # Fixed target DIS 
        
             - experiment: NMC 
        
               datasets: 
        
                 - { dataset: NMCPD, frac: 0.5 } 
        
                 - { dataset: NMC,   frac: 0.5  } 
        
             - experiment: SLAC 
        
               datasets: 
        
                 - { dataset: SLACP, frac: 0.5} 
        
                 - { dataset: SLACD, frac: 0.5} 
        
             - experiment: BCDMS 
        
               datasets: 
        
                 - { dataset: BCDMSP, frac: 0.5} 
        
                 - { dataset: BCDMSD, frac: 0.5} 
        
             - experiment: CHORUS 
        
               datasets: 
        
                 - { dataset: CHORUSNU, frac: 0.5} 
        
                 - { dataset: CHORUSNB, frac: 0.5} 
        
             - experiment: NTVDMN 
        
               datasets: 
        
                 - { dataset: NTVNUDMN, frac: 0.5} 
        
                 - { dataset: NTVNBDMN, frac: 0.5} 
        
           # EMC F2C data 
        
           #  - experiment: EMCF2C 
        
           #    datasets: 
        
           #      - { dataset: EMCF2C, frac: 1.0} 
        
           # HERA data 
        
             - experiment: HERACOMB 
        
               datasets: 
        
                 - { dataset: HERACOMBNCEM , frac: 0.5} 
        
                 - { dataset: HERACOMBNCEP460, frac: 0.5} 
        
                 - { dataset: HERACOMBNCEP575, frac: 0.5} 
        
                 - { dataset: HERACOMBNCEP820, frac: 0.5} 
        
                 - { dataset: HERACOMBNCEP920, frac: 0.5} 
        
                 - { dataset: HERACOMBCCEM , frac: 0.5} 
        
                 - { dataset: HERACOMBCCEP , frac: 0.5} 
        
           # Combined HERA charm production cross-sections 
        
             - experiment: HERAF2CHARM 
        
               datasets: 
        
                 - { dataset: HERAF2CHARM, frac: 0.5} 
        
           # F2bottom data 
        
             - experiment: F2BOTTOM 
        
               datasets:  
        
                 - { dataset: H1HERAF2B, frac: 1.0} 
        
                 - { dataset: ZEUSHERAF2B, frac: 1.0}

juanrojochacon · 2020-04-01T15:10:56Z

no this is an old card, for NNPDF3.1-like fits

juanrojochacon · 2020-04-01T15:11:12Z

@siranipour you should take the NNPDF4.0 card and remove all the non-DIS experiments

scarlehoff · 2020-05-27T06:41:44Z

Hi @enocera (cc @scarrazza @juanrojochacon @stefanoforte) I've tried doing a global fit, current is the fit after hyperoptimization and reference before.

nnpdf40-fit
nnpdf40-fit extra flexibility

As you can see the fits after some hyperoptimization "work better" (they are more stable from one run to the next) but the chi2 of the central replica is quite bad.

I'm looking through https://www.wiki.ed.ac.uk/download/attachments/431524064/NNLO_chi2.txt?version=2&modificationDate=1584395434672&api=v2 and it seems that the experiments these fits fail to describe well (such as ATLAS TPTNORM, ATLAS_1JET_8TEV_R06, and even D0WEASY or ATLASTTBARTOT) were already more problematic in that document.

I was wondering whether there could be some problem with these datasets.
Before expanding more the network to add flexibility to force lower chi2 (which seems to improve the chi2 but in exchange of some overfitting) I've tried doing a fit without these "problematic" datasets just in case:

fit with some datasets removed

which seems to do much better (note that I also removed here positivity, this was my first test but didn't really make a difference, I just forgot to put it back, example here )

Any insights? Let me know if you'd like any other fits to complete the info (bear in mind they take 3-6 hours to be ready in the best case).

Some extra info:

Before fitting I removed the theory_53 folder to force a redownload in case there was a problem with my local copy of the data. For the set of data (and the training validation splits) I am using the latest runcards form this branch.

All in all, this is my runcard:
250520_global_bestworst.txt

enocera · 2020-05-27T07:10:37Z

@scarlehoff I think that the somewhat high numbers you find are expected, and consistent with our recent top study (https://inspirehep.net/literature/1783782) for ATLASTPTNORM and ATLASTTBARTOT, with our recent jet study (https://inspirehep.net/literature/1797633) for ATLAS_1JET_8TEV_R06 and with NNPDF3.1 for D0WEASY. We have indication that the experimental covariance matrix is problematic in these cases. Our claim (to be checked) is that the chi2 can be improved by regularizing/decorrelating the covariance matrix, but that this leaves the PDFs unchanged.

enocera · 2021-04-23T12:00:51Z

Thanks. Then I have a collateral question (for the record): why the data sets on F2c and F2b were not removed from the LO NNPDF3.1 fit?

juanrojochacon · 2021-04-23T12:02:18Z

I think technically F2c is not zero at LO but rather reduced to the massless calculation. But this is terrible for low Q. So it is a bit ambiguous but I think it is safer to remove them

enocera · 2021-04-23T12:09:16Z

@juanrojochacon Thanks for the clarification. I apologise for having overlooked W+jet.
@RoyStegeman I think that there's now sufficient consensus on the correctness of the LO runcards, and therefore room for additional investigations of the corresponding fits.

RoyStegeman · 2021-04-23T12:12:11Z

Perfect! Thanks all for your quick replies.

juanrojochacon · 2021-04-23T12:13:22Z

agreed, let's see what happens now. If the problem persist, we should store the chi2 logs dataset per dataset to pin down the problem @scarlehoff

wilsonmr · 2021-04-23T12:18:36Z

That's easier said than done - the training validation split is done at the level of the experiments and we never calculate per dataset chi2 at present.

I wonder if we should open a new discussion of this study, the conversation is rather long here and nothing to do with the title (but of course very important).

wilsonmr · 2021-04-23T12:22:27Z

I'm also opening a separate issue wrt the PDG plots for similar reason - the discussion will likely be forgotten about or duplicated as is.

RoyStegeman · 2021-04-23T12:23:37Z

There is already an issue for the PDG plots: https://github.com/NNPDF/papers/issues/27

wilsonmr · 2021-04-23T12:37:08Z

Cheers @RoyStegeman I still opened the issue incase we want that code in this repo - if not then somebody can have the endorphin rush of closing the issue without the usual associated pain..

I also opened an issue about fitting at LO, if the new runcard solves the issues then perhaps this will also be closed relatively soon.

I wonder if at some point we should merge this PR? It seems somehow useful to have the current runcards on the main branch even if a PR in the future must be made to update them. I guess at some point this was discussed before but rejected for some reason, but if not then I'll re/propose the idea.

Zaharid · 2021-04-23T12:47:55Z

@wilsonmr I guess we didn't believe we had anything reasonably close to final runcards until today (with the LO one checked), but it probably makes sense now. There are various modifications in the pipeline and I think it is up to @enocera to decide if he prefers managing them from here or master.

enocera · 2021-04-23T12:50:45Z

There are various modifications in the pipeline and I think it is up to @enocera to decide if he prefers managing them from here or master.

It's immaterial, insofar as I'm concerned.

RoyStegeman · 2021-04-23T20:49:45Z

CMS_WCHARM_DIFF_UNNORM_13TEV does not have any frac or cfac set. Is that intentional or should it be changed?

enocera · 2021-04-23T21:35:59Z

@RoyStegeman Good spot! The missing training fraction is a mistake - fixed now through all the LO and NLO runcards; the missing cfac should be fine, W+c data are excluded from the computation of similarity cuts.

RoyStegeman · 2021-04-24T10:24:39Z

I noticed you also removed the posf2c constraint from the regular (non pch) LO runcard. Is there a reason for this?

enocera · 2021-04-24T10:32:06Z

@RoyStegeman It's the same reason for which Juan suggested to remove F2c for HERA.

RoyStegeman · 2021-05-10T21:42:08Z

@enocera I think the LO runcards can be replaced by the ones below (make sure to change the extension to .yml). LO pch is iterated wrt the current version in the repo while the fitted charm is slightly changed to be positive definite in the flavour basis.

Further things to note are:

Both runcards are without HERA F2c
Both runcards include ATLAS_WMU_8TEV (not the t0 pdfs though)
I noticed you didn't add NNLO K-factors for DYE886R to the NNLODatasets in the LO runcards. I did not include them here either since I don't know if they should, but just wanted to point it out since it seems inconsistent to me.

NNPDF40_lo_as_0118_pch.txt
NNPDF40_lo_as_0118.txt

…factor for DYE886R

enocera · 2021-05-10T22:44:11Z

@RoyStegeman Thanks a lot. The LO runcards should now read like the ones you pasted above and they should alos include the missing NNLO K-factors. TOmorrow I'll update them with the missing Seaquest dataset.

scarlehoff

Approving by explicit request from @Zaharid

enocera requested review from juanrojochacon and scarlehoff March 13, 2020 18:52

juanrojochacon previously approved these changes Mar 17, 2020

View reviewed changes

enocera changed the title ~~Added NLO and NNLO candidate runcards for NNPDF40~~ [WIP] Added NLO and NNLO candidate runcards for NNPDF40 Mar 18, 2020

enocera mentioned this pull request Apr 5, 2020

ATLAS and CMS W/Charm #704

Merged

scarrazza marked this pull request as draft April 22, 2020 08:25

Removed data sets that should not go into the LO fit

991f6aa

This was referenced Apr 23, 2021

Plots with many flavours in PDG style #1205

Closed

LO fits with 4.0 methodology #1206

Closed

removed text

068b9a8

Corrected missing training fraction

82ff676

Zaharid added the 4.0-blocker label May 7, 2021

enocera added 2 commits May 10, 2021 16:23

Updated runcards to include missing NNLO K-factors for DYE886R

6245183

Added ATLAS_WMU_8TEV data set to NNPDF4.0 example runcards

c12fd93

Corrected LO runcards according to RS fits; added missing NNLO QCD K-…

eb50941

…factor for DYE886R

scarlehoff marked this pull request as ready for review May 12, 2021 14:11

scarlehoff approved these changes May 12, 2021

View reviewed changes

Zaharid merged commit c97c8cb into master May 12, 2021

Zaharid deleted the NNPDF40_runcrads branch May 12, 2021 14:13

Conversation

enocera commented Mar 13, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

voisey commented Mar 13, 2020

Uh oh!

enocera commented Mar 13, 2020

Uh oh!

voisey commented Mar 13, 2020

Uh oh!

juanrojochacon commented Mar 16, 2020

Uh oh!

scarlehoff commented Mar 16, 2020

Uh oh!

enocera commented Mar 16, 2020

Uh oh!

juanrojochacon commented Mar 16, 2020

Uh oh!

scarlehoff commented Mar 16, 2020

Uh oh!

enocera commented Mar 16, 2020

Uh oh!

juanrojochacon commented Mar 16, 2020

Uh oh!

juanrojochacon commented Mar 16, 2020

Uh oh!

enocera commented Mar 16, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

juanrojochacon commented Mar 16, 2020

Uh oh!

enocera commented Mar 16, 2020

Uh oh!

juanrojochacon commented Mar 16, 2020

Uh oh!

enocera commented Mar 16, 2020

Uh oh!

juanrojochacon commented Mar 17, 2020

Uh oh!

juanrojochacon left a comment

Choose a reason for hiding this comment

Uh oh!

scarlehoff commented Mar 17, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

enocera commented Mar 17, 2020

Uh oh!

siranipour commented Apr 1, 2020

Uh oh!

juanrojochacon commented Apr 1, 2020

Uh oh!

juanrojochacon commented Apr 1, 2020

Uh oh!

scarlehoff commented May 27, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

enocera commented May 27, 2020

Uh oh!

enocera commented Apr 23, 2021

Uh oh!

juanrojochacon commented Apr 23, 2021

Uh oh!

enocera commented Apr 23, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

RoyStegeman commented Apr 23, 2021

Uh oh!

juanrojochacon commented Apr 23, 2021

Uh oh!

wilsonmr commented Apr 23, 2021

Uh oh!

wilsonmr commented Apr 23, 2021

Uh oh!

RoyStegeman commented Apr 23, 2021

Uh oh!

wilsonmr commented Apr 23, 2021

Uh oh!

Zaharid commented Apr 23, 2021

Uh oh!

enocera commented Apr 23, 2021

enocera commented Mar 13, 2020 •

edited

Loading

enocera commented Mar 16, 2020 •

edited

Loading

scarlehoff commented Mar 17, 2020 •

edited

Loading

scarlehoff commented May 27, 2020 •

edited

Loading

enocera commented Apr 23, 2021 •

edited

Loading