Fix info file of mc2hessian by scarlehoff · Pull Request #1551 · NNPDF/nnpdf

scarlehoff · 2022-04-08T10:18:14Z

I'll also add a test for this.

Zaharid · 2022-04-08T10:24:03Z

Could we use a YAML parser for this? The silly search and replace thing was done because bootstrapping YAML there was difficult in 2015 for some reason.

Zaharid · 2022-04-08T11:13:21Z

The test is great but I have a minor quibble with it: The environment variable prepends the new path but does not disable the old ones https://gitlab.com/hepcedar/lhapdf/-/blob/513b44b3d6864acd805bdefc7caab6414be5371c/src/Paths.cc#L30
so there is the theoretical possibility that something gets shadowed with the installed PDFs. I would suggest doing this instead:

nnpdf/validphys2/src/validphys/lhio.py

Line 273 in c4084de

finally:

which could be wrapped as something like

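import contextlib

import lhapdf


@contextlib.contextmanager
def temp_lhapdf_path(folder):
    """Temporarily make ``folder`` the only LHAPDF search path."""
    oldpaths = lhapdf.paths()
    # Replace the search paths instead of prepending, so installed PDFs cannot shadow the test ones
    lhapdf.setPaths([str(folder)])
    try:
        yield
    finally:
        # Restore the original paths even if the body raises
        lhapdf.setPaths(oldpaths)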
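
A minimal sketch of how such a context manager behaves when the wrapped code fails. To keep it runnable without LHAPDF installed, `FakeLHAPDF` is a hypothetical stand-in that only mimics `paths()` and `setPaths()`; the folder names are made up for illustration.

```python
import contextlib


class FakeLHAPDF:
    """Hypothetical stand-in mimicking lhapdf.paths() and lhapdf.setPaths()."""

    def __init__(self, paths):
        self._paths = list(paths)

    def paths(self):
        return list(self._paths)

    def setPaths(self, paths):
        self._paths = list(paths)


# Made-up "installed" location, standing in for the real search path
lhapdf = FakeLHAPDF(["/usr/share/LHAPDF"])


@contextlib.contextmanager
def temp_lhapdf_path(folder):
    oldpaths = lhapdf.paths()
    lhapdf.setPaths([str(folder)])
    try:
        yield
    finally:
        lhapdf.setPaths(oldpaths)


def paths_during_failure(folder):
    """Return the paths seen inside the block and after an exception."""
    inside = None
    try:
        with temp_lhapdf_path(folder):
            inside = lhapdf.paths()
            raise RuntimeError("simulated test failure")
    except RuntimeError:
        pass
    return inside, lhapdf.paths()


inside, after = paths_during_failure("/tmp/test_pdfs")
print(inside)
print(after)
```

Inside the block only the temporary folder is searched, so nothing installed can shadow the test PDFs, and the previous paths come back even though the body raised.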

scarlehoff · 2022-04-08T11:20:29Z

> Could we use a YAML parser for this? The silly search and replace thing was done because bootstrapping YAML there was difficult in 2015 for some reason.

Turns out it is still difficult in 2022 because LHAPDF doesn't really use standard YAML: https://github.com/N3PDF/eko/blob/341e6323e680d2320ae4400da61c92a445160416/src/ekobox/genpdf/export.py#L88

RE the test, fine, I'll add that.

Zaharid · 2022-04-08T11:22:21Z

Why is that? They do use a yaml parser:

https://gitlab.com/hepcedar/lhapdf/-/tree/main/src/yamlcpp

scarlehoff · 2022-04-08T11:28:09Z

I'll direct you to the people who know why that was needed @alecandido @andreab1997

That said, it might be that's the case now but not historically and so past lhapdf are not truly yaml.

Then there's the question of supporting old codes (fortran, c) that might rely on the exact format of lhapdf info files (like the order, lists being enclosed in [] etc).
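
A rough sketch of the formatting concern, assuming PyYAML is available. The info content is illustrative rather than taken from a real set, and whether any legacy code actually depends on these details is not established here. It loads an info-like document, updates two fields, and dumps it while keeping key order and bracketed lists.

```python
import yaml  # PyYAML, assumed to be installed

# Illustrative info-file content, not a real PDF set
info_text = """\
SetDesc: "Illustrative set, not a real PDF"
NumMembers: 101
ErrorType: replicas
Flavors: [-3, -2, -1, 1, 2, 3, 21]
"""

info = yaml.safe_load(info_text)
info["NumMembers"] = 51
info["ErrorType"] = "symmhessian"

# sort_keys=False keeps the original key order;
# default_flow_style=None writes scalar-only lists inline as [a, b, c]
dumped = yaml.safe_dump(info, default_flow_style=None, sort_keys=False)
print(dumped)
```

Key order and the `[...]` list style survive the round trip with these options, but the quoting of string values is chosen by the dumper, so the output is not guaranteed to be byte-identical to the input.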

alecandido · 2022-04-08T11:33:40Z

Unfortunately I've not done it directly, it was really @andreab1997. I still hope I could do as much as possible without installing lhapdf, but in the simplest possible way (otherwise I could install lhapdf). And, unfortunately, when @andreab1997 attempted to parse info files it turned out that it was not standard YAML.

At this point, I wonder if it's just an old standard YAML, since lhapdf is vendoring the YAML library, and most likely not updating it very frequently...

alecandido · 2022-04-08T11:36:07Z

I know that YAML is wonderful for readability, but I'm more and more tempted to step on the JSON side: the full YAML spec is damn complicated 😞

The best compromise I'm aware of is https://hitchdev.com/strictyaml/

scarlehoff · 2022-04-08T11:38:33Z

> attempted to parse info files it turned out that it was not standard YAML.

We have always parsed the lhapdf info files using yaml

nnpdf/validphys2/src/validphys/lhaindex.py

Line 116 in c4084de

return result

that was last changed 4 years ago. Maybe it was a problem with some specific pdf sets? I remember there were sets in lhapdf that were broken but that's hardly our fault.

> I know that YAML is wonderful for readability, but I'm more and more tempted to step on the JSON side: the full YAML spec is damn complicated

It's not up to us here though.

Zaharid · 2022-04-08T11:39:06Z

Are there any examples of correct, commonly used LHAPDF sets (i.e. that are in the official repository) that cannot be parsed with yaml and yet work with lhapdf proper?

If not it seems to me we should proceed, as indeed we are doing elsewhere.

andreab1997 · 2022-04-08T11:44:06Z

> I'll direct you to the people who know why that was needed @alecandido @andreab1997
>
> That said, it might be that's the case now but not historically and so past lhapdf are not truly yaml.
>
> Then there's the question of supporting old codes (fortran, c) that might rely on the exact format of lhapdf info files (like the order, lists being enclosed in [] etc).

> Are there any examples of correct, commonly used LHAPDF sets (i.e. that are in the official repository) that cannot be parsed with yaml and yet work with lhapdf proper?
>
> If not it seems to me we should proceed, as indeed we are doing elsewhere.

I remember that I had this problem with CT14 probably. I don't remember exactly the PDF set but I can check. Anyway I believe that this problem is only in some old PDF sets.

alecandido · 2022-04-08T11:52:15Z

> I remember that I had this problem with CT14 probably. I don't remember exactly the PDF set but I can check. Anyway I believe that this problem is only in some old PDF sets.

This supports the conjecture of an oldish YAML library: maybe they are edge cases that could be parsed by every library at that time, but then spec and libraries evolved, and now some old files are violating something in the new specs.

Zaharid · 2022-04-08T11:54:49Z

Right, I think we should assume that both replica headers and info files are going to be working yaml since it is what lhapdf itself does.

alecandido · 2022-04-08T12:05:14Z

> Right, I think we should assume that both replica headers and info files are going to be working yaml since it is what lhapdf itself does.

Fine by me. I'd love not to depend on lhapdf for reading what's inside an info file.

scarlehoff · 2022-04-08T12:06:56Z

> Right, I think we should assume that both replica headers and info files are going to be working yaml since it is what lhapdf itself does.

Ok. Shall we do that in a separate PR? It will simplify #1537 if we also do it for postfit and anything else that might be writing info files, but it might also break something (who knows), so better if we isolate it.

Zaharid · 2022-04-08T12:09:29Z

> > Right, I think we should assume that both replica headers and info files are going to be working yaml since it is what lhapdf itself does.
>
> Ok. Shall we do that in a separate PR? It will simplify #1537 if we also do it for postfit and anything else that might be writing info files, but it might also break something (who knows), so better if we isolate it.

Yes please.

scarlehoff · 2022-04-08T12:11:09Z

Sorry, I've been an idiot.

I actually tested all pdfs in the LHAPDF server when I did lhapdf_management and I was able to read all info files with a standard yaml safe load

https://gitlab.com/scarlehoff/lhapdf/-/blob/master/python_management/test_itall.py#L60

I can rerun that, but I'm pretty sure I got no failing PDFs (on 09 Nov, 2021)
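
A minimal sketch of this kind of check, assuming PyYAML and the usual `SetName/SetName.info` directory layout. It is not the linked test script; the helper name and the two tiny info files are invented for illustration, one of them deliberately malformed.

```python
import tempfile
from pathlib import Path

import yaml  # PyYAML, assumed to be installed


def find_unparsable_info_files(root):
    """Return {path: error message} for every .info file that fails to load."""
    failures = {}
    for info_file in sorted(Path(root).glob("*/*.info")):
        try:
            yaml.safe_load(info_file.read_text())
        except yaml.YAMLError as err:
            failures[info_file] = str(err)
    return failures


with tempfile.TemporaryDirectory() as tmp:
    good = Path(tmp) / "GoodSet" / "GoodSet.info"
    bad = Path(tmp) / "BadSet" / "BadSet.info"
    for path in (good, bad):
        path.parent.mkdir()
    good.write_text("NumMembers: 101\nFlavors: [-1, 1, 21]\n")
    bad.write_text("Flavors: [-1, 1, 21\n")  # deliberately unterminated list
    failed_names = [p.name for p in find_unparsable_info_files(tmp)]

print(failed_names)
```

Pointing the helper at a real LHAPDF data directory would list any sets whose info file a standard safe load rejects.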

alecandido · 2022-04-08T12:15:05Z

Maybe it was something specific @andreab1997 was doing. It might be that the problem was in the dump, not in loading them.

Can you reconstruct what you found @andreab1997?

scarlehoff · 2022-04-08T12:16:14Z

I'll open an issue for this.

andreab1997 · 2022-04-08T12:31:49Z

> Maybe it was something specific @andreab1997 was doing. It might be that the problem was in the dump, not in loading them.
>
> Can you reconstruct what you found @andreab1997?

Yes, I think so.

alecandido · 2022-04-08T15:06:10Z

The fastest PR I've ever seen...

scarlehoff · 2022-04-08T15:08:15Z

It was an obvious bug with an obvious solution. We've had faster :P https://github.com/NNPDF/nnpdf/pulls?q=head%3Ahotfix+

fix info file

fbbd4ef

scarlehoff requested a review from Zaharid April 8, 2022 10:18

add test for mc2hessian

7546d06

change lhapdf data dir

1eaff2b

scarlehoff mentioned this pull request Apr 8, 2022

Use yaml for .info files. #1552

Open

2 tasks

Zaharid approved these changes Apr 8, 2022

Zaharid added the bug Something isn't working label Apr 8, 2022

scarlehoff merged commit 1c86eb5 into master Apr 8, 2022

scarlehoff deleted the hotfix_mc2hessian branch April 8, 2022 15:04
