-
Notifications
You must be signed in to change notification settings - Fork 451
Add an A case and an ELM case to CI #7617
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
@jgfouca these machines have like 4 procs (or at most 8, but it safer to assume only 2 or 4). Would it be ok to add |
|
@mahf708 , yes, i didn't notice that the other tests were doing that too (_P4). |
|
I will need to issue a PR in E3SM-Project/containers and add the new files needed, otherwise Rob (and ANL server people) will kill me. ... 👀 |
Why are these tests requesting 64 ranks? Isn't CIME supposed to limit the number of procs with whatever is in config_machines.xml? Edit: I see that ghci-oci does not specifies max_tasks_per_node=16. First, I would expect CIME to use 16, no? Second, @mahf708 how about changing that to <MAX_TASKS_PER_NODE>$SHELL{cat /proc/cpuinfo | grep processor | wc -l}</MAX_TASKS_PER_NODE>so that it works with whatever github throws at us? @jgfouca I'm assuming the above will work as intended (meaning, CIME will expand it with the shell output)? |
|
@bartgol , config_pes.xml is what determines ranks per case. We could have configured it such that ghci-oci just uses 4 ranks for everything, but P4 is fine too I think. |
I think some gh hosted runners may only have 2 cores. Can config_pes.xml use the |
|
Jim, not sure what's getting this over the disk limit. Maybe domain files? But the other test passed. Should we try to use ne4pg2_oQU480 for all these tests? What do you think? |
|
@mahf708 , I see one check still running. Where are you seeing that we are going over the limit? |
I made it run again; here's the first (failed) run: https://github.com/E3SM-Project/E3SM/actions/runs/17105290100/attempts/1; you can find the attempts on the top right in small box it'a simple disk error error
|
|
@mahf708 , OK, I switched them all to the same grid. |
|
@bishtgautam @rljacob I'm struggling to figure out the sweet combo that will make this test not try to download lots of data... but also, I'm hearing elm-betr likely not the best test to add here. Any suggestions for other tests that will cover common cases? Also, tagging @jonbob in case you'd like us to add a test or two for ocean/ice :) thanks. The goal of these tests is to basically run simple ones (no baselines for now) that can uncover low-order/basic build/run errors |
|
You can try adding a low res version of the SMS.r05_r05.I1850ELMCN.elm-qian_1948. Something like |
Add an A case and an ELM case to CI [BFB]
Add an A case and an ELM case to CI [BFB]
Add an A case and an ELM case to CI Merge 2 for this PR, forgot to update before merging. [BFB]
[BFB]