Skip to content

Fix the expanding logic of SLURM_JOB_NODELIST and add unit tests for parallel training.#913

Merged
amcadmus merged 2 commits intodeepmodeling:develfrom
shishaochen:test_cluster_and_env
Aug 2, 2021
Merged

Fix the expanding logic of SLURM_JOB_NODELIST and add unit tests for parallel training.#913
amcadmus merged 2 commits intodeepmodeling:develfrom
shishaochen:test_cluster_and_env

Conversation

@shishaochen
Copy link
Collaborator

@shishaochen shishaochen commented Aug 1, 2021

For better code coverage and feature compatibility, add unit tests for #892 and #905.

Need to mention, the expanding logic of environment variable SLURM_JOB_NODELIST is fixed with a professional package python-hostlist.

@codecov-commenter
Copy link

codecov-commenter commented Aug 1, 2021

Codecov Report

Merging #913 (79efdb5) into devel (b5b15fa) will increase coverage by 0.47%.
The diff coverage is 100.00%.

Impacted file tree graph

@@            Coverage Diff             @@
##            devel     #913      +/-   ##
==========================================
+ Coverage   74.68%   75.16%   +0.47%     
==========================================
  Files          85       85              
  Lines        6748     6725      -23     
==========================================
+ Hits         5040     5055      +15     
+ Misses       1708     1670      -38     
Impacted Files Coverage Δ
deepmd/env.py 74.44% <ø> (+4.44%) ⬆️
deepmd/cluster/slurm.py 100.00% <100.00%> (+78.04%) ⬆️
deepmd/cluster/local.py 100.00% <0.00%> (+9.09%) ⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update b5b15fa...79efdb5. Read the comment docs.

@shishaochen shishaochen mentioned this pull request Aug 1, 2021
61 tasks
@shishaochen shishaochen changed the title Add unit tests of cluster and env. Fix the expanding logic of SLURM_JOB_NODELIST and add unit tests for parallel training. Aug 2, 2021
@amcadmus amcadmus requested a review from njzjz August 2, 2021 03:28
@amcadmus amcadmus merged commit ee0ed99 into deepmodeling:devel Aug 2, 2021
@shishaochen shishaochen deleted the test_cluster_and_env branch August 6, 2021 01:57
gzq942560379 pushed a commit to HPC-AI-Team/deepmd-kit that referenced this pull request Sep 2, 2021
…r parallel training. (deepmodeling#913)

* Add unit tests of `cluster` and `env`.

* Fix the expanding logic of `SLURM_JOB_NODELIST`.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants