Merge devel into master by amcadmus · Pull Request #1542 · deepmodeling/deepmd-kit

amcadmus · 2022-03-07T01:28:32Z

No description provided.

* add model compression for se-t descriptor * optimize range structure for se-t * add CPU support for se-t type model compression * add gradient define for op TabulateFusion * add ut for st_t model compression descriptor * fix UT error * add rocm support * address comments * Update compress.md * fix typo * bump model version from 1.0 to 1.1 * bump model version from 1.0 to 1.1 * fix bug of model compression * fix single precision error * fix UT error * address comments * upgrade convert support from 2.0 to 2.1 * add 2.0 convert-from support * update doc for model compatibility * Update model-compatability.md * Update model-compatability.md Co-authored-by: denghuilu <denghuilu@pku.edu.cn>

…odeling#1228) * enable model compression for dipole and polar type fitting net * delete debug code

another typo. Co-authored-by: Han Wang <amcadmus@gmail.com>

typo update Co-authored-by: Han Wang <amcadmus@gmail.com>

…ace (deepmodeling#1233) * fix compress training bug within the dp train --init-frz-model interface * address comments * rename _transfer_graph_def function within freeze.py

tensorflow doesn't support it

skip musllinux build

Fix typo in training.md

* deprecate `numb_test` See deepmodeling#1243. * bugfix

Fix deepmodeling#1248 (only the first part).

* fix bug of loc_frame descriptor, due to the change of nei list convention introduced in v2 * fix change of format Co-authored-by: Han Wang <wang_han@iapcm.ac.cn>

fix the np.frombuffer in dp transfer

* accelerate model compression * remove redundancy

/home/runner/work/deepmd-kit/deepmd-kit/deepmd/utils/graph.py:156: SyntaxWarning: "is not" with a literal. Did you mean "!="?

) * Improved activation function. * Improved activation function. * Added description of available activation functions. * Added description of available activation functions. * Added description of available activation functions. * Added description of available activation functions. * Update train-input.rst * Update train-input.rst * Update train-input.rst * Added description of available activation functions. * Added description of available activation functions. * Update train-input.rst * Added description of available activation functions. * Added description of available activation functions. * Added description of available activation functions. * Added description of available activation functions. * Update train-input.rst

See google/googletest#3663

This ensures the old model can be compressed by the new program.

* update guidelines for the number of threads Setting `OMP_NUM_THREADS` to 3 may be faster than 6. It needs to be confirmed. * add warnings if not adjusting threads

Co-authored-by: Han Wang <wang_han@iapcm.ac.cn>

* enable mixed precision support for dp * set the default embedding net & fitting net precision * add doc for mixed precision * fix typo * fix UT bug * use input script to control the mixed precision workflow * add tf version check for mixed precision * Update training-advanced.md * fix typo * fix TF_VERSION control * fix TF_VERSION comparison * enable mixed precision for hybrid descriptor * Update network.py * use parameter to control the network mixed precision output precision * add example for mixed precision training workflow * fix lint errors

…ing#1303)

Same as deepmodeling/dpdata#227.

Co-authored-by: pkulzy <47965866+pkulzy@users.noreply.github.com>

…ling#1306) * fix model compression bug when fparam and aparam are not zero * Update ener.py

fix deepmodeling#1286

…g#1424)

TF supports a remapper optimizer which remaps subgraphs onto more efficient implementations by replacing commonly occuring subgraphs with optimized fused monolithic kernels. However, its support is limited: (1) MatMul + BiasAdd (not Add) + Activation; (2) Float32 (but not float64); (3) Activation is Tanh; (4) MKL is built and used. This commit replaces Add by BiasAdd in the NN. The speed of a single op can be improved by about 20% when TF is using MKL and precision is set to float32. One can find `_MklNativeFusedMatMul` op in the profiler. See also: - https://www.tensorflow.org/guide/graph_optimization - https://github.com/tensorflow/tensorflow/blob/master/tensorflow/core/grappler/optimizers/remapper.cc (cherry picked from commit 8f2dc44)

* dynamically load op library in C++ interface C++ interface will dynamically load OP libraries, just like Python interface, so it no longer needs linking. * version cannot be NULL... * set LD_LIBRARY_PATH * install runUnitTest * remove CMAKE_LINK_WHAT_YOU_USE It is not necessary anymore. * also remove CMAKE_LINK_WHAT_YOU_USE in api_cc/tests * change the type of op library to MODULE * add the absolute path of library directory to cc rpath * Revert "add the absolute path of library directory to cc rpath" This reverts commit fabdac9. * add `-Wl,--disable-new-dtags` * Revert "add `-Wl,--disable-new-dtags`" This reverts commit ecccb57. * dlopen from dp lib but not TF

The following migrators were applied: - Migrate to `build.tools` configuration. This uses the new base Docker image based on Ubuntu 20.04 introduced in October 2021 and picks an appropriate Python version for your project (read [our blog post](https://blog.readthedocs.com/new-build-specification/) for details). Notice that now you can specify the Node.js, Rust, and Go versions as well. *Note:* Some system dependencies are not preinstalled anymore, so this might require manually adding them to `build.apt_packages` (see [our documentation](https://docs.readthedocs.io/en/stable/config-file/v2.html#build-apt-packages>)). - Migrate to Mamba as a drop-in replacement for Conda. Your project requested using Mamba instead of Conda for performance reasons. Now this is included in your configuration and you can change it without our intervention. Co-authored-by: Han Wang <amcadmus@gmail.com>

* Add image link of ROCm version. * Update easy-install.md Co-authored-by: Han Wang <amcadmus@gmail.com>

Fix deepmodeling#1339.

* add doc and examples for deep potential long-range * add "* *" to pair style. use warning section of RTD for warning message * rst syntax does not work in md. changed Co-authored-by: Han Wang <wang_han@iapcm.ac.cn>

…g#1463) `std` and `avg` are stored as variables. `saver.restore` will restore and override it. So `data_stat` before `saver.restore` is useless. Note that currently `init_from_frz_model` does not restore these variables. I also add a message before `data_stat`. It sometimes takes long time.

* bump the testing Python version to 3.10 Test whether deepmd-kit is compatible with the latest Python. TF 2.8 was just released with Python 3.10 support. * add cp310 to wheel building * the version is not a float...

* do some small optimization to ops 1. avoid concat or add in loops. Instead, append tensors to a list, and concat or accumulate_n after loops 2. remove a duplicated reshape * revert unnecessary changes * revert wfc.py as it has been decrepated

* fix bug of mixed precision training * Update test_mixed_prec_training.py * fix UT error * fix UT error * fix UT error * address comment

…ng#1477) * assign energy shift stats if atomic energies are assigned Atomic energies stats may be incorrect. Here, we use assigned atomic energies instead. I am training a model against multiple systems, where some atoms should not contribute energies. Before this commit, the loss of my model could not decrease. After this commit, it decreased quickly. * fix tests * add ut * keep and test the sum energy * fix typo

* calculate neighbor statistics from CLI In many cases, we need to confirm neighbor statistics before training. This command provides such CLI option. * add UT and docs * fix typo * correct name of the class * remove data_requirement

Before, only NN parameters were recovered.

* test: move loading graphs to setUpClass Loading graphs is expensive. * small fix * fix a typo * fix typo

* add an interface to eval descriptors Fix deepmodeling#1393. * move eval descriptor out of eval * bugfix and remove unnecessary changes * fix test * merge duplicated codes * bugfix * imap * Update deep_pot.py * do not return tuple * fix typo * make the code more readable

* run test_python on pre-built container * pin mistune to <2 See miyakogi/m2r#66 * I don't understand why we need m2r... * install m2r2 instead * try to encode with utf-8 instead.. * add back pip cache * Update test_python.yml * bump importlib_metadata version * Revert "bump importlib_metadata version" This reverts commit b8b1727. * Revert "Update test_python.yml" This reverts commit d9a906e. * set PYTHONPATH * Revert "set PYTHONPATH" This reverts commit d35bcf0.

…ng#1490) * typo: 'should be in consistent' -> 'should be consistent' * typo: 'notice' -> 'important' Co-authored-by: Han Wang <amcadmus@gmail.com>

Fix deepmodeling#1491.

Co-authored-by: Han Wang <amcadmus@gmail.com>

`memset`, `cudaMemset`, and `hipMemset` only accept integer as value. It's meaningless to pass a float, whether the compiler optimizes or not. References: https://www.cplusplus.com/reference/cstring/memset/ https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__MEMORY.html#group__CUDART__MEMORY_1gf7338650f7683c51ee26aadc6973c63a https://rocmdocs.amd.com/en/latest/ROCm_API_References/HIP_API/Memory-Management.html#hipmemset

fix deepmodeling#1493

) Co-authored-by: Han Wang <amcadmus@gmail.com>

codecov-commenter · 2022-03-07T01:31:59Z

Codecov Report

Merging #1542 (f61ff4a) into master (159e45d) will decrease coverage by 11.66%.
The diff coverage is n/a.

❗ Current head f61ff4a differs from pull request most recent head 0ff22bc. Consider uploading reports for the commit 0ff22bc to get more accurate results

@@             Coverage Diff             @@
##           master    #1542       +/-   ##
===========================================
- Coverage   75.95%   64.28%   -11.67%     
===========================================
  Files          91        5       -86     
  Lines        7260       14     -7246     
===========================================
- Hits         5514        9     -5505     
+ Misses       1746        5     -1741

Impacted Files	Coverage Δ
deepmd/common.py
deepmd/descriptor/descriptor.py
deepmd/descriptor/hybrid.py
deepmd/descriptor/loc_frame.py
deepmd/descriptor/se.py
deepmd/descriptor/se_a.py
deepmd/descriptor/se_r.py
deepmd/descriptor/se_t.py
deepmd/entrypoints/__init__.py
deepmd/entrypoints/compress.py
... and 76 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 159e45d...0ff22bc. Read the comment docs.

wanghan-iapcm and others added 30 commits October 19, 2021 09:53

enable model compression for dipole and polar type fitting net (deepm…

2e6be16

…odeling#1228) * enable model compression for dipole and polar type fitting net * delete debug code

bump pip version (deepmodeling#1237)

70061f0

Update training.md (deepmodeling#1238)

3130caa

another typo. Co-authored-by: Han Wang <amcadmus@gmail.com>

Update dpdata.md (deepmodeling#1229)

6c41aa3

typo update Co-authored-by: Han Wang <amcadmus@gmail.com>

Fix compress training bug within the dp train --init-frz-model interf…

1a8fd73

…ace (deepmodeling#1233) * fix compress training bug within the dp train --init-frz-model interface * address comments * rename _transfer_graph_def function within freeze.py

skip musllinux build

956143d

tensorflow doesn't support it

Merge pull request deepmodeling#1247 from njzjz/patch-20211027

d463689

skip musllinux build

Merge pull request deepmodeling#1241 from JingchaoZhang/patch-3

75501ac

Fix typo in training.md

add init-frz-model support for se-t type descriptor (deepmodeling#1245)

edb8bd9

deprecate numb_test (deepmodeling#1249)

d57a53a

* deprecate `numb_test` See deepmodeling#1243. * bugfix

fix Python bugs of loc_frame descriptor (deepmodeling#1253)

0362659

Fix deepmodeling#1248 (only the first part).

fix bug of loc_frame descriptor when using lammps (deepmodeling#1255)

09d1391

* fix bug of loc_frame descriptor, due to the change of nei list convention introduced in v2 * fix change of format Co-authored-by: Han Wang <wang_han@iapcm.ac.cn>

Update transfer.py (deepmodeling#1246)

4e15b0d

fix the np.frombuffer in dp transfer

Accelerate model compression (deepmodeling#1274)

36275c8

* accelerate model compression * remove redundancy

Use c++14 for TF 2.7 (deepmodeling#1275)

43fc8b3

fix SyntaxWarning in graph.py (deepmodeling#1278)

77ce651

/home/runner/work/deepmd-kit/deepmd-kit/deepmd/utils/graph.py:156: SyntaxWarning: "is not" with a literal. Did you mean "!="?

add a citation badge (deepmodeling#1280)

abec07e

change googletest from master to main (deepmodeling#1292)

95e321f

See google/googletest#3663

updata_deepmd_input when compress (deepmodeling#1297)

b9e51e8

This ensures the old model can be compressed by the new program.

update guidelines for the number of threads (deepmodeling#1291)

3b96011

* update guidelines for the number of threads Setting `OMP_NUM_THREADS` to 3 may be faster than 6. It needs to be confirmed. * add warnings if not adjusting threads

fix typo updata->update (deepmodeling#1301)

9c517eb

Co-authored-by: Han Wang <wang_han@iapcm.ac.cn>

add embedding network dimension check of model compression (deepmodel…

3d93fd0

…ing#1303)

add importlib_metadata as dependency (deepmodeling#1308)

1bcb917

Same as deepmodeling/dpdata#227.

fix bugs about parameters of memset (deepmodeling#1302)

338ba31

Co-authored-by: pkulzy <47965866+pkulzy@users.noreply.github.com>

Fix model compression bug when fparam or aparam is not zero (deepmode…

843a3c5

…ling#1306) * fix model compression bug when fparam and aparam are not zero * Update ener.py

add space between words in messages (deepmodeling#1312)

3869a5b

do not print virial error with nopbc data (deepmodeling#1314)

743c1e6

fix deepmodeling#1286

njzjz and others added 26 commits January 17, 2022 08:58

support recursive detection for the system of model_devi (deepmodelin…

a9d08a7

…g#1424)

fix cast_precision breaking docstring (deepmodeling#1437)

83f5dc8

Add image link of ROCm version. (deepmodeling#1432)

6238482

* Add image link of ROCm version. * Update easy-install.md Co-authored-by: Han Wang <amcadmus@gmail.com>

fix a typo in document (deepmodeling#1445)

bcfb127

Fix deepmodeling#1339.

Dplr doc and examples (deepmodeling#1458)

1ee6987

* add doc and examples for deep potential long-range * add "* *" to pair style. use warning section of RTD for warning message * rst syntax does not work in md. changed Co-authored-by: Han Wang <wang_han@iapcm.ac.cn>

bump the Python version to 3.10 (deepmodeling#1465)

0d8fe0a

* bump the testing Python version to 3.10 Test whether deepmd-kit is compatible with the latest Python. TF 2.8 was just released with Python 3.10 support. * add cp310 to wheel building * the version is not a float...

fix bug of mixed precision training (deepmodeling#1471)

f015f58

* fix bug of mixed precision training * Update test_mixed_prec_training.py * fix UT error * fix UT error * fix UT error * address comment

recover input stats from frozen models (deepmodeling#1482)

17d1443

Before, only NN parameters were recovered.

test: move loading graphs to setUpClass (deepmodeling#1484)

45c2feb

* test: move loading graphs to setUpClass Loading graphs is expensive. * small fix * fix a typo * fix typo

typo: 'should be in consistent' -> 'should be consistent' (deepmodeli…

d9a4a86

…ng#1490) * typo: 'should be in consistent' -> 'should be consistent' * typo: 'notice' -> 'important' Co-authored-by: Han Wang <amcadmus@gmail.com>

make OpenMP an optional dependency (deepmodeling#1498)

87b4930

Fix deepmodeling#1491.

typo: 'productive' -> 'production' (deepmodeling#1497)

ea7fbde

Co-authored-by: Han Wang <amcadmus@gmail.com>

fix the example of inference with C++ interface (deepmodeling#1504)

52ba6d9

fix deepmodeling#1493

typos: fix more small grammar issues on install pages (deepmodeling#1503

336c273

) Co-authored-by: Han Wang <amcadmus@gmail.com>

y-axis should use log scale instead of symlog (deepmodeling#1514)

3eb2550

Merge branch 'master' into devel

0ff22bc

amcadmus requested a review from njzjz March 7, 2022 02:25

njzjz approved these changes Mar 7, 2022

View reviewed changes

wanghan-iapcm merged commit 3e54fea into deepmodeling:master Mar 7, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Merge devel into master#1542

Merge devel into master#1542
wanghan-iapcm merged 88 commits intodeepmodeling:masterfrom
amcadmus:master

amcadmus commented Mar 7, 2022

Uh oh!

codecov-commenter commented Mar 7, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

14 participants

Conversation

amcadmus commented Mar 7, 2022

Uh oh!

codecov-commenter commented Mar 7, 2022

Codecov Report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

14 participants