use float constants and functions in float functions by njzjz · Pull Request #1647 · deepmodeling/deepmd-kit

njzjz · 2022-04-19T01:06:15Z

Before this commit, almost all functions used double constants even when built in float precision. A float operand is promoted to double when it is added to or multiplied by a double, so the arithmetic ran in double precision.
This PR also uses sqrtf and fmodf in the GPU device float functions, instead of the double-precision sqrt and fmod.
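
To illustrate the promotion described above, here is a minimal C++11 sketch. It is not code from this PR: the helper names (`half_via_double`, `half_via_float`, `half_generic`) and the `FPTYPE` template parameter name are hypothetical and chosen only for this example. It shows that a double literal pulls a float expression into double precision, that a float literal or a cast to the template type avoids this, and that the `<cmath>` overloads select single precision for float arguments.

```cpp
#include <cmath>
#include <type_traits>

// Hypothetical helpers for illustration; none of these names come from the PR.

// A double literal promotes the float operand: the multiply runs in double
// and the result is narrowed back to float on return.
inline float half_via_double(float x) { return x * 0.5; }

// A float literal keeps the whole expression in single precision.
inline float half_via_float(float x) { return x * 0.5f; }

// In templated code, casting the literal to the template type gives a
// constant of matching precision for both float and double instantiations.
template <typename FPTYPE>
FPTYPE half_generic(FPTYPE x) {
  return x * static_cast<FPTYPE>(0.5);
}

// Compile-time checks of the expression types.
static_assert(std::is_same<decltype(1.0f * 0.5), double>::value,
              "float * double literal is evaluated as double");
static_assert(std::is_same<decltype(1.0f * 0.5f), float>::value,
              "float * float literal stays float");

// <cmath> overloads std::sqrt and std::fmod on the argument type,
// so float arguments select the single-precision versions.
static_assert(std::is_same<decltype(std::sqrt(2.0f)), float>::value,
              "std::sqrt(float) returns float");
static_assert(std::is_same<decltype(std::fmod(5.0f, 3.0f)), float>::value,
              "std::fmod(float, float) returns float");
```

All three helpers return the same value for inputs such as 3.0f; the difference is only in the precision used for the intermediate arithmetic. In GPU device code, sqrtf and fmodf are the explicit single-precision spellings of the same operations.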

codecov-commenter · 2022-04-19T01:17:48Z

Codecov Report

Merging #1647 (90cb99f) into devel (de7ba72) will not change coverage.
The diff coverage is n/a.

@@           Coverage Diff           @@
##            devel    #1647   +/-   ##
=======================================
  Coverage   76.15%   76.15%           
=======================================
  Files          94       94           
  Lines        7850     7850           
=======================================
  Hits         5978     5978           
  Misses       1872     1872


njzjz · 2022-04-19T02:18:06Z

The CI failure also happens on the devel branch... Not sure what happened.

wanghan-iapcm · 2022-04-19T05:49:38Z

Does your revision pass the UTs on GPU?

njzjz · 2022-04-20T06:29:49Z

> Does your revision pass the UTs on GPU?

I cannot build the UTs on GPU even on the devel branch. Here is the error message:

/home/jz748/codes/deepmd-kit/source/lib/tests/test_tabulate_se_r.cc: In member function 'virtual void TestTabulateSeR_tabulate_fusion_se_r_grad_gpu_cuda_Test::TestBody()':
/home/jz748/codes/deepmd-kit/source/lib/tests/test_tabulate_se_r.cc:125:5: error: 'dy_dem_dev' was not declared in this scope
   * dy_dem_dev = NULL, * table_dev = NULL, * em_dev = NULL, * dy_dev = NULL;
     ^~~~~~~~~~
/home/jz748/codes/deepmd-kit/source/lib/tests/test_tabulate_se_r.cc:125:5: note: suggested alternative: 'dy_dem'
   * dy_dem_dev = NULL, * table_dev = NULL, * em_dev = NULL, * dy_dev = NULL;
     ^~~~~~~~~~
     dy_dem
/home/jz748/codes/deepmd-kit/source/lib/tests/test_tabulate_se_r.cc:125:26: error: 'table_dev' was not declared in this scope
   * dy_dem_dev = NULL, * table_dev = NULL, * em_dev = NULL, * dy_dev = NULL;
                          ^~~~~~~~~
/home/jz748/codes/deepmd-kit/source/lib/tests/test_tabulate_se_r.cc:125:26: note: suggested alternative: 'table'
   * dy_dem_dev = NULL, * table_dev = NULL, * em_dev = NULL, * dy_dev = NULL;
                          ^~~~~~~~~
                          table
/home/jz748/codes/deepmd-kit/source/lib/tests/test_tabulate_se_r.cc:125:46: error: 'em_dev' was not declared in this scope
   * dy_dem_dev = NULL, * table_dev = NULL, * em_dev = NULL, * dy_dev = NULL;
                                              ^~~~~~
/home/jz748/codes/deepmd-kit/source/lib/tests/test_tabulate_se_r.cc:125:46: note: suggested alternative: 'dy_dem'
   * dy_dem_dev = NULL, * table_dev = NULL, * em_dev = NULL, * dy_dev = NULL;
                                              ^~~~~~
                                              dy_dem
/home/jz748/codes/deepmd-kit/source/lib/tests/test_tabulate_se_r.cc:125:63: error: 'dy_dev' was not declared in this scope
   * dy_dem_dev = NULL, * table_dev = NULL, * em_dev = NULL, * dy_dev = NULL;
                                                               ^~~~~~
/home/jz748/codes/deepmd-kit/source/lib/tests/test_tabulate_se_r.cc:125:63: note: suggested alternative: 'dy_dem'
   * dy_dem_dev = NULL, * table_dev = NULL, * em_dev = NULL, * dy_dev = NULL;
                                                               ^~~~~~
                                                               dy_dem
make[2]: *** [CMakeFiles/runUnitTests.dir/build.make:426: CMakeFiles/runUnitTests.dir/test_tabulate_se_r.cc.o] Error 1

wanghan-iapcm · 2022-04-22T04:38:24Z

> I cannot build the UTs on GPU even on the devel branch.

Should be fixed by #1651

njzjz · 2022-04-22T06:02:21Z

> Does your revision pass the UTs on GPU?

Yes.

denghuilu

All UTs within the api_cc and lib folders have passed in an ROCm GPU environment, except this one:

Not found: Op type not registered 'MatrixDiagV3' in binary running on j17r2n14. Make sure the Op and Kernel are registered in the binary running in this process. Note that if you are loading a saved graph which used ops from tf.contrib, accessing (e.g.) `tf.contrib.resampler` should be done before importing the graph, as contrib ops are lazily registered when the module is first accessed.
unknown file: Failure
C++ exception with description "DeePMD-kit Error: TensorFlow Error: Not found: Op type not registered 'MatrixDiagV3' in binary running on j17r2n14. Make sure the Op and Kernel are registered in the binary running in this process. Note that if you are loading a saved graph which used ops from tf.contrib, accessing (e.g.) `tf.contrib.resampler` should be done before importing the graph, as contrib ops are lazily registered when the module is first accessed." thrown in SetUp().
[  FAILED  ] TestInferDeepPolarNew.cpu_build_nlist (378 ms)

This is evidently a TensorFlow library problem, so it should have little bearing on the correctness of the test program.

njzjz added 7 commits April 18, 2022 18:39

use float constant for float functions

e946782

kk

c37246b

kk

208fdc0

we have cmath in c++ 11

2068c3a

use float functions

4b1467d

revert changes to legacy codes

9730e9a

fix typo in ewald.cc

3521f9e

njzjz requested review from denghuilu and wanghan-iapcm April 19, 2022 01:06

njzjz marked this pull request as draft April 19, 2022 01:10

njzjz marked this pull request as ready for review April 19, 2022 02:17

njzjz added 2 commits April 22, 2022 01:00

Merge branch 'devel' into float

667684e

fix lib/tests CUDA installation

e32b25e

wanghan-iapcm approved these changes Apr 22, 2022

Merge branch 'devel' into float

15a6881

denghuilu reviewed Apr 23, 2022

source/lib/src/cuda/gelu.cu (outdated)

denghuilu reviewed Apr 23, 2022

source/lib/src/rocm/gelu.hip.cu (outdated)

use tanh for float

90cb99f

Co-authored-by: Denghui Lu <denghuilu@pku.edu.cn>

denghuilu approved these changes Apr 23, 2022

wanghan-iapcm merged commit 18ac81f into deepmodeling:devel Apr 28, 2022

njzjz mentioned this pull request May 24, 2022

correct type behavior when atomic energy is requested #1727

Merged

njzjz deleted the float branch October 11, 2022 22:41

njzjz added a commit to njzjz/deepmd-kit that referenced this pull request Oct 11, 2022

use float/double constant for spline5_switch

7a82645

Same as deepmodeling#1647, but a function was missing. Signed-off-by: Jinzhe Zeng <jinzhe.zeng@rutgers.edu>

njzjz mentioned this pull request Oct 11, 2022

use float/double constants for spline5_switch #1985

Merged

wanghan-iapcm pushed a commit that referenced this pull request Oct 12, 2022

use float/double constants for spline5_switch (#1985)

77c7ad4

Same as #1647, but a function was missing. Signed-off-by: Jinzhe Zeng <jinzhe.zeng@rutgers.edu>

mingzhong15 pushed a commit to mingzhong15/deepmd-kit that referenced this pull request Jan 15, 2023

use float/double constants for spline5_switch (deepmodeling#1985)

2bf66d5

Same as deepmodeling#1647, but a function was missing. Signed-off-by: Jinzhe Zeng <jinzhe.zeng@rutgers.edu>

wanghan-iapcm merged 11 commits into deepmodeling:devel from njzjz:float


4 participants
