
Reconstruct the interface of model compression #921

Merged

amcadmus merged 12 commits into deepmodeling:devel from denghuilu:reconstruct-model-compression on Aug 9, 2021

Conversation

@denghuilu (Member) commented Aug 4, 2021

Before this pull request, the model compression module in deepmd-kit required users to provide the training script as well as the training data in order to compress a given frozen model. This pull request introduces a more convenient and robust model compression interface that removes the dependencies on the training script and data.

The main changes are:

A follow-up pull request will add an item to the convert-from interface, allowing users to upgrade their existing frozen models to the new model compression interface.
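The core idea can be sketched as follows. This is a minimal, hypothetical illustration, not the actual deepmd-kit code: the attribute name `train_attr/training_script` and the function signatures are assumptions made for the example. At freeze time the training JSON is embedded in the frozen model, so that compression can later recover it without the original files:

```python
import json

def freeze_model(weights, training_jdata):
    # Hypothetical sketch: store the training JSON inside the frozen
    # model itself, so compression no longer needs the original
    # training script or data.
    return {
        "weights": weights,
        "train_attr/training_script": json.dumps(training_jdata),
    }

def compress_model(frozen_model):
    # Recover the training configuration directly from the model file
    # instead of asking the user for the original input JSON.
    return json.loads(frozen_model["train_attr/training_script"])

model = freeze_model({"w": [0.1, 0.2]}, {"model": {"type_map": ["O", "H"]}})
recovered = compress_model(model)
```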

@codecov-commenter commented Aug 4, 2021

Codecov Report

Merging #921 (a1f087d) into devel (ee0ed99) will increase coverage by 0.11%.
The diff coverage is 94.11%.


@@            Coverage Diff             @@
##            devel     #921      +/-   ##
==========================================
+ Coverage   75.42%   75.53%   +0.11%     
==========================================
  Files          85       85              
  Lines        6730     6770      +40     
==========================================
+ Hits         5076     5114      +38     
- Misses       1654     1656       +2     
Impacted Files Coverage Δ
deepmd/entrypoints/freeze.py 74.46% <ø> (ø)
deepmd/entrypoints/main.py 88.29% <ø> (-0.13%) ⬇️
deepmd/entrypoints/compress.py 91.11% <83.33%> (-1.39%) ⬇️
deepmd/common.py 83.44% <86.66%> (+0.35%) ⬆️
deepmd/train/trainer.py 71.92% <96.77%> (+0.85%) ⬆️
deepmd/entrypoints/train.py 87.97% <100.00%> (+1.21%) ⬆️
deepmd/utils/argcheck.py 87.58% <100.00%> (+0.04%) ⬆️
deepmd/utils/errors.py 100.00% <100.00%> (ø)
deepmd/utils/data.py 90.33% <0.00%> (-0.03%) ⬇️

Continue to review the full report at Codecov.

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update ee0ed99...a1f087d.

@njzjz (Member) commented Aug 5, 2021

I have another question. What is the behavior if one uses another JSON to restart the training? For example, in active learning cycles, we can continue training from the last iteration.

@denghuilu (Member, Author) replied:

> I have another question. What is the behavior if one uses another JSON to restart the training? For example, in active learning cycles, we can continue training from the last iteration.

The model will save the latest JSON file.

@amcadmus (Member) commented Aug 5, 2021

> I have another question. What is the behavior if one uses another JSON to restart the training? For example, in active learning cycles, we can continue training from the last iteration.

In the init-model mode, overwriting the existing training script is the expected behavior.

In the restart mode, the training script is expected to be the same.
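The distinction described above could be sketched like this. This is a hypothetical illustration; the function and argument names are assumptions for the example, not deepmd-kit's actual API:

```python
def resolve_training_script(mode, stored_script, new_script):
    # init-model: start a fresh run from old weights; the new JSON
    # overwrites the script stored with the model.
    if mode == "init-model":
        return new_script
    # restart: continue the same run; the scripts are expected to match.
    if mode == "restart":
        if stored_script != new_script:
            raise RuntimeError("training script changed across a restart")
        return stored_script
    raise ValueError(f"unknown mode: {mode}")
```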

@denghuilu requested review from amcadmus and njzjz on August 6, 2021
@njzjz (Member) left a comment:

Also, please fix lint warnings.

@njzjz (Member) commented Aug 7, 2021

```python
if self.is_compress == False:
    # Usually, the type number of the model should be equal to that of the data
    # However, nt_model > nt_data should be allowed, since users may only want to
    # train using a dataset that only have some of elements
    assert (self.ntypes >= data.get_ntypes()), "ntypes should match that found in data"
self.stop_batch = stop_batch
# self.batch_size = data.get_batch_size()
if self.numb_fparam > 0 :
    log.info("training with %d frame parameter(s)" % self.numb_fparam)
else:
    log.info("training without frame parameter")
# self.type_map = data.get_type_map()
if self.is_compress == False:
    # Usually, the type number of the model should be equal to that of the data
    # However, nt_model > nt_data should be allowed, since users may only want to
    # train using a dataset that only have some of elements
    assert (self.ntypes >= data.get_ntypes()), "ntypes should match that found in data"
```

Duplicated lines
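A deduplicated version of the quoted trainer.py logic might look like the sketch below. The free function and the minimal stand-in classes are hypothetical, written only to make the example self-contained; in deepmd-kit this logic lives inside the DPTrainer class:

```python
import logging

log = logging.getLogger("trainer")

class _Data:
    # Hypothetical stand-in for the training data object.
    def get_ntypes(self):
        return 2

class _Trainer:
    # Hypothetical stand-in for the DPTrainer attributes used below.
    def __init__(self):
        self.is_compress = False
        self.ntypes = 3
        self.numb_fparam = 0
        self.stop_batch = None

def setup_training(trainer, data, stop_batch):
    # Deduplicated version of the quoted block: the ntypes check
    # appears exactly once.
    if not trainer.is_compress:
        # nt_model > nt_data is allowed: users may train on a dataset
        # containing only a subset of the model's element types.
        assert trainer.ntypes >= data.get_ntypes(), \
            "ntypes should match that found in data"
    trainer.stop_batch = stop_batch
    if trainer.numb_fparam > 0:
        log.info("training with %d frame parameter(s)", trainer.numb_fparam)
    else:
        log.info("training without frame parameter")

t = _Trainer()
setup_training(t, _Data(), 100)
```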

@amcadmus amcadmus requested a review from njzjz August 9, 2021 02:14
@amcadmus amcadmus merged commit 4a9943e into deepmodeling:devel Aug 9, 2021
@amcadmus amcadmus mentioned this pull request Aug 12, 2021
@denghuilu denghuilu deleted the reconstruct-model-compression branch August 21, 2021 09:46
gzq942560379 pushed a commit to HPC-AI-Team/deepmd-kit that referenced this pull request Sep 2, 2021
* remove dependences on training script and data from model compression

* reset function update_one_sel in train.py

* update the doc of model compression

* fix bug in UT

* optimize code for reviewer's comments

* undo changes to constant variables

* Update common.py

* update code structure of DPTrainer

* fix lint warnings in common.py

* fix duplicated lines within trainer.py

* Update trainer.py

* rm default values with False optional in argcheck.py
njzjz added a commit to njzjz/deepmd-kit that referenced this pull request Aug 21, 2022
deepmodeling#921 discussed that the tensors are compressed in the graph file. But it looks no... So at least we remove white space.
wanghan-iapcm pushed a commit that referenced this pull request Aug 24, 2022
#921 discussed that the tensors are compressed in the graph file. But it looks no... So at least we remove white space.
mingzhong15 pushed a commit to mingzhong15/deepmd-kit that referenced this pull request Jan 15, 2023
deepmodeling#921 discussed that the tensors are compressed in the graph file. But it looks no... So at least we remove white space.


Successfully merging this pull request may close these issues:

- [Feature Request] Save the range of embedding matrix input in the DP model
- [Feature Request] Save the input script to DP model

6 participants