
[autoparallel] modify comm nodes' memory cost in construct chain #2263

Merged
Cypher30 merged 4 commits into hpcaitech:debug/ckpt-autoparallel from Cypher30:debug/ckpt-autoparallel on Jan 3, 2023

Cypher30 (Contributor) commented Jan 3, 2023

What's New?

In this PR, I modify how the memory cost of the communication nodes runtime_apply and runtime_comm_spec_apply is calculated in _construct_chain of the auto activation checkpoint solver. The results look good.

I also switch the compute cost to fwd_time and bwd_time, since these two keys in Node.meta are more compatible with the communication cost (which is a float after being aligned with the computational cost).
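
As a rough illustration of the second point, here is a minimal sketch of summing fwd_time and bwd_time per stage from each node's meta dict. It is not the code from this PR: FakeNode and chain_times are hypothetical names, the nodes are plain stand-ins rather than real graph nodes, and the timing values are made up.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class FakeNode:
    """Hypothetical stand-in for a graph node that carries a `meta` dict."""
    name: str
    meta: Dict[str, float] = field(default_factory=dict)


def chain_times(groups: List[List[FakeNode]]) -> Tuple[List[float], List[float]]:
    """Sum fwd_time / bwd_time over the nodes of each stage in a chain.

    Nodes without timing keys contribute 0.0, so compute nodes and
    communication nodes can be accumulated the same way once both
    carry float costs.
    """
    fwd, bwd = [], []
    for group in groups:
        fwd.append(sum(float(n.meta.get("fwd_time", 0.0)) for n in group))
        bwd.append(sum(float(n.meta.get("bwd_time", 0.0)) for n in group))
    return fwd, bwd


# Illustrative values only; a communication node sits next to a compute node.
groups = [
    [
        FakeNode("linear", {"fwd_time": 1.5, "bwd_time": 3.0}),
        FakeNode("runtime_apply", {"fwd_time": 0.25, "bwd_time": 0.25}),
    ],
    [FakeNode("relu", {"fwd_time": 0.5, "bwd_time": 0.5})],
]
fwd, bwd = chain_times(groups)
print(fwd, bwd)
```

Keeping both costs as floats under the same keys means the per-stage totals need no special case for communication nodes.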

Cypher30 requested a review from super-dainiu on January 3, 2023 03:38
Cypher30 merged commit 5c2ef9f into hpcaitech:debug/ckpt-autoparallel on Jan 3, 2023