Skip to content

[Shardformer] Downstream bert#3979

Merged
FrankLeeeee merged 13 commits intohpcaitech:feature/shardformerfrom
FoolPlayer:downstream_bert
Jun 15, 2023
Merged

[Shardformer] Downstream bert#3979
FrankLeeeee merged 13 commits intohpcaitech:feature/shardformerfrom
FoolPlayer:downstream_bert

Conversation

@FoolPlayer
Copy link
Copy Markdown
Contributor

@FoolPlayer FoolPlayer commented Jun 13, 2023

📌 Checklist before creating the PR

  • I have created an issue for this PR for traceability
  • The title follows the standard format: [doc/gemini/tensor/...]: A concise description
  • I have added relevant tags if possible for us to better distinguish different PRs

🚨 Issue number

fixed #3974

📝 What does this PR do?

Add down stream model of bert
optimize the test code of bert

💥 Checklist before requesting a review

  • I have linked my PR to an issue (instruction)
  • My issue clearly describes the problem/feature/proposal, with diagrams/charts/table/code if possible
  • I have performed a self-review of my code
  • I have added thorough tests.
  • I have added docstrings for all the functions/methods I implemented

⭐️ Do you enjoy contributing to Colossal-AI?

  • 🌝 Yes, I do.
  • 🌚 No, I don't.

Tell us more if you don't enjoy contributing to Colossal-AI.

@FoolPlayer FoolPlayer linked an issue Jun 13, 2023 that may be closed by this pull request
@FoolPlayer
Copy link
Copy Markdown
Contributor Author

Local pytest of shardformer

image

@FrankLeeeee FrankLeeeee merged commit c291abf into hpcaitech:feature/shardformer Jun 15, 2023
FrankLeeeee pushed a commit that referenced this pull request Jun 26, 2023
* add dist dropout in model

* update docstring and bert policy with dropout

* refactor basepolicy and sharded, update bert

* update format

* update gpt2 policy

* update bert policy

* remove unused code

* update readme for new policy usage

* add downstream model of bert

* remove unused code
flybird11111 pushed a commit to flybird11111/ColossalAI that referenced this pull request Jul 3, 2023
* add dist dropout in model

* update docstring and bert policy with dropout

* refactor basepolicy and sharded, update bert

* update format

* update gpt2 policy

* update bert policy

* remove unused code

* update readme for new policy usage

* add downstream model of bert

* remove unused code
FrankLeeeee pushed a commit that referenced this pull request Jul 4, 2023
* add dist dropout in model

* update docstring and bert policy with dropout

* refactor basepolicy and sharded, update bert

* update format

* update gpt2 policy

* update bert policy

* remove unused code

* update readme for new policy usage

* add downstream model of bert

* remove unused code
ver217 pushed a commit to ver217/ColossalAI that referenced this pull request Jul 13, 2023
* add dist dropout in model

* update docstring and bert policy with dropout

* refactor basepolicy and sharded, update bert

* update format

* update gpt2 policy

* update bert policy

* remove unused code

* update readme for new policy usage

* add downstream model of bert

* remove unused code
@FoolPlayer FoolPlayer deleted the downstream_bert branch September 7, 2023 11:10
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[shardformer] Add down stream model of Bert

2 participants