
[shardformer] Support customized policy for llamav2 based model with HybridParallelPlugin#4614

Closed
eric8607242 wants to merge 3 commits into hpcaitech:feature/shardformer from eric8607242:feature/shardformer

Conversation

Contributor

@eric8607242 eric8607242 commented Sep 5, 2023

📌 Checklist before creating the PR

  • I have created an issue for this PR for traceability
  • The title follows the standard format: [doc/gemini/tensor/...]: A concise description
  • I have added relevant tags if possible for us to better distinguish different PRs

🚨 Issue number

#4613

📝 What does this PR do?

This PR makes two modifications to ShardFormer.

  1. I added a new argument, policy, to HybridParallelPlugin, which lets users apply HybridParallel to their own models with a customized Policy.
  2. I added a new attribute replacement, self_attn.num_key_value_heads, to LlamaPolicy. This attribute is new in LLaMAv2; without replacing it, I cannot apply tensor parallelism to LLaMAv2 successfully.
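To illustrate why the second change matters, here is a minimal sketch of the kind of attribute replacement involved. The function name and the exact replacement keys are illustrative assumptions, not the actual Shardformer API: under tensor parallelism, per-module head counts must be divided by the tensor-parallel group size, and LLaMAv2's grouped-query attention adds num_key_value_heads as a separate count that also needs dividing.

```python
def shard_attention_attrs(num_heads: int, num_key_value_heads: int, tp_size: int) -> dict:
    """Sketch: divide attention head counts across tensor-parallel ranks.

    LLaMAv2 introduces `num_key_value_heads` for grouped-query attention.
    If only `num_heads` is replaced, the sharded attention module still
    assumes the full key/value head count and shape checks fail.
    """
    # Head counts must divide evenly across tensor-parallel ranks.
    assert num_heads % tp_size == 0
    assert num_key_value_heads % tp_size == 0
    return {
        "self_attn.num_heads": num_heads // tp_size,
        "self_attn.num_key_value_heads": num_key_value_heads // tp_size,
    }

# Example: a GQA configuration with 32 query heads and 8 KV heads,
# sharded across 4 tensor-parallel ranks.
print(shard_attention_attrs(32, 8, 4))
```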

💥 Checklist before requesting a review

  • I have linked my PR to an issue (instruction)
  • My issue clearly describes the problem/feature/proposal, with diagrams/charts/table/code if possible
  • I have performed a self-review of my code
  • I have added thorough tests.
  • I have added docstrings for all the functions/methods I implemented

⭐️ Do you enjoy contributing to Colossal-AI?

  • 🌝 Yes, I do.
  • 🌚 No, I don't.

Tell us more if you don't enjoy contributing to Colossal-AI.

@eric8607242 eric8607242 changed the title [shardformer] Enable policy assignment in HybridParallelPlugin and enable llama policy for … [shardformer] Support customized policy for llamav2 based model Sep 5, 2023
@eric8607242 eric8607242 changed the title [shardformer] Support customized policy for llamav2 based model [shardformer] Support customized policy for llamav2 based model with HybridParallelPlugin Sep 5, 2023
@flybird11111 flybird11111 marked this pull request as draft September 5, 2023 08:37
@flybird11111 flybird11111 marked this pull request as ready for review September 5, 2023 08:37
class HybridParallelModule(ModelWrapper):

    def __init__(self, module: Module, precision: str, shard_config: ShardConfig, dp_group: ProcessGroup, use_ddp: bool,
                 ddp_config: dict) -> None:
Contributor

@flybird11111 flybird11111 Sep 5, 2023


Thank you for your contribution. Could you please make some changes here? It's best not to expose 'Policy' in the plugin; it's just a component of Shardformer.

Contributor Author


Hi,

Thanks for your review!

I currently have two candidate approaches.

  1. Remove the typing hint
  2. Change the typing hint from Policy to object

Which one do you think is better?

Contributor Author


Hi,

I have modified the code using the second approach (changing the type hint from Policy to object).
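A minimal sketch of what that approach looks like. The class and argument names here are illustrative assumptions, not the actual plugin code: the point is that the new argument is typed as a plain object, so the plugin's signature never exposes the Policy class from Shardformer.

```python
from typing import Optional

class HybridParallelPluginSketch:
    """Sketch of a plugin that accepts a custom policy without importing Policy."""

    def __init__(self, tp_size: int, pp_size: int,
                 custom_policy: Optional[object] = None) -> None:
        # The policy is typed as a plain `object` and simply stored;
        # the plugin forwards it to Shardformer and never inspects it,
        # so the Policy class stays an internal detail of Shardformer.
        self.tp_size = tp_size
        self.pp_size = pp_size
        self.custom_policy = custom_policy

# Default usage: no custom policy supplied.
plugin = HybridParallelPluginSketch(tp_size=2, pp_size=1)
print(plugin.custom_policy is None)  # prints True
```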
