[BUG]: Llama2 HybridParallelPlugin train failed when pp_size>1 #4705

@xs1997zju

Description

🐛 Describe the bug

Llama2 HybridParallelPlugin train failed when pp_size>1
We modified the Llama training example to use the HybridParallelPlugin, but training fails with the following error:
`hidden_states` is None at colossalai/shardformer/modeling/llama.py:61 in `llama_model_forward`.
[screenshot of the traceback]
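For context, a minimal sketch of the failure mode (illustrative only; the function and variable names below are stand-ins, not ColossalAI's actual API): under pipeline parallelism, only the first stage embeds `input_ids`, and every later stage must receive `hidden_states` from the previous stage. If the schedule fails to pass them along, the non-first stage sees `hidden_states=None` and the forward crashes, matching the error above.

```python
def stage_forward(stage_index, input_ids=None, hidden_states=None):
    """Toy pipeline stage: stage 0 embeds tokens, later stages consume
    the previous stage's hidden states."""
    if stage_index == 0:
        # first stage: embed the token ids (stand-in for the real embedding)
        hidden_states = [float(t) for t in input_ids]
    elif hidden_states is None:
        # this is the failure mode reported in this issue
        raise ValueError("hidden_states is None on a non-first pipeline stage")
    # stand-in for this stage's transformer layers
    return [h + 1.0 for h in hidden_states]

# correct hand-off: stage 0's output feeds stage 1
out0 = stage_forward(0, input_ids=[1, 2, 3])   # [2.0, 3.0, 4.0]
out1 = stage_forward(1, hidden_states=out0)    # [3.0, 4.0, 5.0]
```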

Environment

torch 1.13.1+cu117, transformers 4.32.0, ColossalAI release v0.3.2

Labels: bug (Something isn't working)
