
[shardformer] update llama2 #5499

Merged
wangbluo merged 6 commits into hpcaitech:feature/update-transformers from wangbluo:update_llama2
Mar 26, 2024

@wangbluo wangbluo commented Mar 25, 2024

🚨 Issue number

📝 What does this PR do?

[shardformer/modeling/llama2]: Upgrade transformers from version 4.33.0 to 4.36.0 for the Llama2 model. The changes cover requirements-test.txt and the llama_model_forward, llama_for_causal_lm_forward, llama_for_sequence_classification_forward, and get_llama_flash_attention_forward functions.

@wangbluo wangbluo requested a review from a team as a code owner March 25, 2024 06:01
Comment thread colossalai/shardformer/modeling/llama.py Outdated

ver217 commented Mar 25, 2024

Please add transformers==4.36.0 to requirements.txt and remove transformers==4.33.0 from requirements-test.txt.
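
A minimal sketch of how the requested pin change could be checked, assuming the requirements files use plain `package==version` lines. The helper `find_pin` and the sample file contents are hypothetical illustrations, not code or file contents from this PR.

```python
import re


def find_pin(requirements_text: str, package: str):
    """Return the version pinned with `==` for `package`, or None if absent."""
    # Match lines such as "transformers==4.36.0", allowing optional whitespace.
    pattern = re.compile(
        rf"^\s*{re.escape(package)}\s*==\s*([^\s#;]+)", re.IGNORECASE
    )
    for line in requirements_text.splitlines():
        match = pattern.match(line)
        if match:
            return match.group(1)
    return None


# Hypothetical file contents after applying the review comment.
requirements_txt = "numpy\ntransformers==4.36.0\n"
requirements_test_txt = "pytest\ntorch\n"

assert find_pin(requirements_txt, "transformers") == "4.36.0"
assert find_pin(requirements_test_txt, "transformers") is None
```

In a real checkout the two strings would be read from the actual files; only exact `==` pins are recognized by this sketch.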

@wangbluo wangbluo merged commit cdb166c into hpcaitech:feature/update-transformers Mar 26, 2024
@wangbluo wangbluo changed the title Update llama2 [shardformer] update llama2 Mar 26, 2024
@wangbluo wangbluo deleted the update_llama2 branch August 17, 2024 09:41

2 participants