System Info
I'm using the latest transformers from source (newer than the latest release tag, 4.44.2). The changes to pytorch_utils in this PR add nn.RMSNorm to the list of modules, but nn.RMSNorm was only added to torch in the 2.4 release. This causes CI failures with DeepSpeed on older torch versions unless we either update torch or pin the transformers version.
Who can help?
@muellerzr
Information
Tasks
Reproduction
Clone the latest DeepSpeed or run CI from the hpu_gaudi2.yml workflow; failure here.
Expected behavior
Raise a clear error when a feature requires a torch version that isn't installed, or handle the missing attribute in some similar way.