removed restrictions for custom optimizer#161

Merged
ShadenSmith merged 3 commits into deepspeedai:master from CalogeroZarbo:custom_optimizer
Mar 22, 2020

Conversation

@CalogeroZarbo
Contributor

Removed the restriction so that any optimizer of choice can be used. A warning message has been added.

@ShadenSmith ShadenSmith requested a review from tjruwase March 22, 2020 17:54
@tjruwase
Contributor

Looks good

@ShadenSmith ShadenSmith merged commit ac9cc7f into deepspeedai:master Mar 22, 2020
@ShadenSmith
Contributor

Thanks for your contribution to DeepSpeed!

@CalogeroZarbo
Contributor Author

Thank you too for this amazing lib!

@CalogeroZarbo CalogeroZarbo deleted the custom_optimizer branch March 23, 2020 08:25
@ShadenSmith ShadenSmith linked an issue Mar 25, 2020 that may be closed by this pull request
@ShadenSmith
Contributor

I was thinking this over some more and I'm worried that the lone warning is easy to miss in a typically-long logfile. That could lead to some "silent" divergence errors.

What if we keep the warning, but require an additional config in the JSON such as "zero_allow_untested_optimizer" : true in order to "unlock" the untested optimizer? If the flag is not set, we raise an error and provide instructions and the appropriate warnings.
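The proposed gating could be sketched roughly as follows. This is a hypothetical illustration, not DeepSpeed's actual implementation: the `ZERO_SUPPORTED_OPTIMIZERS` set, the function name, and the flat config-key lookup are assumptions; only the `"zero_allow_untested_optimizer"` key comes from the discussion above.

```python
import logging

# Hypothetical set of optimizers already validated with ZeRO (assumption
# for illustration; the real list lives in DeepSpeed's source).
ZERO_SUPPORTED_OPTIMIZERS = {"Adam", "FusedAdam", "FusedLamb"}

def check_zero_optimizer(config: dict, optimizer_name: str) -> None:
    """Raise unless the optimizer is tested or explicitly unlocked."""
    if optimizer_name in ZERO_SUPPORTED_OPTIMIZERS:
        return
    if config.get("zero_allow_untested_optimizer", False):
        # Flag set: allow the untested optimizer but keep the warning.
        logging.warning(
            "Optimizer %s has not been tested with ZeRO; proceed with care.",
            optimizer_name,
        )
        return
    # Flag not set: fail loudly with instructions instead of a
    # warning that is easy to miss in a long logfile.
    raise ValueError(
        f"Optimizer {optimizer_name} is untested with ZeRO. Set "
        '"zero_allow_untested_optimizer": true in the DeepSpeed JSON '
        "config to override."
    )
```

The key design point is the one raised above: an error is impossible to miss, while the unlock flag still lets users opt in deliberately.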

@CalogeroZarbo
Contributor Author

Yep @ShadenSmith, that seems to me like a very good approach to avoid problems that are difficult to debug in a complex training system. I'll fork again and make another pull request, if that's OK with you. Let me know.

Cheers,
Cal

@tjruwase
Contributor

@CalogeroZarbo this sounds good. Thanks!

@ShadenSmith
Contributor

Perfect, thanks @CalogeroZarbo !

kouml pushed a commit to kouml/DeepSpeed that referenced this pull request Apr 3, 2020
jeffra pushed a commit to jeffra/DeepSpeed that referenced this pull request Aug 25, 2021
* Add optimizer swapping

* Swap fp16 params to nvme

* Formatting

* Address review feedback

* License file
@FWkey

FWkey commented Nov 1, 2023

What are the possible errors if I use a custom optimizer? And how can I ensure its correctness, should I run some tests?

@tjruwase
Contributor

tjruwase commented Nov 1, 2023

@FWkey, this is a tricky question. You could compare the training loss of a smaller model under PyTorch DDP vs. ZeRO.
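The suggested check could be sketched as below: train the same small model twice, once with vanilla PyTorch DDP and once with DeepSpeed ZeRO, log the per-step training losses, and compare the two curves within a tolerance. The helper name, the tolerance, and the dummy loss values are assumptions for illustration; the real inputs would come from actual training runs.

```python
def losses_match(ddp_losses, zero_losses, rel_tol=0.05):
    """Return True if the two loss curves agree step-by-step within rel_tol."""
    if len(ddp_losses) != len(zero_losses):
        return False
    return all(
        abs(a - b) <= rel_tol * max(abs(a), abs(b), 1e-12)
        for a, b in zip(ddp_losses, zero_losses)
    )

# Dummy curves standing in for logged per-step training losses
# (assumption; replace with losses from the two real runs).
ddp_run = [2.30, 1.95, 1.60, 1.32]
zero_run = [2.31, 1.94, 1.62, 1.33]
print(losses_match(ddp_run, zero_run))  # → True (curves agree within 5%)
```

A step-wise comparison like this catches "silent" divergence early; a diverging custom optimizer would drift away from the DDP baseline within a few steps.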


Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

ZeRO & Custom Optimizer (RangerLars)

4 participants