
LayerNorm OP draft #1186

Merged
reyna-abhyankar merged 15 commits into flexflow:repo-refactor from lambda7xx:repo-refactor-lambda-layernorm
Feb 8, 2024

Conversation

Contributor

@lambda7xx lambda7xx commented Oct 9, 2023

Description of changes:

  • How to handle int64_t effective_batch_size and effective_num_elements in LayerNormPerDeviceState
  • How to handle gamma and beta in forward and backward
  • Leaves some comments on open questions



@lambda7xx lambda7xx self-assigned this Oct 9, 2023
@lambda7xx lambda7xx requested review from lockshaw and reyna-abhyankar and removed request for lockshaw October 9, 2023 19:36
@lambda7xx lambda7xx changed the title from "layernorm draft" to "LayerNorm OP draft" Oct 9, 2023
@lambda7xx lambda7xx mentioned this pull request Oct 10, 2023
@lambda7xx lambda7xx requested review from lockshaw and wmdi October 26, 2023 01:20
@lockshaw lockshaw removed request for lockshaw and wmdi January 19, 2024 09:17
Collaborator

@reyna-abhyankar reyna-abhyankar left a comment


Can you apply the format script?

Contributor Author

@lambda7xx

ok

Contributor Author

@lambda7xx lambda7xx left a comment


Reviewable status: 0 of 5 files reviewed, 17 unresolved discussions (waiting on @lambda7xx and @reyna-abhyankar)


lib/runtime/src/ops/layer_norm.cc line 0 at r1 (raw file):

Previously, reyna-abhyankar (Reyna Abhyankar) wrote…


Keep these

using Legion::Context;
using Legion::PhysicalRegion;
using Legion::Runtime;
using Legion::Task;

Done.


lib/runtime/src/ops/layer_norm.cc line 0 at r1 (raw file):

Previously, reyna-abhyankar (Reyna Abhyankar) wrote…


@lambda7xx this is the original code that sets effective_num_elements and effective_batch_size. Seems like we can get this from input

Done.


lib/runtime/src/ops/layer_norm.cc line 0 at r1 (raw file):

Previously, reyna-abhyankar (Reyna Abhyankar) wrote…

Split the signature out of register task (see other merged PRs)

Done.


lib/runtime/src/ops/layer_norm.cc line 43 at r1 (raw file):

Previously, reyna-abhyankar (Reyna Abhyankar) wrote…

I think this is correct since gamma and beta are parameters

Done.


lib/runtime/src/ops/layer_norm.cc line 60 at r1 (raw file):

Previously, reyna-abhyankar (Reyna Abhyankar) wrote…

Based on original permissions, I think it's RW and for backward only the grads are RW

Done.


lib/runtime/src/ops/layer_norm.cc line 144 at r1 (raw file):

Previously, reyna-abhyankar (Reyna Abhyankar) wrote…
                                  InputParallelTensorDesc const &input_shape,

I think we should use InputParallelTensorDesc for input shape always because it also has the IsTrainable

Done.


lib/runtime/src/ops/layer_norm.cc line 162 at r1 (raw file):

Previously, reyna-abhyankar (Reyna Abhyankar) wrote…

I think it's the same shape for both

Done.


lib/runtime/src/ops/layer_norm.cc line 195 at r1 (raw file):

Previously, reyna-abhyankar (Reyna Abhyankar) wrote…
  fwd.add_weight_slot(GAMMA);
  fwd.add_weight_slot(BETA);

Done.


lib/runtime/src/ops/layer_norm.cc line 138 at r2 (raw file):

Previously, reyna-abhyankar (Reyna Abhyankar) wrote…

In the original code I don't think it's divided by num_replicas to get effective_batch_size. Just inputs->volume() / M

Done.



lib/runtime/src/ops/layer_norm.cc line 141 at r2 (raw file):

Previously, reyna-abhyankar (Reyna Abhyankar) wrote…
  effective_num_elements = M;
  effective_batch_size = input.shape.get_volume() / M;

Done.
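
The arithmetic in the suggestion above can be sketched as follows. This is a hedged stand-in, not the FlexFlow API: Shape, LayerNormSizes, and compute_sizes are hypothetical names, with Shape::volume() standing in for input.shape.get_volume(). M is the element count per normalization group, and the effective batch size is just the input volume divided by M, with no division by num_replicas:

```cpp
#include <cassert>
#include <cstdint>
#include <functional>
#include <numeric>
#include <vector>

// Hypothetical stand-in for the input tensor shape; in the PR this role
// is played by the parallel tensor shape and input.shape.get_volume().
struct Shape {
  std::vector<int64_t> dims;
  int64_t volume() const {
    return std::accumulate(dims.begin(), dims.end(), int64_t{1},
                           std::multiplies<int64_t>());
  }
};

struct LayerNormSizes {
  int64_t effective_num_elements;
  int64_t effective_batch_size;
};

// M = product of the normalized axes; everything else folds into the
// effective batch size: volume / M (no num_replicas in the divisor).
LayerNormSizes compute_sizes(Shape const &input, int64_t M) {
  assert(input.volume() % M == 0);
  return LayerNormSizes{M, input.volume() / M};
}
```

For an input of shape (8, 16, 64) normalized over the last axis (M = 64), this gives effective_batch_size = 8 * 16 = 128.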


lib/runtime/src/ops/layer_norm.cc line 187 at r2 (raw file):

Previously, reyna-abhyankar (Reyna Abhyankar) wrote…
fwd_binding.bind(GAMMA, input_shape);
fwd_binding.bind(BETA, input_shape);

Done.
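
A minimal sketch of the slot-binding pattern the snippet above uses, under stated assumptions: Slot, TensorShape, and TaskBinding here are hypothetical stand-ins for FlexFlow's binding machinery, shown only to illustrate that GAMMA and BETA are bound to the same shape (per the review, the same input_shape for both):

```cpp
#include <cassert>
#include <cstdint>
#include <map>

// Hypothetical slot names; the real ones live in the op's task signature.
enum Slot { INPUT, OUTPUT, GAMMA, BETA };

struct TensorShape {
  int64_t volume;
};

// Stand-in for the fwd/bwd binding object: each bind() call attaches a
// concrete tensor shape to a declared slot.
struct TaskBinding {
  std::map<Slot, TensorShape> slots;
  void bind(Slot s, TensorShape const &shape) { slots[s] = shape; }
};
```

Usage mirrors the quoted suggestion: fwd_binding.bind(GAMMA, input_shape); fwd_binding.bind(BETA, input_shape);, so both weight slots end up with identical shapes.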


lib/runtime/src/ops/layer_norm.cc line 217 at r4 (raw file):

Previously, reyna-abhyankar (Reyna Abhyankar) wrote…
OpTaskSignature bwd_signature<LAYERNORM_BWD_TASK_ID>()  {

Done.


lib/runtime/src/ops/layer_norm.cc line 223 at r4 (raw file):

Previously, reyna-abhyankar (Reyna Abhyankar) wrote…
OpTaskSignature init_signature<LAYERNORM_INIT_TASK_ID>()  {

Done.


lib/runtime/src/ops/layer_norm.cc line 236 at r4 (raw file):

Previously, reyna-abhyankar (Reyna Abhyankar) wrote…
  register_task(LAYERNORM_INIT_TASK_ID, "LayerNorm init", init_signature<LAYERNORM_INIT_TASK_ID>(), init_task);

Done.


lib/runtime/src/ops/layer_norm.cc line 247 at r4 (raw file):

Previously, reyna-abhyankar (Reyna Abhyankar) wrote…
      LAYERNORM_BWD_TASK_ID, "LayerNorm backward", bwd_signature<LAYERNORM_BWD_TASK_ID>(), backward_task);

Done.
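
The registration pattern suggested across these threads can be sketched as below. All names here (OpTaskSignature's fields, task_registry, register_task's exact shape) are assumptions for illustration, not FlexFlow's real definitions; the point is the structure: one templated signature function explicitly specialized per task ID, passed to register_task alongside the task name:

```cpp
#include <cassert>
#include <map>
#include <string>
#include <utility>

enum TaskId { LAYERNORM_INIT_TASK_ID, LAYERNORM_FWD_TASK_ID, LAYERNORM_BWD_TASK_ID };

// Hypothetical stand-in for FlexFlow's OpTaskSignature.
struct OpTaskSignature {
  bool is_init = false;
  bool is_bwd = false;
};

// Primary templates declared once, then explicitly specialized per task
// ID, mirroring init_signature<LAYERNORM_INIT_TASK_ID>() in the review.
template <TaskId ID> OpTaskSignature init_signature();
template <TaskId ID> OpTaskSignature bwd_signature();

template <> OpTaskSignature init_signature<LAYERNORM_INIT_TASK_ID>() {
  OpTaskSignature sig;
  sig.is_init = true;
  return sig;
}

template <> OpTaskSignature bwd_signature<LAYERNORM_BWD_TASK_ID>() {
  OpTaskSignature sig;
  sig.is_bwd = true;
  return sig;
}

// Toy registry standing in for the runtime's task table.
std::map<TaskId, std::pair<std::string, OpTaskSignature>> task_registry;

void register_task(TaskId id, std::string const &name,
                   OpTaskSignature const &sig) {
  task_registry[id] = {name, sig};
}
```

With this shape, the calls in the review read naturally: register_task(LAYERNORM_INIT_TASK_ID, "LayerNorm init", init_signature<LAYERNORM_INIT_TASK_ID>(), init_task), keeping each task's signature next to its ID rather than inlined in the registration call.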


deps/fmt line 0 at r2 (raw file):

Previously, reyna-abhyankar (Reyna Abhyankar) wrote…

Use most updated submodules

Done.

@reyna-abhyankar reyna-abhyankar self-requested a review February 7, 2024 23:58
