[fx] hack __torch_dispatch__ for meta tensor and autograd. by super-dainiu · Pull Request #1515 · hpcaitech/ColossalAI

super-dainiu · 2022-08-29T07:37:48Z

What's new?

When I tried to run autograd with meta tensor inputs on vit_b_16, I found that some aten ops are not registered for the meta backend. Following the suggestions in "Function to automatically calculate Conv shape · Issue #79512 · pytorch/pytorch", I tried to patch native_layer_norm.default for the meta backend.

@register_meta(aten.native_layer_norm.default)
def meta_ln(
    input: torch.Tensor,
    normalized_shape, weight, bias, eps
):
    # mean and rstd keep the leading dims; each normalized dim collapses to 1
    n_lead = input.dim() - len(normalized_shape)
    stat_shape = list(input.shape[:n_lead]) + [1] * len(normalized_shape)

    output = torch.empty_like(input)
    mean = torch.empty(stat_shape, device='meta')
    rstd = torch.empty(stat_shape, device='meta')
    return output, mean, rstd

@register_meta(aten.native_layer_norm_backward.default)
def meta_ln_backward(
    dY: torch.Tensor,
    input: torch.Tensor,
    normalized_shape, mean, rstd, weight, bias, grad_input_mask
):
    # each gradient has the same shape as the tensor it belongs to
    dX = torch.empty_like(input)
    dgamma = torch.empty_like(weight)
    dbeta = torch.empty_like(bias)
    return dX, dgamma, dbeta
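
A meta kernel only has to produce outputs with the right shapes, so the shape rule is the part worth checking. The following is a minimal sketch in plain Python, assuming the layout used by PyTorch's CPU layer norm kernel (mean and rstd keep the leading dimensions, and every normalized dimension becomes 1). The helper name `layer_norm_meta_shapes` is hypothetical and is not part of this PR.

```python
from typing import List, Sequence, Tuple


def layer_norm_meta_shapes(
    input_shape: Sequence[int], normalized_shape: Sequence[int]
) -> Tuple[List[int], List[int]]:
    """Return (output_shape, stat_shape) for a layer norm forward pass.

    Hypothetical helper: it mirrors the shape rule only, no data is touched.
    """
    n_norm = len(normalized_shape)
    # normalized_shape must match the trailing dims of the input
    if n_norm == 0 or tuple(input_shape[-n_norm:]) != tuple(normalized_shape):
        raise ValueError("normalized_shape must match the trailing input dims")
    lead = list(input_shape[: len(input_shape) - n_norm])
    stat_shape = lead + [1] * n_norm  # shape shared by mean and rstd
    return list(input_shape), stat_shape


# ViT-style activations: (batch, tokens, hidden), normalized over hidden
out_shape, stat_shape = layer_norm_meta_shapes((8, 197, 768), (768,))
print(out_shape, stat_shape)
```

The backward kernel needs no such rule, since each gradient simply takes the shape of the tensor it belongs to.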

However, even when the patch registers successfully, the autograd dispatcher refuses to use my patched op for the meta backend:

RuntimeError: 0 INTERNAL ASSERT FAILED at "../aten/src/ATen/core/boxing/KernelFunction.cpp":23, please report a bug to PyTorch. aten::native_layer_norm has kernels registered to both CompositeImplicitAutograd and a backend mapped to AutogradOther. This makes the backend kernel unreachable; the dispatcher will always prefer the CompositeImplicitAutograd lowering (see Note [Ambiguity in AutogradOther kernel]). If you want to override CompositeImplicitAutograd, please open an issue to request a dedicated Autograd dispatch key for the backend.   
If you only want to run inference instead of training, add `c10::InferenceMode mode;` before model.forward(). Note this guard is only available in C++ but not Python at present.

As discussed in "CompositeImplicitAutograd operators should not perform operations that do not dispatch · Issue #61669 · pytorch/pytorch", this CompositeImplicitAutograd failure is unavoidable on PyTorch 1.12.0 and below. So I developed another way to run autograd with meta tensors.

import torch
from torch.utils._pytree import tree_map


class MetaTensor(torch.Tensor):
    """Wrapper subclass that reports device='cpu' but holds a meta tensor."""

    elem: torch.Tensor

    __slots__ = ['elem']

    @staticmethod
    def __new__(cls, elem):
        r = torch.Tensor._make_wrapper_subclass(
            cls, elem.size(),
            strides=elem.stride(), storage_offset=elem.storage_offset(),
            dtype=elem.dtype, layout=elem.layout,
            device='cpu', requires_grad=elem.requires_grad
        )    # deceive the frontend for aten selections
        r.elem = elem
        return r

    @classmethod
    def __torch_dispatch__(cls, func, types, args=(), kwargs=None):
        kwargs = kwargs or {}    # kwargs may be None, and ** needs a mapping

        def unwrap(x):
            # run the actual op on the meta backend
            return x.elem.to('meta') if isinstance(x, MetaTensor) else x

        args = tree_map(unwrap, args)
        kwargs = tree_map(unwrap, kwargs)
        out = func(*args, **kwargs)

        def wrap(x):
            # rewrap tensor outputs so that later ops dispatch here again
            return MetaTensor(x) if isinstance(x, torch.Tensor) else x

        return tree_map(wrap, out)
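
Stripped of the PyTorch specifics, the `__torch_dispatch__` body above is an unwrap, call, rewrap pattern over nested arguments. The sketch below shows that flow in plain Python. `Wrapped`, `dispatch` and the toy `tree_map` are hypothetical stand-ins written for illustration; they are not the `torch.utils._pytree` implementation and are not part of this PR.

```python
def tree_map(fn, tree):
    """Toy stand-in for a pytree map: apply fn to every leaf of a nested structure."""
    if isinstance(tree, (list, tuple)):
        return type(tree)(tree_map(fn, t) for t in tree)
    if isinstance(tree, dict):
        return {k: tree_map(fn, v) for k, v in tree.items()}
    return fn(tree)


class Wrapped:
    """Hypothetical wrapper that carries a plain value in .elem."""

    def __init__(self, elem):
        self.elem = elem


def dispatch(func, args=(), kwargs=None):
    kwargs = kwargs or {}  # guard: **None would raise a TypeError

    def unwrap(x):
        return x.elem if isinstance(x, Wrapped) else x

    def wrap(x):
        # only rewrap the payload type (int here), leave everything else alone
        return Wrapped(x) if isinstance(x, int) else x

    out = func(*tree_map(unwrap, args), **tree_map(unwrap, kwargs))
    return tree_map(wrap, out)


# divmod returns a tuple, so both outputs come back wrapped
q, r = dispatch(divmod, (Wrapped(7), 2))
print(q.elem, r.elem)
```

Rewrapping every output is what keeps the chain going: the next op receives a wrapper again and therefore goes through the same hook.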

Since the PyTorch team has already added meta backend support for many aten ops, we can simply hack the autograd dispatcher by making it believe that we are running on CPU. The dispatcher then no longer picks CompositeImplicitAutograd, and our patched ops are used for the meta backend. Now we can run forward and backward with meta tensors only, and trace a large model with batch_size=1e10 in milliseconds.

import torch
from torchvision.models import vit_b_16

model = vit_b_16()
data = MetaTensor(torch.rand(int(1e10), 3, 224, 224, device='meta'))
model.to('meta')(data).sum().backward()
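
To see why this is only feasible on the meta backend, it helps to work out how large that input would be if it were materialized. A back-of-the-envelope sketch, assuming the default float32 dtype of torch.rand (4 bytes per element):

```python
import math

# Shape of the input used above: batch of 1e10 RGB images at 224x224
shape = (int(1e10), 3, 224, 224)
bytes_per_element = 4  # assumption: float32, the default dtype of torch.rand

numel = math.prod(shape)
nbytes = numel * bytes_per_element

print(f"{numel:.3e} elements, {nbytes / 1e15:.2f} PB")
```

The input alone would take roughly 6 PB, so shapes and dtypes are the only things that can realistically be propagated at this batch size.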

With this amazing __torch_dispatch__, I replaced the previously patched tracing with real meta tracing.

Concerns

Note that __torch_dispatch__ is not compatible with PyTorch 1.11.0 and below.

Cypher30

Great work! As we discussed, you could add MetaTensor to the proxy and try to record the aten ops inside the MetaTensor data structure~

Cypher30

We could pass this PR first~

FrankLeeeee

You should fix the broken unit tests before merging this PR.

super-dainiu added 3 commits August 29, 2022 15:25

[fx] hack __torch_dispatch__ for meta tensor and autograd.

c7168cc

[fx] hack __torch_dispatch__ for meta tensor and autograd.

f257bde

[fx] hack __torch_dispatch__ for meta tensor and autograd.

b0c4393

super-dainiu requested review from Cypher30, FrankLeeeee and YuliangLiu0306 August 29, 2022 07:38

super-dainiu commented Aug 29, 2022

Comment thread colossalai/fx/profiler/__init__.py

super-dainiu commented Aug 29, 2022

Comment thread colossalai/fx/passes/meta_info_prop.py

[fx] hack __torch_dispatch__ for meta tensor and autograd.

10e3a95

super-dainiu added the Run Build and Test label Aug 29, 2022

super-dainiu added 3 commits August 29, 2022 16:00

[fx] hack __torch_dispatch__ for meta tensor and autograd.

c151432

[fx] add bad case detections.

aec0fe6

[fx] add bad case detections.

b7e2c0b

Cypher30 reviewed Aug 30, 2022

Cypher30 approved these changes Aug 30, 2022

super-dainiu and others added 2 commits August 30, 2022 13:23

Merge branch 'hpcaitech:main' into feature/meta_profiler

80ee0ab

[fx] rename MetaTensor attributes.

973fb58

FrankLeeeee suggested changes Aug 30, 2022

FrankLeeeee reviewed Aug 30, 2022

Comment thread colossalai/fx/profiler/meta_tensor.py Outdated

FrankLeeeee reviewed Aug 30, 2022

Comment thread colossalai/fx/profiler/__init__.py Outdated

FrankLeeeee reviewed Aug 30, 2022

Comment thread colossalai/fx/profiler/_meta_registrations.py

FrankLeeeee reviewed Aug 30, 2022

Comment thread colossalai/fx/profiler/meta_tensor.py

super-dainiu added 5 commits August 30, 2022 15:04

[fx] fix unexpected error.

0536f71

[fx] fix unexpected error.

f832e7d

[fx] fix unexpected error.

f8a8001

[fx] fix unexpected error.

30a85b3

[fx] fix unexpected error.

fae158e

super-dainiu requested a review from FrankLeeeee August 30, 2022 07:17

super-dainiu and others added 2 commits August 30, 2022 21:46

Merge branch 'hpcaitech:main' into feature/meta_profiler

ba26994

[fx] add register backward for native_batch_norm_backward.

1d05e52

super-dainiu added 3 commits August 31, 2022 13:39

[fx] add more meta backend support for nn.Modules.

0b2f247

[fx] add meta backend to support timm and torchvision models.

9889b86

[fx] add meta hardswish for timm models.

e3d866b

FrankLeeeee approved these changes Aug 31, 2022

FrankLeeeee merged commit 5cc849f into hpcaitech:main Aug 31, 2022

super-dainiu mentioned this pull request Sep 2, 2022

[fx] support meta tracing for aten level computation graphs like functorch. #1536

Merged

Cypher30 mentioned this pull request Feb 6, 2023

[FEATURE]: Meta information patch for torch.matmul #2582

Closed

FrankLeeeee merged 19 commits into hpcaitech:main from super-dainiu:feature/meta_profiler

3 participants
