[VM] Memory alignment check for `set_input` in Virtual Machine #11391

KJlaccHoeUM9l · 2022-05-20T14:03:00Z

A previous PR added the ability to skip copying data when creating input NDArray tensors if the source input is in DLTensor format.

However, that change did not check memory alignment, as is done for GraphExecutor.
As a result, runtime errors (`Segmentation fault (core dumped)`) are possible, because TVM assumes aligned memory.

This PR adds this check.
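To make the check concrete, here is a minimal sketch of an alignment test on an external buffer. It is not the code from this PR: `TensorView`, `IsAligned`, and `kAssumedAlignment` are hypothetical names, and the value 64 is an assumption standing in for `tvm::runtime::kAllocAlignment`. The `byte_offset` is included in the address because the tensor's first element starts at `data + byte_offset`.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical stand-in for the two DLTensor fields that matter for alignment.
struct TensorView {
  void* data;                 // base pointer of the allocation
  std::uint64_t byte_offset;  // offset of the first element from `data`
};

// Assumed value for illustration only; the real constant lives in the TVM runtime.
constexpr std::size_t kAssumedAlignment = 64;

// Returns true when the address of the first element is a multiple of `alignment`.
inline bool IsAligned(const TensorView& t, std::size_t alignment = kAssumedAlignment) {
  assert(alignment != 0);
  const auto addr = reinterpret_cast<std::uintptr_t>(t.data) + t.byte_offset;
  return addr % alignment == 0;
}
```

Note that an unaligned base pointer combined with a suitable `byte_offset` can still yield an aligned element address, which is why both fields are summed before the test.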

tmoreau89 · 2022-05-23T14:49:39Z

CC @tkonolige @mbs-octoml

tkonolige · 2022-05-23T16:34:07Z

include/tvm/runtime/ndarray.h

+   * If AbilityOfZeroCopyForDLTensor is true a NDArray is created
+   * using the memory allocated by an external source.
+   * Responsibility for memory retaining lies with the external source.
+   * Otherwise new NDArray is created, the data is copied from the DLTensor.


This says data will be copied but the \brief says data will not be copied. Which is it? My reading of the code below is that data will be copied if AbilityOfZeroCopyForDLTensor is false.

I don't think we should change the semantics of this function. How about we just raise an error if the AbilityOfZeroCopyForDLTensor is false?

vvchernov · 2022-05-23

No, we should not raise an error, for the following reason. This PR develops the `set_input` method of VirtualMachine, which has two parts: one handles input given as an NDArray, the other handles input given as a DLTensor. In both cases it tries zero copy when possible and falls back to a real copy otherwise. This was implemented first for NDArray (which automatically has no alignment problem) and then extended to DLTensor in the previous PR, but without checking alignment. If we cannot use zero copy, we should still fall back to the usual copy so that the method does not fail. The VM has no separate `set_input` and `set_input_zero_copy` methods. I discussed this on the previous PR; it is the design of the VM. It means that `set_input` should work reliably both with and without copying.
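The "zero copy if possible, otherwise copy" behavior described above can be sketched as follows. This is an illustration under assumptions, not the VirtualMachine implementation: `BoundInput` and `BindInput` are hypothetical names, the buffer is treated as raw bytes, and the default alignment of 64 is assumed.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Hypothetical result of binding an input: either a borrowed pointer
// (zero copy) or a pointer into storage owned by this object (copied).
struct BoundInput {
  BoundInput() = default;
  BoundInput(BoundInput&&) = default;
  BoundInput(const BoundInput&) = delete;  // `data` may point into `storage`

  const unsigned char* data = nullptr;
  std::vector<unsigned char> storage;  // non-empty only when a copy was made
  bool zero_copy() const { return storage.empty(); }
};

// Borrow `src` when it is already aligned; otherwise copy it into an
// over-allocated buffer at the first suitably aligned position.
inline BoundInput BindInput(const unsigned char* src, std::size_t nbytes,
                            std::size_t alignment = 64) {
  assert(alignment != 0);
  BoundInput out;
  if (reinterpret_cast<std::uintptr_t>(src) % alignment == 0) {
    out.data = src;  // zero copy: the caller keeps ownership of the memory
    return out;
  }
  out.storage.resize(nbytes + alignment);
  const auto base = reinterpret_cast<std::uintptr_t>(out.storage.data());
  const std::size_t pad = (alignment - base % alignment) % alignment;
  unsigned char* dst = out.storage.data() + pad;
  std::memcpy(dst, src, nbytes);
  out.data = dst;
  return out;
}
```

In the zero-copy branch the external source stays responsible for keeping the memory alive, which matches the ownership note in the header comment under review.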

tkonolige · 2022-05-23

What you are proposing here is a change to the semantics of FromExternalDLTensor. Because this is a public and important API I suggest you do not change it and instead put the logic into set_input.

If NDArray needs its underlying DLTensor to be aligned, then we should also add a check to FromExternalDLTensor to make sure that the underlying data is aligned.

@tqchen maybe you can provide some feedback here.

tqchen · 2022-05-26

I agree that FromExternalDLTensor in this case should check alignment.

vvchernov · 2022-05-24T21:53:59Z

Hello @tkonolige and @tqchen! I've updated the public API; its behavior should now be more correct.

tkonolige · 2022-05-25T17:10:48Z

@vvchernov I see that you are still changing the semantics of FromExternalDLTensor. `FromExternalDLTensor(DLTensor* dl_tensor, const Device& dst_dev)` will copy if the alignment is wrong, but `FromExternalDLTensor(const DLTensor& dl_tensor)` will throw an error if the alignment is wrong. I think this is pretty confusing, as they both have the same name. Can you make `FromExternalDLTensor(DLTensor* dl_tensor, const Device& dst_dev)` fail if the alignment is wrong, and instead move the copying logic into the VM?
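A strict, never-copying wrapper of the kind requested here might look like the sketch below. All names (`ExternalTensor`, `IsCompact`, `WrapExternalOrThrow`) are hypothetical, the default alignment of 64 is assumed, and strides are taken to be in elements with an empty vector meaning compact row-major, mirroring how DLPack treats null strides. Any fallback copy would then be the caller's decision.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <stdexcept>
#include <vector>

// Hypothetical, simplified tensor descriptor (not the real DLTensor).
struct ExternalTensor {
  void* data;
  std::uint64_t byte_offset;
  std::vector<std::int64_t> shape;
  std::vector<std::int64_t> strides;  // in elements; empty means compact row-major
};

// True when the strides describe a dense row-major layout.
inline bool IsCompact(const ExternalTensor& t) {
  if (t.strides.empty()) return true;
  if (t.strides.size() != t.shape.size()) return false;
  std::int64_t expected = 1;
  for (std::size_t i = t.shape.size(); i-- > 0;) {
    // A dimension of extent 1 may carry any stride without changing the layout.
    if (t.shape[i] != 1 && t.strides[i] != expected) return false;
    expected *= t.shape[i];
  }
  return true;
}

// Strict wrapper: returns the element pointer or throws. It never copies.
inline void* WrapExternalOrThrow(const ExternalTensor& t, std::size_t alignment = 64) {
  assert(alignment != 0);
  if (!IsCompact(t)) throw std::invalid_argument("external tensor is not compact");
  const auto addr = reinterpret_cast<std::uintptr_t>(t.data) + t.byte_offset;
  if (addr % alignment != 0) throw std::invalid_argument("external tensor is not aligned");
  return reinterpret_cast<void*>(addr);
}
```

Keeping the failure mode uniform across overloads means a caller such as the VM can test the conditions first and choose to copy, rather than relying on one overload silently copying.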

vvchernov · 2022-05-26T14:54:45Z

Hello @tkonolige! I've removed the `FromExternalDLTensor(DLTensor* dl_tensor, const Device& dst_dev)` method from the NDArray class and moved its logic into VirtualMachine, where it is actually used.

tkonolige

Thanks @KJlaccHoeUM9l!

vvchernov · 2022-05-27T14:24:25Z

Hello @tkonolige! CI tests passed successfully. Could you approve it?

tkonolige · 2022-05-27T15:51:38Z

@vvchernov I have approved it :). Unfortunately I am not a committer. You'll need someone who is to approve it.

tmoreau89

LGTM, thanks for the review @tkonolige

Prior to this commit, any use of `tvm.nd.from_dlpack` to create a strided `NDArray`, or an `NDArray` whose alignment was less than `tvm::runtime::kAllocAlignment`, would raise an error. As a result, views into larger arrays, which are unlikely to be aligned and compact, could only be shared when copied into an aligned and compact buffer.

This commit moves the compact/aligned check from the `NDArray` class into the generated TIR code as part of DLTensor unpacking. These checks were initially introduced in apache#11391, to avoid segfaults caused by use of non-aligned buffers in code intended for aligned buffers. The new checks provide the same safeguard, as the alignment is checked prior to use, but allow the alignment requirement to be relaxed on a per-buffer basis.

This approach also removes a potential bug resulting from compile-time configuration of `tvm::runtime::kAllocAlignment`, first introduced in apache#13307. Since TVM supports cross-compiling, the installation of TVM used to compile a kernel may assume a larger value of `kAllocAlignment` than is provided by the runtime installation of TVM. By validating the alignment within the generated kernel, rather than as part of the runtime, this potential inconsistency would be caught.

This check is also restricted to targets whose `void*` opaque pointer can be interpreted as a pointer to the data array (e.g. no such check applies on Vulkan, as the `void*` is a pointer to a struct that contains additional bookkeeping).
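The idea of a per-buffer alignment requirement, validated by the kernel rather than by the runtime, can be illustrated in plain C++. This is only a conceptual sketch: the actual check is emitted as TIR during DLTensor unpacking, and `BufferRequirement` and `FirstMisaligned` are hypothetical names introduced here.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>
#include <vector>

// Hypothetical requirement that a compiled kernel carries for one argument.
struct BufferRequirement {
  std::string name;
  std::size_t alignment;  // in bytes; 1 means no alignment requirement
};

// Returns the name of the first argument that violates its own requirement,
// or an empty string when every argument is acceptable.
inline std::string FirstMisaligned(const std::vector<BufferRequirement>& reqs,
                                   const std::vector<const void*>& args) {
  assert(reqs.size() == args.size());
  for (std::size_t i = 0; i < reqs.size(); ++i) {
    assert(reqs[i].alignment != 0);
    if (reinterpret_cast<std::uintptr_t>(args[i]) % reqs[i].alignment != 0) {
      return reqs[i].name;
    }
  }
  return std::string();
}
```

Because each requirement travels with the kernel, the value used at compile time is the one enforced at call time, regardless of what the runtime installation assumes.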

KJlaccHoeUM9l force-pushed the agladyshev/dev branch from bba5b16 to e906203 Compare May 23, 2022 14:49

KJlaccHoeUM9l force-pushed the agladyshev/dev branch from e906203 to c7e352d Compare May 23, 2022 14:52

tkonolige requested changes May 23, 2022

KJlaccHoeUM9l changed the title ~~[VM] Memory alignment check for set_input_zero_copy~~ [VM] Memory alignment check for set_input in Virtual Machine May 24, 2022

KJlaccHoeUM9l and others added 10 commits May 26, 2022 17:45

add memory alignment check

f2099ee

add accounting of byte_offset

40ee877

transfer NDArray generation method to NDArray class instead of VM

b8bfe9a

describe conditions. check IsContiguous for external DLTensor

5ff2c62

hide safeless method in private

908c7ef

fix lint

830813b

fix lint

12190c2

check conditions for correct creation of NDArray from external DLTensor

6d22d4e

lint fix

6d56daa

update API after review

ffdfa29

vvchernov force-pushed the agladyshev/dev branch from fd5631d to ffdfa29 Compare May 26, 2022 14:51

tkonolige approved these changes May 26, 2022

Valery Chernov added 2 commits May 27, 2022 08:50

empty commit. restart CI tests

05dcc7b

empty commit. restart CI tests once more

993f2c1

tmoreau89 approved these changes May 27, 2022

tmoreau89 merged commit 2a2d910 into apache:main May 27, 2022

vvchernov mentioned this pull request May 28, 2022

[VM] check DLManagedTensor for conditions to construct NDArray #11504

Merged

KJlaccHoeUM9l deleted the agladyshev/dev branch December 20, 2022 11:28

Lunderberg mentioned this pull request May 4, 2023

[TIR][Runtime] Allow use of external non-compact/non-aligned buffers #14771

Closed
