Can alltoall use zero copy (send only, without recv) via ncclCommRegister? #341

@ieeechuhaoyun

// nWarpPerWork = nWarps/nWorks
int nWarpPerWork = __popc(__ballot_sync(~0u, nWorks*(lane+1) <= nWarps));
int nRecvWarpPerWork = nWarpPerWork<=4 ? nWarpPerWork/2 : (nWarpPerWork-1)/2;
int nSendWarpPerWork = nWarpPerWork<=4 ? nRecvWarpPerWork : nRecvWarpPerWork+1;

I found this code in nccl/src/device/sendrecv.h. It means recv always takes roughly half of the warps, so maybe the nccl-tests -R parameter cannot be used with alltoall?
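
To make the split concrete, here is a small host-side C++ sketch that replays the arithmetic from the snippet above. It is not NCCL code: `WarpSplit` and `splitWarps` are hypothetical names used only for illustration, and `__popc(__ballot_sync(...))` is emulated by counting over 32 lanes, assuming a full warp takes part in the ballot.

```cpp
// Hypothetical helper type: holds the per-work warp counts from the snippet.
struct WarpSplit {
  int perWork;  // nWarpPerWork
  int recv;     // nRecvWarpPerWork
  int send;     // nSendWarpPerWork
};

// Host-side replay of the quoted device arithmetic (illustration only).
WarpSplit splitWarps(int nWarps, int nWorks) {
  // Emulates __popc(__ballot_sync(~0u, nWorks*(lane+1) <= nWarps)):
  // count the lanes (0..31) whose predicate is true.
  int perWork = 0;
  for (int lane = 0; lane < 32; ++lane) {
    if (nWorks * (lane + 1) <= nWarps) ++perWork;
  }
  // Same two expressions as in the quoted code.
  int recv = perWork <= 4 ? perWork / 2 : (perWork - 1) / 2;
  int send = perWork <= 4 ? recv : recv + 1;
  return {perWork, recv, send};
}
```

For example, with 16 warps and 2 works this gives 8 warps per work, split into 3 recv and 4 send; with 8 warps and 2 works it gives 4 per work, split 2 and 2. In both cases recv warps are reserved regardless of whether the receive side actually has to copy data, which is what my question is about.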
