Skip to content

Clarification on Some Parameters and Locality-Constrained Sparse Attention Implementation #71

@linxiao-Shi

Description

@linxiao-Shi

Hi,

I have been exploring your implementation and came across the parameters topk_ratio and local_range. Could you please clarify the following points?

topk_ratio:

What does the topk_ratio parameter control in your model? How does it relate to the resolution of the input data?

local_range:

What is the role of local_range in the attention process? How does it constrain the attention span or locality?

Additionally, I am interested in understanding the Locality-Constrained Sparse Attention mechanism. Specifically:

Where is the implementation of Locality-Constrained Sparse Attention in the codebase?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions