[Feature Request] Support blocksparse mask like topk, topp

- `block_mask`
```py
AttentionEngine(block_size=BLOCK_SIZE)(
    q,           # [batch, seqlenq, head, dimqk]
    k,           # [batch, seqlenkv, head, dimqk]
    v,           # [batch, seqlenkv, head, dimv]
    block_mask,  # [batch, seqlenq//BLOCK_SIZE, seqlenkv//BLOCK_SIZE]
)
```
- `block_indices`
```py
AttentionEngine(block_size=BLOCK_SIZE)(
    q,                   # [batch, seqlenq, head, dimqk]
    k,                   # [batch, seqlenkv, head, dimqk]
    v,                   # [batch, seqlenkv, head, dimv]
    block_indices,       # [batch, seqlenq//BLOCK_SIZE, head, MAX_BLOCKS]
    selected_block_num,  # [batch, head]
)
```
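
To make the two proposed inputs concrete, here is a minimal pure-Python sketch of how a caller might prepare them. It is not part of `AttentionEngine`; the helper names `topk_block_indices` and `expand_block_mask`, the `block_scores` input, and the `PAD = -1` filler for unused slots are all assumptions made for illustration. The first helper performs a top-k selection per head and emits the `block_indices` layout above; the second expands a `block_mask` to token granularity, which is one way to read what the mask is meant to express.

```py
from typing import List

PAD = -1  # assumed filler for unused slots; the request does not specify one


def topk_block_indices(
    block_scores: List[List[List[List[float]]]],  # [batch][q_blocks][head][kv_blocks]
    selected_block_num: List[List[int]],          # [batch][head]
    max_blocks: int,
) -> List[List[List[List[int]]]]:
    """Return block_indices laid out as [batch][q_blocks][head][max_blocks]."""
    out = []
    for b, per_batch in enumerate(block_scores):
        out_b = []
        for per_q in per_batch:
            out_q = []
            for h, scores in enumerate(per_q):
                k = min(selected_block_num[b][h], max_blocks, len(scores))
                # Rank KV blocks by score, highest first; ties go to the lower index.
                ranked = sorted(range(len(scores)), key=lambda j: (-scores[j], j))
                # Store the kept blocks in ascending order (an assumed convention).
                chosen = sorted(ranked[:k])
                out_q.append(chosen + [PAD] * (max_blocks - k))
            out_b.append(out_q)
        out.append(out_b)
    return out


def expand_block_mask(
    block_mask: List[List[List[bool]]],  # [batch][q_blocks][kv_blocks]
    block_size: int,
) -> List[List[List[bool]]]:
    """Expand a block mask to a token-level mask [batch][seqlenq][seqlenkv]."""
    return [
        [
            [bool(row[j // block_size]) for j in range(len(row) * block_size)]
            for row in per_batch
            for _ in range(block_size)  # repeat each block row for every query token
        ]
        for per_batch in block_mask
    ]
```

A top-p variant would differ only in how `k` is chosen per head (a cumulative-probability cutoff instead of a fixed count), which is presumably why `selected_block_num` is passed separately from `MAX_BLOCKS`.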

[Feature Request] Support blocksparse mask like topk, topp #11

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Feature Request] Support blocksparse mask like topk, topp #11

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions