[Runtime] Reorganize PagedKVCache attn kernel invocation #17237

MasterJH5574 · 2024-08-02T21:00:21Z

This PR reorganizes the attention kernel invocation logic in the PagedKVCache, so that in cases of sequence fork, we can effectively merge one ragged-prefill kernel and a decode kernel into a single decode kernel.
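For readers unfamiliar with why two attention kernel launches can be collapsed into one, here is a minimal numerical sketch. It is not the PagedKVCache code and does not reflect the actual TVM kernels; the helper names `attend` and `merge` and all data values are hypothetical. It only illustrates the general identity that attention over two KV segments, computed separately and combined through their log-sum-exp values, equals a single pass over the concatenated KV, assuming one query token per sequence (the decode case).

```python
import math

def attend(q, keys, values):
    """Single-query softmax attention over one KV segment.

    Returns (output, lse), where lse is the log-sum-exp of the scaled scores.
    Hypothetical helper for illustration only.
    """
    scale = 1.0 / math.sqrt(len(q))
    scores = [scale * sum(qi * ki for qi, ki in zip(q, k)) for k in keys]
    m = max(scores)  # subtract the max for numerical stability
    weights = [math.exp(s - m) for s in scores]
    denom = sum(weights)
    dim = len(values[0])
    out = [sum(w * v[d] for w, v in zip(weights, values)) / denom for d in range(dim)]
    return out, m + math.log(denom)

def merge(out_a, lse_a, out_b, lse_b):
    """Combine two partial attention results using their log-sum-exp values."""
    m = max(lse_a, lse_b)
    wa, wb = math.exp(lse_a - m), math.exp(lse_b - m)
    return [(wa * a + wb * b) / (wa + wb) for a, b in zip(out_a, out_b)]

# Made-up data: three cached KV entries (e.g. a prefix shared after a fork)
# and one newly appended KV entry for the current token.
q = [0.5, -1.0, 0.25, 2.0]
cached_k = [[1.0, 0.0, 0.5, -0.5], [0.2, 0.3, -0.1, 0.4], [-1.0, 0.5, 0.0, 1.0]]
cached_v = [[1.0, 2.0], [0.0, -1.0], [3.0, 0.5]]
new_k = [[0.1, -0.2, 0.3, 0.9]]
new_v = [[-2.0, 4.0]]

# Two passes (one per segment) followed by a merge step.
out_hist, lse_hist = attend(q, cached_k, cached_v)
out_new, lse_new = attend(q, new_k, new_v)
two_pass = merge(out_hist, lse_hist, out_new, lse_new)

# One pass over the concatenated KV.
one_pass, _ = attend(q, cached_k + new_k, cached_v + new_v)
```

Under these assumptions `two_pass` and `one_pass` agree up to floating-point rounding, which is the property that makes folding the two launches into one possible in principle.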

MasterJH5574 · 2024-08-02T21:00:41Z

~~Depending on #17236.~~


MasterJH5574 marked this pull request as draft August 2, 2024 21:00

tqchen approved these changes Aug 2, 2024


[Runtime] Reorganize PagedKVCache attn kernel invocation

4351d36


MasterJH5574 force-pushed the tvm-dev/2024-08-02-kvcache-invocation-reorg branch from eac154a to 4351d36 Compare August 3, 2024 15:10

MasterJH5574 marked this pull request as ready for review August 3, 2024 15:10

tqchen merged commit cd09ab6 into apache:main Aug 4, 2024

ysh329 mentioned this pull request Oct 16, 2024

[Release] v0.18.0 Release Candidate Notes #17468

Closed

kurisu6912 mentioned this pull request Sep 5, 2025

kurisu add assume attr patch 1 tile-ai/tvm#8

Closed
