Skip to content

Conversation

@jinhongyii
Copy link
Contributor

This PR introduces kv transfer kernel and KV cache integration used in prefill-decode disaggregation.

Co-authored-by: Ruihang Lai ruihangl@cs.cmu.edu
Co-authored-by: Charlie Ruan 53290280+CharlieFRuan@users.noreply.github.com
Co-authored-by: Yingcheng Wang 135535812+yingchen21@users.noreply.github.com

Co-authored-by: Ruihang Lai <ruihangl@cs.cmu.edu>
Co-authored-by: Charlie Ruan <53290280+CharlieFRuan@users.noreply.github.com>
Co-authored-by: Yingcheng Wang <135535812+yingchen21@users.noreply.github.com>
@jinhongyii
Copy link
Contributor Author

cc: @MasterJH5574

Copy link
Contributor

@MasterJH5574 MasterJH5574 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

@MasterJH5574 MasterJH5574 changed the title Implementation of KV cache transfer [Runtime][Dist] Implementation of KV cache transfer Dec 15, 2024
@MasterJH5574 MasterJH5574 merged commit 567eeed into apache:main Dec 15, 2024
20 checks passed
ShiboXing pushed a commit to ShiboXing/tvm that referenced this pull request Aug 10, 2025
This PR introduces kv transfer kernel and KV cache integration used
in prefill-decode disaggregation.

Co-authored-by: Ruihang Lai <ruihangl@cs.cmu.edu>
Co-authored-by: Charlie Ruan <53290280+CharlieFRuan@users.noreply.github.com>
Co-authored-by: Yingcheng Wang <135535812+yingchen21@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants