My research interests lie at the intersection of Multimodal Large Models, Generative Models, and Efficient AI.
- π« I'm an undergraduate student at Xidian University, majoring in Data Science within the School of Computer Science and Technology.
- π I'm passionate about Deep Learning, Computer Vision, and especially interested in efficient video generation and multimodal reasoning.
- π I am currently an intern at DAMO Academy (Alibaba Group), and concurrently at ZIP Lab.
- [ICLR 2026] BLADE: A Joint Framework of Sparse Attention and Step Distillation for Efficient Video Generation
- [ICML 2026] World-R1: World-R1: Reinforcing 3D Constraints for Text-to-Video Generation
- PSA: Pyramid Sparse Attention for Efficient Video Understanding and Generation
Here are some of the technologies I am proficient with:
| Platform | Link | Description |
|---|---|---|
π» GitHub |
github.com/Tacossp | My projects and code |
π§ Email |
youpgu71@gmail.com | Feel free to reach out |
