Yizeng Han (韩益增)

I'm a research scientist at Alibaba DAMO Academy, Beijing, China. I received my Ph.D. degree from the Department of Automation, Tsinghua University, where I was advised by Prof. Gao Huang and Prof. Shiji Song.

Download my C.V. here: English / Simplified Chinese.

🌟 My research focuses on deep learning, computer vision and medical AI, in particular dynamic neural networks and efficient learning/inference of deep models in resource-constrained scenarios.

🔥 Recently, I have been interested in efficient/dynamic vision-language models (VLMs), visual generation, and scalable medical AI systems.

🧐 I'm also interested in fundamental machine learning problems, such as semi-supervised long-tailed learning and fine-grained learning.

📚 Education

  • Ph.D., Tsinghua University, 2018 - 2024.
  • B.E., Tsinghua University, 2014 - 2018.

💡 Research Experience

  • Research Intern, Megvii Technology (Foundation Model Group, advisor: Xiangyu Zhang), 04/2023 - 12/2023
  • Research Intern, Georgia Institute of Technology (advisor: Gregory D. Abowd), 06/2017 - 08/2017

News

Selected Papers (Google Scholar)

DyDiT++

DyDiT++: Dynamic Diffusion Transformers for Efficient Visual Generation

Wangbo Zhao*, Yizeng Han*, Jiasheng Tang, Kai Wang, Hao Luo, Yibing Song, Gao Huang, Fan Wang, Yang You
arXiv preprint, 2025.

We extend DyDiT to text-to-image (T2I) generation (DyFLUX) and to video generation, and add support for LoRA fine-tuning.

RAPID^3

RAPID^3: Tri-Level Reinforced Acceleration Policies for Diffusion Transformer

Wangbo Zhao, Yizeng Han, Zhiwei Tang, Jiasheng Tang, Pengfei Zhou, Kai Wang, Bohan Zhuang, Zhangyang Wang, Fan Wang, Yang You
arXiv preprint, 2025.

Inferix

Inferix: A Block-Diffusion based Next-Generation Inference Engine for World Simulation

Inferix Team: Tianyu Feng, Yizeng Han, Jiahao He, Yuanyu He, Xi Lin, Teng Liu, Hanfeng Lu, Jiasheng Tang, Wei Wang, Zhiyuan Wang, Jichao Wu, Mingyang Yang, Yinghao Yu, Zeyu Zhang, Bohan Zhuang
arXiv preprint, 2025.

BlockVid

BlockVid: Block Diffusion for High-Quality and Consistent Minute-Long Video Generation

Zeyu Zhang, Shuning Chang, Yuanyu He, Yizeng Han, Jiasheng Tang, Fan Wang, Bohan Zhuang
arXiv preprint, 2025.

AdaptiveNN

Emulating Human-like Adaptive Vision for Efficient and Flexible Machine Visual Perception

Yulin Wang, Yang Yue, Yang Yue, Huanqian Wang, Haojun Jiang, Yizeng Han, Zanlin Ni, Yifan Pu, Minglei Shi, Rui Lu, Qisen Yang, Andrew Zhao, Zhuofan Xia, Shiji Song, Gao Huang
Nature Machine Intelligence, 2025.

FPSAttention

FPSAttention: Training-Aware FP8 and Sparsity Co-Design for Fast Video Diffusion

Akide Liu, Zeyu Zhang, Zhexin Li, Xuehai Bai, Yizeng Han, Jiasheng Tang, Yuanjie Xing, Jichao Wu, Mingyang Yang, Weihua Chen, Jiahao He, Yuanyu He, Fan Wang, Gholamreza Haffari, Bohan Zhuang
NeurIPS (Highlight), 2025.

SGL

A Stitch in Time Saves Nine: Small VLM is a Precise Guidance for Accelerating Large VLMs

Wangbo Zhao*, Yizeng Han*, Jiasheng Tang, Zhikai Li, Yibing Song, Kai Wang, Zhangyang Wang, Yang You
CVPR, 2025.

DyT

Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation

Wangbo Zhao, Jiasheng Tang, Yizeng Han, Yibing Song, Kai Wang, Gao Huang, Fan Wang, Yang You
NeurIPS, 2024.

DeeR

DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution

Yang Yue, Yulin Wang, Bingyi Kang, Yizeng Han, Shenzhi Wang, Shiji Song, Jiashi Feng, Gao Huang
NeurIPS, 2024.

Survey

Dynamic Neural Networks: A Survey

Yizeng Han*, Gao Huang*, Shiji Song, Le Yang, Honghui Wang, Yulin Wang
IEEE TPAMI (IF=24.314), 2021.

In this survey, we comprehensively review the rapidly developing area of dynamic neural networks.

LAUDNet

Latency-aware Unified Dynamic Networks for Efficient Image Recognition

Yizeng Han*, Zeyu Liu*, Zhihang Yuan*, Yifan Pu, Chaofei Wang, Shiji Song, Gao Huang
IEEE TPAMI (IF=24.314), 2024.

DyDiT

Dynamic Diffusion Transformer

Wangbo Zhao, Yizeng Han, Jiasheng Tang, Kai Wang, Yibing Song, Gao Huang, Fan Wang, Yang You
ICLR, 2025.

Dyn-Perceiver

Dynamic Perceiver for Efficient Visual Recognition

Yizeng Han*, Dongchen Han*, Zeyu Liu, Yulin Wang, Xuran Pan, Yifan Pu, Chao Deng, Junlan Feng, Shiji Song, Gao Huang
ICCV, 2023.

L2W-DEN

Learning to Weight Samples for Dynamic Early-exiting Networks

Yizeng Han*, Yifan Pu*, Zihang Lai, Chaofei Wang, Shiji Song, Junfen Cao, Wenhui Huang, Chao Deng, Gao Huang
ECCV, 2022.

LASNet

Latency-aware Spatial-wise Dynamic Networks

Yizeng Han*, Zhihang Yuan*, Yifan Pu*, Chenhao Xue, Shiji Song, Guangyu Sun, Gao Huang
NeurIPS, 2022.

RANet

Resolution Adaptive Networks for Efficient Inference

Le Yang*, Yizeng Han*, Xi Chen*, Shiji Song, Jifeng Dai, Gao Huang
CVPR, 2020.

SAR

Spatially Adaptive Feature Refinement for Efficient Inference

Yizeng Han, Gao Huang, Shiji Song, Le Yang, Yitian Zhang, Haojun Jiang
IEEE TIP (IF=11.041), 2021.

SimPro

SimPro: A Simple Probabilistic Framework Towards Realistic Long-Tailed Semi-Supervised Learning

Chaoqun Du*, Yizeng Han*, Gao Huang
ICML, 2024.

LearnableISDA

Fine-grained Recognition with Learnable Semantic Data Augmentation

Yifan Pu*, Yizeng Han*, Yulin Wang, Junlan Feng, Chao Deng, Gao Huang
IEEE TIP, 2023.

Awards

Contact