Research
I’ve been fortunate to focus on the fascinating and inspiring field of visual generation since starting my PhD, with research interests spanning image/video generation, generative models, and multimodal generation.
DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer
Xu Guo* ,
Fulong Ye* ,
Xinghui Li* ,
Pengqi Tu ,
Pengze Zhang ,
Qichao Sun ,
Songtao Zhao ,
Xiangwang Hou ,
Qian He
arXiv , 2025
project page
/
arXiv
InstructX: Towards Unified Visual Editing with MLLM Guidance
Chong Mou ,
Qichao Sun ,
Yanze Wu ,
Pengze Zhang ,
Xinghui Li ,
Fulong Ye ,
Songtao Zhao ,
Qian He
arXiv , 2025
project page
/
arXiv
OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models
Jinshu Chen* ,
Xinghui Li* ,
Xu Bai* ,
Tianxiang Ma ,
Pengze Zhang ,
Zhuowei Chen ,
Gen Li ,
Lijie Liu ,
Songtao Zhao ,
Bingchuan Li ,
Qian He
arXiv , 2025
project page
/
arXiv
DreamO: A Unified Framework for Image Customization
Chong Mou ,
Yanze Wu ,
Wenxu Wu,
Zinan Guo,
Pengze Zhang ,
Yufeng Cheng,
Yiming Luo,
Fei Ding ,
Shiwen Zhang,
Xinghui Li ,
Mengtian Li ,
Mingcong Liu ,
Yi Zhang,
Shaojin Wu ,
Songtao Zhao ,
Jian Zhang ,
Qian He ,
Xinglong Wu
SIGGRAPH Asia , 2025
project page
/
arXiv
DreamID: A Fast and High-Fidelity diffusion-based Face Swapping via Triplet ID Group Learning
Fulong Ye ,
Miao Hua ,
Pengze Zhang ,
Xinghui Li ,
Qichao Sun ,
Songtao Zhao ,
Qian He ,
Xinglong Wu
SIGGRAPH Asia , 2025
project page
/
arXiv
MUSAR: Exploring Multi-Subject Customization from Single-Subject Dataset via Attention Routing
Zinan Guo,
Pengze Zhang ,
Yanze Wu ,
Chong Mou ,
Songtao Zhao ,
Qian He
arXiv , 2025
project page
/
arXiv
AnyDressing: Customizable Multi-Garment Virtual Dressing via Latent Diffusion Models
Xinghui Li ,
Qichao Sun ,
Pengze Zhang ,
Fulong Ye ,
Zhichao Liao ,
Wanquan Feng ,
Songtao Zhao ,
Qian He
CVPR , 2025
project page
/
arXiv
Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models
Pengze Zhang* ,
Hubery Yin* ,
Chen Li ,
Xiaohua Xie
CVPR , 2024
project page
/
arXiv
Pose Guided Person Image Generation Via Dual-Task Correlation and Affinity Learning
Pengze Zhang ,
Lingxiao Yang ,
Xiaohua Xie ,
Jianhuang Lai
TVCG , 2023
ieee
Formulating Discrete Probability Flow Through Optimal Transport
Pengze Zhang* ,
Hubery Yin* ,
Chen Li ,
Xiaohua Xie
NeurIPS , 2023
project page
/
openreview
/
arXiv
Exploring Dual-task Correlation for Pose Guided Person Image Generation
Pengze Zhang ,
Lingxiao Yang ,
Jianhuang Lai ,
Xiaohua Xie
CVPR , 2022
project page
/
arXiv
Lightweight Texture Correlation Network for Pose Guided Person Image Generation
Pengze Zhang ,
Lingxiao Yang ,
Jianhuang Lai ,
Xiaohua Xie
TCSVT , 2021
project page
/
ieee
Selected Awards
Outstanding Scholarship of the Tencent Rhino-bird Research Elite Program, Tencent, 2024
Outstanding Graduate of Sun Yat-sen University, 2024
China National Scholarship, 2023
Outstanding Graduate of Sun Yat-sen University, 2019
Guanghua Education Scholarship, 2018
Academic Service
Conference Reviewer: NeurIPS, ICML, CVPR, ICCV, ECCV, AAAI
Journal Reviewer: TPAMI, TVCG, TCSVT