Pengze Zhang (张鹏泽)

I'm a reasercher at ByteDance in Beijing, China. I obtained my Ph.D degree from the Sun Yat-sen University in 2024, advisor by Prof. Xiaohua Xie and Prof. Jianhuang Lai. I spent wonderful vacations as an intern at Wechat, Tencent. Before that, I received B.Eng from Sun Yat-sen University in 2019.

Email  /  Scholar  /  Github

profile photo

Research

I’ve been fortunate to focus on the fascinating and inspiring field of visual generation since starting my PhD, with research interests spanning image/video generation, generative models, and multimodal generation.

DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer
Xu Guo*, Fulong Ye*, Xinghui Li*, Pengqi Tu, Pengze Zhang, Qichao Sun, Songtao Zhao, Xiangwang Hou, Qian He
arXiv, 2025
project page / arXiv

InstructX: Towards Unified Visual Editing with MLLM Guidance
Chong Mou, Qichao Sun, Yanze Wu, Pengze Zhang, Xinghui Li, Fulong Ye, Songtao Zhao, Qian He
arXiv, 2025
project page / arXiv

OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models
Jinshu Chen*, Xinghui Li*, Xu Bai*, Tianxiang Ma, Pengze Zhang, Zhuowei Chen, Gen Li, Lijie Liu, Songtao Zhao, Bingchuan Li, Qian He
arXiv, 2025
project page / arXiv

DreamO: A Unified Framework for Image Customization
Chong Mou, Yanze Wu, Wenxu Wu, Zinan Guo, Pengze Zhang, Yufeng Cheng, Yiming Luo, Fei Ding, Shiwen Zhang, Xinghui Li, Mengtian Li, Mingcong Liu, Yi Zhang, Shaojin Wu, Songtao Zhao, Jian Zhang, Qian He, Xinglong Wu
SIGGRAPH Asia, 2025
project page / arXiv

DreamID: A Fast and High-Fidelity diffusion-based Face Swapping via Triplet ID Group Learning
Fulong Ye, Miao Hua, Pengze Zhang, Xinghui Li, Qichao Sun, Songtao Zhao, Qian He, Xinglong Wu
SIGGRAPH Asia, 2025
project page / arXiv

MUSAR: Exploring Multi-Subject Customization from Single-Subject Dataset via Attention Routing
Zinan Guo, Pengze Zhang, Yanze Wu, Chong Mou, Songtao Zhao, Qian He
arxiv, 2025
project page / arXiv

AnyDressing: Customizable Multi-Garment Virtual Dressing via Latent Diffusion Models
Xinghui Li, Qichao Sun, Pengze Zhang, Fulong Ye, Zhichao Liao, Wanquan Feng, Songtao Zhao, Qian He
CVPR, 2025
project page / arXiv

Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models
Pengze Zhang*, Hubery Yin* Chen Li, Xiaohua Xie
CVPR, 2024
project page / arXiv

Pose Guided Person Image Generation Via Dual-Task Correlation and Affinity Learning
Pengze Zhang, Lingxiao Yang, Xiaohua Xie, Jianhuang Lai
TVCG, 2023
ieee

Formulating Discrete Probability Flow Through Optimal Transport
Pengze Zhang*, Hubery Yin* Chen Li, Xiaohua Xie
NeurIPS, 2023
project page / openreview / arxiv

Exploring Dual-task Correlation for Pose Guided Person Image Generation
Pengze Zhang, Lingxiao Yang, Jianhuang Lai, Xiaohua Xie
CVPR, 2022
project page / arxiv

Lightweight Texture Correlation Network for Pose Guided Person Image Generation
Pengze Zhang, Lingxiao Yang, Jianhuang Lai, Xiaohua Xie
TCSVT, 2021
project page / ieee

Selected Awards

Outstanding Scholarship of the Tencent Rhino-bird Research Elite Program, Tencent, 2024

Outstanding Graduate of Sun Yat-sen University, 2024

China National Scholarship, 2023

Outstanding Graduate of Sun Yat-sen University, 2019

Guanghua Education Scholarship, 2018

Academic Service

Conference Reviewer: NeurIPS, ICML, CVPR, ICCV, ECCV, AAAI

Journal Reviewer: TPAMI, TVCG, TCSVT