-
VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE.
Yazhou Xing*, Yang Fei*, Yingqing He*†, Jingye Chen, Jiaxin Xie, Xiaowei Chi, Qifeng Chen†
ICCV, 2025.
-
LDM-ISP: Enhancing Neural ISP for Low Light with Latent Diffusion Models.
Qiang Wen, Zhefan Rao, Yazhou Xing†, Qifeng Chen†
ICRA, 2025.
-
ModelGrow: Continual Text-to-Video Pre-training with Model Expansion and Language Understanding Enhancement.
Zhefan Rao, Liya Ji, Yazhou Xing, Runtao Liu, Zhaoyang Liu, Jiaxin Xie, Ziqiao Peng, Yingqing He, Qifeng Chen
arXiv, 2024.
-
LLMs Meet Multimodal Generation and Editing: A Survey.
Yingqing He, Zhaoyang Liu, Jingye Chen, Zeyue Tian, Hongyu Liu, Xiaowei Chi, Runtao Liu, Ruibin Yuan, Yazhou Xing, Wenhai Wang, Jifeng Dai, Yong Zhang, Wei Xue, Qifeng Liu, Yike Guo, Qifeng Chen
arXiv, 2024.
-
Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners.
Yazhou Xing*, Yingqing He*, Zeyue Tian*, Xintao Wang, Qifeng Chen
CVPR, 2024.
-
Online Overexposed Pixels Hallucination in Videos with Adaptive Reference Frame Selection.
Yazhou Xing, Amrita Mazumdar, Anjul Patney, Chao Liu, Hongxu Yin, Qifeng Chen, Jan Kautz, Iuri Frosio
arXiv, 2023.
-
Invertible Image Signal Processing.
Yazhou Xing*, Zian Qian*, Qifeng Chen
CVPR, 2021.
-
Blind Video Temporal Consistency via Deep Video Prior.
Chenyang Lei*, Yazhou Xing*, Qifeng Chen
NeurIPS, 2020.