Yuanzhi Wang is currently a PhD student at Nanjing University of Science and Technology, with Vision and Graph Group (VGG) affiliated with the PCA Lab, advised by Professor Zhen Cui. He is currently a research intern in the Department of Content Security, Kuaishou Technology.

His research interests include Multimodal Machine Learning, Generative Modeling, AIGC, Image/Video Processing and Analysis, etc. He currently focuses on Multimodal Content Generation, Perception, and Understanding, Multimodal/Cross-modal Generative Modeling, and Text-guided Image/Video Generation and Editing.

🔥 News

  • 2025.03:  🎉🎉 One paper is accepted by CVPR 2025.
  • 2025.01:  🔥🔥 Invited talk at VALSE Webinar!
  • 2025.01:  🔥🔥 Our MMM-RS dataset is now released!
  • 2024.12:  🎉🎉 Two papers are accepted by AAAI 2025.
  • 2024.11:  🔥🔥 Invited talk at Conference on Artificial Intelligence in Jiangsu Province!
  • 2024.09:  🎉🎉 One paper is accepted by NeurIPS 2024.
  • 2024.08:  🎉🎉 One paper is accepted by ACM TOMM 2024.
  • 2023.09:  🎉🎉 One paper is accepted by NeurIPS 2023.
  • 2023.07:  🎉🎉 One paper is accepted by ICCV 2023.
  • 2023.07:  🎉🎉 One paper is accepted by IEEE TMM 2023.
  • 2023.02:  🎉🎉 One paper is accepted by CVPR 2023.

📝 Publications

→ Full list (Google Scholar)

CVPR 2025
sym

Distribution Prototype Diffusion Learning for Open-set Supervised Anomaly Detection

Fuyun Wang, Tong Zhang, Yuanzhi Wang, Yide Qiu, Xin Liu, Xu Guo, Zhen Cui

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025

[Paper] [Codes]

AAAI 2025
sym

Re-Attentional Controllable Video Diffusion Editing

Yuanzhi Wang, Yong Li, Mengyi Liu, Xiaoya Zhang, Xin Liu, Zhen Cui, Antoni B. Chan

In the 39th Annual AAAI Conference on Artificial Intelligence (AAAI), 2025

[Paper] [Codes]

NeurIPS 2024
sym

MMM-RS: A Multi-modal, Multi-GSD, Multi-scene Remote Sensing Dataset and Benchmark for Text-to-Image Generation

Jialin Luo, Yuanzhi Wang (Co-first Authors, Equal Contribution), Ziqi Gu, Yide Qiu, Shuaizhen Yao, Fuyun Wang, Chunyan Xu, Wenhua Zhang, Dan Wang, Zhen Cui

In the 38th Conference on Neural Information Processing Systems (NeurIPS), 2024

[Paper] [Codes] [公众号报道]

ACM TOMM 2024
sym

Edit Temporal-Consistent Videos with Image Diffusion Model

Yuanzhi Wang, Yong Li, Xiaoya Zhang, Xin Liu, Anbo Dai, Antoni B. Chan, Zhen Cui

ACM Transactions on Multimedia Computing, Communications, and Applications (ACM TOMM), 2024

[Paper] [Arxiv Version] [Codes]

NeurIPS 2023
sym

Incomplete Multimodality-Diffused Emotion Recognition

Yuanzhi Wang, Yong Li, Zhen Cui

In the 37th Conference on Neural Information Processing Systems (NeurIPS), 2023

[Paper] [Codes]

ICCV 2023
sym

Distribution-Consistent Modal Recovering for Incomplete Multimodal Learning

Yuanzhi Wang, Zhen Cui, Yong Li

Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023

[Paper] [Codes]

IEEE TMM 2023
sym

Learning to Hallucinate Face in the Dark (Highly Cited Papers)

Yuanzhi Wang, Tao Lu, Yuan Yao, Yanduo Zhang, Zixiang, Xiong

IEEE Transactions on Multimedia (TMM), 2023

[Paper] [Codes]

CVPR 2023
sym

Decoupled Multimodal Distilling for Emotion Recognition

Yong Li, Yuanzhi Wang, Zhen Cui

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023, Highlight paper (10% of accepted papers)

[Paper] [Codes]

IEEE TCSVT 2022
sym

FaceFormer: Aggregating Global and Local Representation for Face Hallucination

Yuanzhi Wang, Tao Lu, Yanduo Zhang, Zhongyuan, Wang, Junjun, Jiang, Zixiang, Xiong

IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2022

First Transformer-based Face Super-Resolution Method

[Paper]

IEEE TNNLS 2022
sym

Rethinking Prior-guided Face Super-resolution: A New Paradigm with Facial Component Prior

Tao Lu, Yuanzhi Wang, Yanduo Zhang, Junjun, Jiang, Zhongyuan, Wang, Zixiang, Xiong

IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2022

[Paper]

ACM MM 2021
sym

Face Hallucination via Split-Attention in Split-Attention Network

Tao Lu, Yuanzhi Wang (Corresponding Author), Yanduo Zhang, Yu Wang, Liu Wei, Zhongyuan Wang, Junjun Jiang

Proceedings of the 29th ACM International Conference on Multimedia (ACM MM), 2021

[Paper] [Codes]

NTIRE 2021
sym

Multi-Scale Self-Calibrated Network for Image Light Source Transfer

Yuanzhi Wang, Tao Lu, Yanduo Zhang, Yuntao Wu

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2021

[Paper] [Codes]

🎖 Honors and Awards

  • 2024.11 Outstanding Ph.D student of Nanjing University of Science and Technology
  • 2024.11 National Scholarship (The highest national scholarship for students in China)
  • 2023.11 Outstanding Ph.D student of Nanjing University of Science and Technology
  • 2022.11 Outstanding Master’s Thesis of Wuhan Institute of Technology
  • 2022.06 Outstanding Graduates of Wuhan Institute of Technology
  • 2021.11 National Scholarship (The highest national scholarship for students in China)

📖 Educations

  • 2022.09 - Now, Ph.D in Computer Science and Technology at the School of Computer Science and Engineering, Nanjing University of Science and Technology, China (Supervisor: Prof. Dr. Zhen Cui)
  • 2019.09 - 2022.06, MS in Computer Technology at the School of Computer Science and Engineering, Wuhan Institute of Technology, China (Supervisor: Prof. Dr. Tao Lu)

💬 Invited Talks

💻 Internships

  • 2024.06 - Now, Research Intern in the Department of Content Security, Kuaishou Technology (Beijing), China (Leader: Dr. Mengyi Liu)