Chenyi Zhuang

INTJ / Anime / Coffee and cats

I'm a third-year master's student in Computer Technology at Nanjing University of Aeronautics and Astronautics (2022-2025), advised by Prof. Pan Gao at the Immersive and Interactive Multimedia Lab (I2ML).

My current research interests are image-related (multi-modal) vision tasks, especially generative models and their explainability. I am also interested in computer graphics and video understanding.

NEWS! I am applying for PhD programs for Fall 2025. Feel free to contact me ⬇

WeChat / Email / CV / Google Scholar / GitHub


Research

* indicates equal contribution.

DiffuseST: Unleashing the Capability of the Diffusion Model for Style Transfer
Ying Hu*, Chenyi Zhuang*, Pan Gao. ACM MM Asia 2024

Leverage textual and spatial representations and the step-by-step denoising nature of the pre-trained diffusion model to achieve balanced style transfer results.

code / arXiv

Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Function
Chenyi Zhuang, Ying Hu, Pan Gao. NeurIPS 2024

An in-depth analysis of attribute understanding in the CLIP text encoder and CLIP-based diffusion models, together with a novel training-free approach to the attribute binding issue.

code / arXiv

CDFormer: When Degradation Prediction Embraces Diffusion Model for Blind Image Super-Resolution
Qingguo Liu, Chenyi Zhuang, Pan Gao, Jie Qin. CVPR 2024

Investigate the diffusion model as an estimator to predict Content Degradation Prior (CDP) with rich content detail for the super-resolution task.

code / arXiv

StylePrompter: All Styles Need Is Attention
Chenyi Zhuang, Pan Gao, Aljosa Smolic. ACM MM 2023

Propose a Transformer-based framework to predict W+ codes at the token level for StyleGAN, with a Style-driven Multi-scale Adaptive Refinement Transformer (SMART) block to refine features in F space.

code / arXiv

What's More?

Two principles guide my academic career: "A believing heart is a magic," which expresses the faith and commitment I bring to whatever comes next; and "simple but effective," which guides a work style aimed at high productivity and creativity.


Last updated: February 2, 2025