Currently, I am a 4th-year Ph.D. student in Computer Science, School of Electronic and Computer Engineering, PKU Peking University, supervised by Prof. Jian Zhang. Previously, I received my B.Eng degree of software engineering from DLUT Dalian University of Technology in 2022. During my undergraduate studies, I am privileged to work closely with Prof. Risheng Liu and Prof. Xin Fan in the field of low-level vision.

My primary research interests include computer vision, diffusion model, and machine learning, mainly focusing on video-related Artificial Intelligence Generated Content (AIGC), Low-level Vision, and Novel View Synthesis. You are welcome to contact me via my email: szyang AT stu DOT pku DOT edu DOT cn

📜 Research Area

  1. Video-related AIGC
  2. Novel View Synthesis
  3. Low-level Vision

📝 Selected Publications

 Equal Contribution†, Corresponding Author*, [J] Journal, [C] Conference
GenCompositor: Generative Video Compositing with Diffusion Transformer
Shuzhou Yang, Xiaoyu Li, Xiaodong Cun, Guangzhi Wang, Lingen Li, Ying Shan, Jian Zhang*.
Preprint, 2025
arXiv | Project Page | Code

The prioneer work that enables effortlessly compositing different videos guided by user-specified trajectories and scales.

4DVD: Cascaded Dense-view Video Diffusion Model for High-quality 4D Content Generation
Shuzhou Yang, Xiaodong Cun, Xiaoyu Li*, Yaowei Li, Jian Zhang*.
Preprint, 2025
arXiv | Project Page

Generating dense-view videos through cascaded diffusion model.

Hybrid Fourier Score Distillation for Efficient One Image to 3D Object Generation
Shuzhou Yang, Yu Wang, Haijie Li, Jiarui Meng, Yanmin Wu, Xiandong Meng, Jian Zhang*.
Visual Intelligence (VI) [J], 2025
arXiv | Project Page | Code

Using both 2D and 3D diffusion models to generate 3D asset from a single image with hybrid fourier score distillation.

Neural Video Fields Editing
Shuzhou Yang, Chong Mou, Jiwen Yu, Yuhan Wang, Xiandong Meng, Jian Zhang*.
Computational Visual Media (CVMJ) [J], 2025
arXiv | Project Page | Code

Editing long videos coherently via neural video fields.

DiffLLE: Diffusion-based Domain Calibration for Weak Supervised Low-light Image Enhancement
Shuzhou Yang†, Xuanyu Zhang†, Yinhuai Wang, Jiwen Yu, Yuhan Wang, Jian Zhang*.
International Journal of Computer Vision (IJCV) [J], 2024
arXiv | Paper

Bridge the gap between real scenes and training data by diffusion model prior.

Implicit Neural Representation for Cooperative Low-light Image Enhancement
Shuzhou Yang, Moxuan Ding, Yanmin Wu, Zihan Li, Jian Zhang*.
International Conference on Computer Vision (ICCV) [C], 2023
arXiv | Paper | Code

Normalize images by neural representation and enhance them based on CLIP prior.

Multi-scale Synergism Ensemble Progressive and Contrastive Investigation for Image Restoration
Zhiying Jiang†, Shuzhou Yang†, Jinyuan Liu, Xin Fan, Risheng Liu*.
IEEE Transactions on Instrumentation and Measurement (TIM) [J], 2023
Paper | Code

Restore image degradation through a multi-scale progressive network.

NeRFocus: Neural Radiance Field for 3D Synthetic Defocus
Yinhuai Wang†, Shuzhou Yang†, Yujie Hu, Jian Zhang*.
Computer Vision and Pattern Recognition Workshop (CVPRW) [C], 2023
arXiv | Code

Realize defocusing effect in 3D scenarios.

💻 Academic Services

  • Journal Reviewer:
    • IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
    • International Journal of Computer Vision (IJCV)
    • IEEE Transactions on Image Processing (TIP)
    • IEEE Transactions on Multimedia (TMM)
    • IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)
    • ACM Transactions on Multimedia Computing, Communications and Applications (TOMM)
    • IEEE Journal of Selected Topics in Signal Processing (JSTSP)

🏫 Educations

  • Sep’2022-Jul’2027: Ph.D. (Computer Science), PKU Peking University
  • Sep’2018-Jul’2022: B.Eng (Software Engineering), DLUT Dalian University of Technology