Haoning Wu 「吴浩宁」

I am currently a 3rd-year PhD candidate of MediaBrain at Shanghai Jiao Tong University (SJTU), advised by Prof. Weidi Xie and Prof. Ya Zhang. Previously, I received my B.S. degree in EE (IEEE Pilot Class) also from SJTU in June 2022.

I'm generally interested in computer vision, especially generative models, AI4Science and AI4Sports. Feel free to contact me via my email!!!

WeChat: haoningwu_

Email  /  CV  /  Google Scholar  /  Github  /  Zhihu  /  LinkedIn

profile photo
News
Preprints

* denotes equal contribution, and denotes corresponding author.


Publications

* denotes equal contribution, and denotes corresponding author.

megafusion MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning
Haoning Wu*, Shaocheng Shen*, Qiang Hu, Xiaoyun Zhang, Ya Zhang, Yanfeng Wang
WACV, 2025.   (NEW)
project page / arXiv / code

In this work, we propose a tuning-free strategy to extend the higher-resolution image generation capabilities of existing diffusion models.

matchtime MatchTime: Towards Automatic Soccer Game Commentary Generation
Jiayuan Rao*, Haoning Wu*, Chang Liu, Yanfeng Wang, Weidi Xie
EMNLP, 2024.   (Oral Presentation)   (NEW)
project page / arXiv / code

In this work, we focus on building an visual-language model for automatic soccer game commentary generation.

storygen Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models
Chang Liu*, Haoning Wu*, Yujie Zhong, Xiaoyun Zhang, Yanfeng Wang, Weidi Xie
CVPR, 2024.   (NEW)
project page / arXiv / code

In this work, we focus on the task of generating a series of coherent image sequence based on a given storyline, denoted as open-ended visual storytelling.

nerf_sdp NeRF-SDP: Efficient Generalizable Neural Radiance Field with Scene Depth Perception
Qiuwen Wang, Shuai Guo, Haoning Wu, Rong Xie, Li Song, Wenjun Zhang
ACM Multimedia Asia, 2023.   (Oral Presentation)
paper / code

In this work, we propose a novel framework, termed as NeRF-SDP, to address the challenge of balancing rendering speed and quality in generalizable NeRF.

vfi_adapter Boost Video Frame Interpolation via Simple Motion Adaptation
Haoning Wu, Xiaoyun Zhang, Weidi Xie, Ya Zhang, Yanfeng Wang
BMVC, 2023.   (Oral Presentation)
project page / arXiv / code

In this work, we propose a novel optimization-based VFI method that can adapt to unseen motions at test time and boost existing pre-trained models.

lar_sr LAR-SR: A Local Autoregressive Model for Image Super-Resolution
Baisong Guo*, Xiaoyun Zhang*, Haoning Wu, Yu Wang, Ya Zhang, Yanfeng Wang
CVPR, 2022.
paper / code

We propose a novel approach called LAR-SR for super-resolution based on a Local AutoRegessive module, which achieves superior performance compared with other generative models for SR.

Reviewer Service
Computer Vision and Pattern Recognition (CVPR 2023, 2024, 2025)
International Conference on Computer Vision (ICCV 2023)
European Conference on Computer Vision (ECCV 2024)
ACM Multimedia (ACM MM 2024)
British Machine Vision Conference (BMVC 2024) (Outstanding Reviewer)
AAAI Conference on Artificial Intelligence (AAAI 2025)
Awards

[2024] BMVC 2024 Outstanding Reviewer
[2021] China National Scholarship
[2021] School Scholarship B Prize
[2021] Outstanding Student Leader of Shanghai Jiao Tong University
[2020] School Scholarship C Prize
[2020] Three Good Student of Shanghai Jiao Tong University

Updated in November. 2024

Thanks Jon Barron for this amazing website template.