Preprints
* denotes equal contribution, and † denotes corresponding author.
|
Publications
|
|
MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning
Haoning Wu*, Shaocheng Shen*, Qiang Hu, Xiaoyun Zhang†, Ya Zhang, Yanfeng Wang
WACV, 2025. (NEW)
project page / arXiv / code
In this work, we propose a tuning-free strategy that extends existing diffusion models to higher-resolution image generation.
|
|
MatchTime: Towards Automatic Soccer Game Commentary Generation
Jiayuan Rao*, Haoning Wu*, Chang Liu, Yanfeng Wang†, Weidi Xie†
EMNLP, 2024. (Oral Presentation) (NEW)
project page / arXiv / code
In this work, we focus on building a visual-language model for automatic soccer game commentary generation.
|
|
Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models
Chang Liu*, Haoning Wu*, Yujie Zhong, Xiaoyun Zhang, Yanfeng Wang†, Weidi Xie†
CVPR, 2024. (NEW)
project page / arXiv / code
In this work, we focus on the task of generating a coherent image sequence from a given storyline, termed open-ended visual storytelling.
|
|
NeRF-SDP: Efficient Generalizable Neural Radiance Field with Scene Depth Perception
Qiuwen Wang, Shuai Guo, Haoning Wu, Rong Xie, Li Song†, Wenjun Zhang
ACM Multimedia Asia, 2023. (Oral Presentation)
paper / code
In this work, we propose a novel framework, termed NeRF-SDP, to address the challenge of balancing rendering speed and quality in generalizable NeRF.
|
|
Boost Video Frame Interpolation via Simple Motion Adaptation
Haoning Wu, Xiaoyun Zhang†, Weidi Xie, Ya Zhang, Yanfeng Wang†
BMVC, 2023. (Oral Presentation)
project page / arXiv / code
In this work, we propose a novel optimization-based VFI method that can adapt to unseen motions at test time and boost existing pre-trained models.
|
|
LAR-SR: A Local Autoregressive Model for Image Super-Resolution
Baisong Guo*, Xiaoyun Zhang*†, Haoning Wu, Yu Wang, Ya Zhang, Yanfeng Wang†
CVPR, 2022.
paper / code
We propose a novel approach called LAR-SR for super-resolution based on a Local AutoRegressive module, which achieves superior performance compared with other generative models for SR.
|
|
Computer Vision and Pattern Recognition (CVPR 2023, 2024, 2025)
International Conference on Computer Vision (ICCV 2023)
European Conference on Computer Vision (ECCV 2024)
ACM Multimedia (ACM MM 2024)
British Machine Vision Conference (BMVC 2024) (Outstanding Reviewer)
AAAI Conference on Artificial Intelligence (AAAI 2025)
|
|
[2024] BMVC 2024 Outstanding Reviewer
[2021] China National Scholarship
[2021] School Scholarship B Prize
[2021] Outstanding Student Leader of Shanghai Jiao Tong University
[2020] School Scholarship C Prize
[2020] Three Good Student of Shanghai Jiao Tong University
|
Updated in November 2024.
Thanks to Jon Barron for this amazing website template.
|
|