About Me
I am a tenure-track Assistant Professor of AI at The Hong Kong University of Science and Technology (Guangzhou). I obtained my PhD from Nanyang Technological University, Singapore, and was previously an intern researcher at TikTok and Horizon Robotics.
My research interests include Spatial Intelligence, LLM Agents, Multimodal Learning, and related areas. I have published over 50 papers in top-tier conferences and journals, including TPAMI, IJCV, CVPR, and NeurIPS. I also serve as an area chair and reviewer for multiple leading conferences. My honors include the Rising Star Award at ICCSE 2025 and a Guangdong provincial talent award, among others.
RA and intern positions are open on 3D reconstruction projects. Please drop me an email if you are interested in collaborating.
News
- [Nov 2025] Two papers accepted to AAAI 2026, one of them selected as an Oral.
- [Aug 2025] One paper accepted to EMNLP Main 2025, one paper accepted to EMNLP Findings 2025.
- [Jul 2025] Four papers accepted to ACM MM 2025.
- [Jun 2025] Two papers accepted to ICCV 2025.
- [May 2025] One paper accepted to ICML 2025.
- [Apr 2025] Two papers accepted to CVPR 2025.
- [Jan 2025] Two papers accepted to ICRA 2025.
Highlighted Projects
Selected Publications
- Interpreting Fedspeak with Confidence: A LLM-Based Uncertainty-Aware Framework Guided by Monetary Policy Transmission Paths
Rui Yao, Qi Chai, Jinhai Yao, Siyuan Li, Junhao Chen, Qi Zhang, Hao Wang*
AAAI 2026, Oral
[Paper] [Code]
- FastAnimate: Towards Learnable Template Construction and Pose Deformation for Fast 3D Human Avatar Animation
Jian Shu, Nanjie Yao, Gangjian Zhang, Junlong Ren, Yu Feng, Hao Wang*
AAAI 2026
[Paper] [Project Page]
- Outdoor Monocular SLAM with Global Scale-Consistent 3D Gaussian Pointmaps
Chong Cheng, Sicheng Yu, Zijian Wang, Yifan Zhou, Hao Wang*
ICCV 2025
[Paper] [Project Page] [Code]
- RegGS: Unposed Sparse Views Gaussian Splatting with 3DGS Registration
Chong Cheng, Yu Hu, Sicheng Yu, Beizhen Zhao, Zijian Wang, Hao Wang*
ICCV 2025
[Paper] [Project Page] [Code]
- MultiGO: Towards Multi-level Geometry Learning for Monocular 3D Textured Human Reconstruction
Gangjian Zhang, Nanjie Yao, Shunsi Zhang, Hanfeng Zhao, Guoliang Pang, Jian Shu, Hao Wang*
CVPR 2025
[Paper] [Project Page]
- RGB-Only Gaussian Splatting SLAM for Unbounded Outdoor Scenes
Sicheng Yu, Chong Cheng, Yifan Zhou, Xiaojun Yang, Hao Wang*
ICRA 2025
[Paper] [Project Page]
- SCA3D: Enhancing Cross-modal 3D Retrieval via 3D Shape and Caption Paired Data Augmentation
Junlong Ren, Hao Wu, Hui Xiong, Hao Wang*
ICRA 2025
[Paper] [Code]
- Graph-Guided Scene Reconstruction from Images with 3D Gaussian Splatting
Chong Cheng, Gaochao Song, Yiyang Yao, Gangjian Zhang, Qinzheng Zhou, Hao Wang*
ICLR 2025
[Paper] [Code]
- VistaWise: Building Cost-Effective Agent with Cross-Modal Knowledge Graph for Minecraft
Honghao Fu, Junlong Ren, Qi Chai, Deheng Ye, Yujun Cai, Hao Wang*
EMNLP Main 2025
[Paper]
- CausalMACE: Causality Empowered Multi-Agents in Minecraft Cooperative Tasks
Qi Chai, Zhang Zheng, Junlong Ren, Deheng Ye, Zichuan Lin, Hao Wang*
EMNLP Findings 2025
[Paper]
- SAT: Supervisor Regularization and Animation Augmentation for Two-process Monocular Texture 3D Human Reconstruction
Gangjian Zhang, Jian Shu, Nanjie Yao, Hao Wang*
ACM MM 2025
[Paper]
- MultiMind: Enhancing Werewolf Agents with Multimodal Reasoning and Theory of Mind
Zhang Zheng, Nuoqian Xiao, Qi Chai, Deheng Ye, Hao Wang*
ACM MM 2025
[Paper]
- Wavelet-GS: 3D Gaussian Splatting with Wavelet Decomposition
Beizhen Zhao, Yifan Zhou, Sicheng Yu, Zijian Wang, Hao Wang*
ACM MM 2025
[Paper]
- Graph-Guided Dual-Level Augmentation for 3D Scene Segmentation
Hongbin Lin, Yifan Jiang, Juangui Xu, Jesse Jiaxi Xu, Yi Lu, Zhengyu Hu, Ying-Cong Chen, Hao Wang*
ACM MM 2025
[Paper]
- GVKF: Gaussian Voxel Kernel Functions for Highly Efficient Surface Reconstruction in Open Scenes
Gaochao Song, Chong Cheng, Hao Wang*
NeurIPS 2024
[Paper] [Project Page]
- LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay
Yihuai Lan, Zhiqiang Hu, Lei Wang, Yang Wang, Deheng Ye, Peilin Zhao, Ee-Peng Lim, Hui Xiong, Hao Wang*
EMNLP Main 2024
[Paper] [Code]
- ADHMR: Aligning Diffusion-based Human Mesh Recovery via Direct Preference Optimization
Wenhao Shen, Wanqi Yin, Xiaofeng Yang, Cheng Chen, Chaoyue Song, Zhongang Cai, Lei Yang, Hao Wang*, Guosheng Lin*
ICML 2025
[Paper]
- HMR-Adapter: A Lightweight Adapter with Dual-Path Cross Augmentation for Expressive Human Mesh Recovery
Wenhao Shen, Wanqi Yin, Hao Wang*, Chen Wei, Zhongang Cai, Lei Yang, Guosheng Lin*
ACM MM 2024
[Paper]
- ManiCLIP: Multi-Attribute Face Manipulation from Text
Hao Wang, Guosheng Lin, Ana García del Molino, Anran Wang, Jiashi Feng, Zhiqi Shen
IJCV 2024
[Paper] [Code]
- TAPS3D: Text-Guided 3D Textured Shape Generation from Pseudo Supervision
Jiacheng Wei*, Hao Wang*, Jiashi Feng, Guosheng Lin, Kim-Hui Yap
CVPR 2023
[Paper] [Code]
- Cross-Modal Graph with Meta Concepts for Video Captioning
Hao Wang, Guosheng Lin, Steven Hoi, Chunyan Miao
TIP 2022
[Paper] [Code]
- Learning Structural Representations for Recipe Generation and Food Retrieval
Hao Wang, Guosheng Lin, Steven Hoi, Chunyan Miao
TPAMI 2022
[Paper]
- Structure-Aware Generation Network for Recipe Generation from Images
Hao Wang, Guosheng Lin, Steven Hoi, Chunyan Miao
ECCV 2020
[Paper] [Code]
- Learning Cross-Modal Embeddings with Adversarial Networks for Cooking Recipes and Food Images
Hao Wang, Doyen Sahoo, Chenghao Liu, Ee-peng Lim, Steven C. H. Hoi
CVPR 2019
[Paper] [Code]
Services
- Area Chair: ACL ARR
- Conference Reviewer: CVPR, ECCV, ICCV, ACM MM, NeurIPS, ICLR, AAAI
- Journal Reviewer: IEEE TPAMI, IJCV, TNNLS, TMM, TCSVT
Teaching
- Introduction to Computer Vision, Spring 2026
- Multimodal Artificial Intelligence, Spring 2026
- Introduction to Computer Science, Fall 2025
- Multimodal Artificial Intelligence, Spring 2025
- Introduction to Computer Science, Fall 2024
- Artificial Intelligence Seminar, Fall 2024
- Multimodal Artificial Intelligence, Spring 2024
