Publications
Conference/Journal Papers
-  MultiGO: Towards Multi-level Geometry Learning for Monocular 3D Textured Human Reconstruction 
 Gangjian Zhang, Nanjie Yao, Shunsi Zhang, Hanfeng Zhao, Guoliang Pang, Jian Shu, Hao Wang*. (* denotes corresponding author)
 IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2025)
 
-  StyleStudio: Text-Driven Style Transfer with Selective Control of Style Elements 
 Mingkun Lei, Xue Song, Beier Zhu, Hao Wang, Chi Zhang.
 IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2025)
 
-  RGB-Only Gaussian Splatting SLAM for Unbounded Outdoor Scenes 
 Sicheng Yu, Chong Cheng, Yifan Zhou, Xiaojun Yang, Hao Wang*. (* denotes corresponding author)
 International Conference on Robotics and Automation (ICRA 2025)
 
-  SCA3D: Enhancing Cross-modal 3D Retrieval via 3D Shape and Caption Paired Data Augmentation 
 Junlong Ren, Hao Wu, Hui Xiong, Hao Wang*. (* denotes corresponding author)
 International Conference on Robotics and Automation (ICRA 2025)
 
-  Graph-Guided Scene Reconstruction from Images with 3D Gaussian Splatting [Paper] 
 Chong Cheng, Gaochao Song, Yiyang Yao, Gangjian Zhang, Qinzheng Zhou, Hao Wang*. (* denotes corresponding author)
 International Conference on Learning Representations (ICLR 2025)
 
-  DVM: Towards Controllable LLM Agents in Social Deduction Games [Paper] 
 Zheng Zhang, Yihuai Lan, Yangsen Chen, Lei Wang, Xiang Wang, Hao Wang*. (* denotes corresponding author)
 International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2025)
 
-  Diversified Augmentation with Domain Adaptation for Debiased Video Temporal Grounding [Paper] 
 Junlong Ren, Gangjian Zhang, Haifeng Sun, Hao Wang*. (* denotes corresponding author)
 International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2025)
 
-  BrainVis: Exploring the Bridge between Brain and Visual Signals via Image Reconstruction [Paper] 
 Honghao Fu, Hao Wang*, Jing Jih Chin, Zhiqi Shen. (* denotes corresponding author)
 International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2025)
 
-  GVKF: Gaussian Voxel Kernel Functions for Highly Efficient Surface Reconstruction in Open Scenes [Paper] 
 Gaochao Song, Chong Cheng, Hao Wang*. (* denotes corresponding author)
 Conference on Neural Information Processing Systems (NeurIPS 2024)
 
-  LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay [Paper] [Code] 
 Yihuai Lan, Zhiqiang Hu, Lei Wang, Yang Wang, Deheng Ye, Peilin Zhao, Ee-Peng Lim, Hui Xiong, Hao Wang*. (* denotes corresponding author)
 The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024 Main)
 
-  HMR-Adapter: A Lightweight Adapter with Dual-Path Cross Augmentation for Expressive Human Mesh Recovery [Paper] 
 Wenhao Shen, Wanqi Yin, Hao Wang*, Chen Wei, Zhongang Cai, Lei Yang, Guosheng Lin. (* denotes corresponding author)
 ACM International Conference on Multimedia (ACM MM-2024)
 
-  Learning Temporal Variations for 4D Point Cloud Segmentation [Paper] 
 Hanyu Shi, Jiacheng Wei, Hao Wang, Fayao Liu, Guosheng Lin.
 International Journal of Computer Vision (IJCV-2024) [IF:19.5]
 
-  ManiCLIP: Multi-Attribute Face Manipulation from Text [Paper] [Code] 
 Hao Wang, Guosheng Lin, Ana García del Molino, Anran Wang, Jiashi Feng, Zhiqi Shen.
 International Journal of Computer Vision (IJCV-2024) [IF:19.5]
 
-  COM3D: Leveraging Cross-View Correspondence and Cross-Modal Mining for 3D Retrieval [Paper] 
 Hao Wu, Ruochong Li, Hao Wang*, Hui Xiong. (* denotes corresponding author)
 IEEE International Conference on Multimedia and Expo (ICME-2024 Oral)
 
-  TAPS3D: Text-Guided 3D Textured Shape Generation from Pseudo Supervision [Paper] [Code] 
 Jiacheng Wei*, Hao Wang*, Jiashi Feng, Guosheng Lin, Kim-Hui Yap. (* denotes equal contributions)
 IEEE Conference on Computer Vision and Pattern Recognition (CVPR-2023)
 
-  Smart Decision-Support System for Pig Farming [Paper] [Data] 
 Hao Wang, Boyang Li, Haoming Zhong, Ahong Xu, Yingjie Huang, Jingfu Zou, Yuanyuan Chen, Pengcheng Wu, Yiqiang Chen, Cyril Leung, Chunyan Miao.
 Drones (2022) [IF:5.532]
 
-  Cross-Modal Graph with Meta Concepts for Video Captioning [Paper] [Code] 
 Hao Wang, Guosheng Lin, Steven Hoi, Chunyan Miao.
 IEEE Transactions on Image Processing (TIP-2022) [IF:11.041]
 
-  Paired Cross-Modal Data Augmentation for Fine-Grained Image-to-Text Retrieval [Paper] 
 Hao Wang, Guosheng Lin, Steven Hoi, Chunyan Miao.
 ACM International Conference on Multimedia (ACM MM-2022)
 
-  Learning Structural Representations for Recipe Generation and Food Retrieval [Paper] 
 Hao Wang, Guosheng Lin, Steven Hoi, Chunyan Miao.
 IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI-2022) [IF:24.314]
 
-  Decomposing Generation Networks with Structure Prediction for Recipe Generation [Paper] 
 Hao Wang, Guosheng Lin, Steven Hoi, Chunyan Miao.
 Pattern Recognition (PR-2022) [IF:8.518]
 
-  Cycle-Consistent Inverse GAN for Text-to-Image Synthesis [Paper] 
 Hao Wang, Guosheng Lin, Steven Hoi, Chunyan Miao.
 ACM International Conference on Multimedia (ACM MM-2021)
 
-  Cross-Modal Food Retrieval: Learning a Joint Embedding of Food Images and Recipes with Semantic Consistency and Attention Mechanism [Paper] 
 Hao Wang, Doyen Sahoo, Chenghao Liu, Ke Shu, Palakorn Achananuparp, Ee-Peng Lim, Steven C. H. Hoi.
 IEEE Transactions on Multimedia (TMM-2021) [IF:8.182]
 
-  Structure-Aware Generation Network for Recipe Generation from Images [Paper] [Code] 
 Hao Wang, Guosheng Lin, Steven Hoi, Chunyan Miao.
 European Conference on Computer Vision (ECCV-2020)
 
-  SpSequenceNet: Semantic Segmentation Network on 4D Point Clouds [Paper] [Code] 
 Hanyu Shi, Guosheng Lin, Hao Wang, Tzu-Yi Hung, Zhenhua Wang.
 IEEE Conference on Computer Vision and Pattern Recognition (CVPR-2020)
 
-  FoodAI: Food Image Recognition via Deep Learning for Smart Food Logging [Paper] 
 Doyen Sahoo, Hao Wang, Shu Ke, Xiongwei Wu, Hung Le, Palakorn Achananuparp, Ee-Peng Lim, Steven Hoi.
 ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD-2019)
 
-  Learning Cross-Modal Embeddings with Adversarial Networks for Cooking Recipes and Food Images [Paper] [Code] 
 Hao Wang, Doyen Sahoo, Chenghao Liu, Ee-Peng Lim, Steven C. H. Hoi.
 IEEE Conference on Computer Vision and Pattern Recognition (CVPR-2019)