HOU Yuenan (侯跃南)

Yuenan Hou is a Researcher at Shanghai AI Laboratory, led by Dr. Xiao Sun and Prof. Yu Qiao. He obtained his Ph.D. degree from the Chinese University of Hong Kong (Multimedia Lab) in August 2021, under the supervision of Prof. Chen Change Loy and Prof. Xiaoou Tang. He received his B.E. degree from Nanjing University in July 2017, supervised by Prof. Chunlin Chen and Prof. Xianzhong Zhou. His research interests include 3D Perception, Embodied AI and Large Vision Models.

If you are interested in research intern/engineer positions, joint-training PhD programs, or academic/industrial collaboration at Shanghai AI Lab, please send an e-mail to houyuenan[at]pjlab.org.cn.

Email / Google Scholar / GitHub

News
  • One paper is accepted by ISPRS (IF=12.7). (2024-09)

  • One paper is accepted by TPAMI (IF=20.8). (2024-08)

  • One paper is accepted by ECCV. (2024-07)

  • Two papers are accepted by CVPR. (2024-02)

  • Two papers are accepted by AAAI, including one Oral. (2024-01)

Education

Aug. 2017 - Aug. 2021, Information Engineering, the Chinese University of Hong Kong

Doctor of Philosophy

Sep. 2013 - Jul. 2017, Automation, Nanjing University

Bachelor of Engineering

Work Experience

Jan. 2023 - now, work at Shanghai AI Lab (OpenGVLab)

Aug. 2021 - Dec. 2022, work at Shanghai AI Lab (ADG)

Apr. 2020 - May 2020, internship at SenseTime

Mar. 2017 - May 2017, internship at SenseTime

Publications

*: Corresponding Author(s)

Vision-Centric BEV Perception: A Survey
Yuexin Ma, Tai Wang, Xuyang Bai, Huitong Yang, Yuenan Hou, Yaming Wang, Yu Qiao, Ruigang Yang, Dinesh Manocha, Xinge Zhu
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
[pdf] [code]

Predicting Gradient is Better: Exploring Self-Supervised Learning for SAR ATR with a Joint-Embedding Predictive Architecture
Weijie Li, Wei Yang, Tianpeng Liu, Yuenan Hou, Yuxuan Li, Zhen Liu, Yongxiang Liu, Li Liu
ISPRS Journal of Photogrammetry and Remote Sensing, 2024
[pdf]

TASeg: Temporal Aggregation Network for LiDAR Semantic Segmentation
Xiaopei Wu, Yuenan Hou*, Xiaoshui Huang*, Binbin Lin, Tong He, Xinge Zhu, Yuexin Ma, Boxi Wu, Haifeng Liu*, Deng Cai, Wanli Ouyang
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024
[pdf]

WildRefer: 3D Object Localization in Large-scale Dynamic Scenes with Multi-modal Visual Data and Natural Language
Zhenxiang Lin, Xidong Peng, Peishan Cong, Yuenan Hou, Xinge Zhu, Sibei Yang, Yuexin Ma
European Conference on Computer Vision (ECCV), 2024
[pdf]

Point Cloud Pre-training with Diffusion Models
Xiao Zheng, Xiaoshui Huang, Guofeng Mei, Yuenan Hou, Zhaoyang Lyu, Bo Dai, Wanli Ouyang, Yongshun Gong
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024
[pdf]

EPCL: Frozen CLIP Transformer is An Efficient Point Cloud Encoder
Xiaoshui Huang, Zhou Huang, Sheng Li, Wentao Qu, Tong He, Yuenan Hou, Yifan Zuo, Wanli Ouyang
AAAI Conference on Artificial Intelligence (AAAI, Oral), 2024
[pdf]

Semi-Supervised 3D Object Detection with PatchTeacher and PillarMix
Xiaopei Wu, Liang Peng, Liang Xie, Yuenan Hou, Binbin Lin, Xiaoshui Huang, Haifeng Liu, Deng Cai, Wanli Ouyang
AAAI Conference on Artificial Intelligence (AAAI), 2024
[pdf]

Advances in 3D Pre-Training and Downstream Tasks: A Survey
Yuenan Hou*, Xiaoshui Huang, Shixiang Tang, Tong He, Wanli Ouyang
Vicinagearth, 2024
[pdf]

CluB: Cluster Meets BEV for LiDAR-Based 3D Object Detection
Yingjie Wang, Jiajun Deng, Yuenan Hou, Yao Li, Yu Zhang, Jianmin Ji, Wanli Ouyang, Yanyong Zhang
Neural Information Processing Systems (NeurIPS), 2023
[pdf]

RangePerception: Taming LiDAR Range View for Efficient and Accurate 3D Object Detection
Yeqi Bai, Ben Fei, Youquan Liu, Tao Ma, Yuenan Hou, Botian Shi, Yikang Li
Neural Information Processing Systems (NeurIPS), 2023
[pdf]

Rethinking Range View Representation for LiDAR Segmentation
Lingdong Kong, Youquan Liu, Runnan Chen, Yuexin Ma, Xinge Zhu, Yikang Li, Yuenan Hou*, Yu Qiao, Ziwei Liu*
International Conference on Computer Vision (ICCV), 2023
[pdf]

UniSeg: A Unified Multi-Modal LiDAR Segmentation Network and the OpenPCSeg Codebase
Youquan Liu, Runnan Chen, Xin Li, Lingdong Kong, Yuchen Yang, Zhaoyang Xia, Yeqi Bai*, Yuexin Ma, Xinge Zhu, Yikang Li*, Yu Qiao, Yuenan Hou*
International Conference on Computer Vision (ICCV), 2023
[pdf] [code]

Human-centric Scene Understanding for 3D Large-scale Scenarios
Yiteng Xu, Peishan Cong, Yichen Yao, Runnan Chen, Yuenan Hou, Xinge Zhu, Xuming He, Jingyi Yu, Yuexin Ma
International Conference on Computer Vision (ICCV), 2023
[pdf]

See More and Know More: Zero-shot Point Cloud Segmentation via Multi-modal Visual Data
Yuhang Lu, Qi Jiang, Runnan Chen, Yuenan Hou, Xinge Zhu, Yuexin Ma
International Conference on Computer Vision (ICCV), 2023
[pdf]

SCPNet: Semantic Scene Completion on Point Cloud
Zhaoyang Xia, Youquan Liu, Xin Li, Xinge Zhu, Yuexin Ma, Yikang Li, Yuenan Hou*, Yu Qiao
IEEE Conference on Computer Vision and Pattern Recognition (CVPR, Highlight, top 10%), 2023
[pdf] [code]

CLIP2Scene: Towards Label-efficient 3D Scene Understanding by CLIP
Runnan Chen, Youquan Liu, Lingdong Kong, Xinge Zhu, Yuexin Ma, Yikang Li, Yuenan Hou*, Yu Qiao, Wenping Wang*
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023
[pdf] [code]

LoGoNet: Towards Accurate 3D Object Detection with Local-to-Global Cross-Modal Fusion
Xin Li, Tao Ma, Yuenan Hou, Botian Shi, Yuchen Yang, Youquan Liu, Xingjiao Wu, Qin Chen, Yikang Li, Yu Qiao, Liang He
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023
[pdf] [code]

Network Pruning via Resource Reallocation
Yuenan Hou, Zheng Ma, Chunxiao Liu, Zhe Wang, Chen Change Loy
Pattern Recognition, 2023
[pdf] [code]

Gradient Modulated Contrastive Distillation of Low-Rank Multi-Modal Knowledge for Disease Diagnosis
Xiaohan Xing, Zhen Chen, Yuenan Hou, Yixuan Yuan
Medical Image Analysis (MedIA), 2023
[pdf]

How to Tame Mobility in Federated Learning over Mobile Networks?
Yan Peng, Xiaogang Tang, Yiqing Zhou, Yuenan Hou, Jintao Li, Yanli Qi, Ling Liu, Hai Lin
IEEE Transactions on Wireless Communications (TWC), 2023
[pdf]

Point-to-Voxel Knowledge Distillation for LiDAR Semantic Segmentation
Yuenan Hou, Xinge Zhu, Yuexin Ma, Chen Change Loy, Yikang Li
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022
[pdf] [code]

Ranked 1st in the Waymo 2022 3D Semantic Segmentation Challenge and the SemanticKITTI LiDAR Semantic Segmentation Challenge (single-scan)!

Homogeneous Multi-modal Feature Fusion and Interaction for 3D Object Detection
Xin Li, Botian Shi, Yuenan Hou, Xingjiao Wu, Tianlong Ma, Yikang Li, Liang He
European Conference on Computer Vision (ECCV), 2022
[pdf] [code]

Mind the Gap in Distilling StyleGANs
Guodong Xu, Yuenan Hou, Ziwei Liu, Chen Change Loy
European Conference on Computer Vision (ECCV), 2022
[pdf] [code]

STCrowd: A Multimodal Dataset for Pedestrian Perception in Crowded Scenes
Peishan Cong, Xinge Zhu, Feng Qiao, Yiming Ren, Xidong Peng, Yuenan Hou, Lan Xu, Ruigang Yang, Dinesh Manocha, Yuexin Ma
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022
[pdf] [code]

Discrepancy and Gradient-guided Multi-modal Knowledge Distillation for Pathological Glioma Grading
Xiaohan Xing, Zhen Chen, Meilu Zhu, Yuenan Hou, Zhifan Gao, Yixuan Yuan
International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI), 2022
[pdf] [code]

Categorical Relation-Preserving Contrastive Knowledge Distillation for Medical Image Classification
Xiaohan Xing, Yuenan Hou, Hang Li, Yixuan Yuan, Hongsheng Li, Max Q.-H. Meng
International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI), 2021
[pdf] [code]

Inter-Region Affinity Distillation for Road Marking Segmentation
Yuenan Hou, Zheng Ma, Chunxiao Liu, Tak-Wai Hui, Chen Change Loy
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020
[pdf] [code]

Learning Lightweight Lane Detection CNNs by Self Attention Distillation
Yuenan Hou, Zheng Ma, Chunxiao Liu, Chen Change Loy
International Conference on Computer Vision (ICCV), 2019
[pdf] [code]

One of the most influential papers in the lane detection field!

Learning to Steer by Mimicking Features from Heterogeneous Auxiliary Networks
Yuenan Hou, Zheng Ma, Chunxiao Liu, Chen Change Loy
AAAI Conference on Artificial Intelligence (AAAI, Oral), 2019
[pdf] [project page] [code]

A Novel DDPG Method with Prioritized Experience Replay
Yuenan Hou, Lifeng Liu, Qing Wei, Xudong Xu, Chunlin Chen
IEEE International Conference on Systems, Man, and Cybernetics (SMC), 2017
[pdf] [code]

Preprint

OccMamba: Semantic Occupancy Prediction with State Space Models
Heng Li, Yuenan Hou*, Xiaohan Xing, Xiao Sun, Yanyong Zhang*
arXiv preprint arXiv:2408.09859, 2024
[pdf]

SARATR-X: A Foundation Model for Synthetic Aperture Radar Images Target Recognition
Weijie Li, Wei Yang, Yuenan Hou, Li Liu, Yongxiang Liu, Xiang Li
arXiv preprint arXiv:2405.09365, 2024
[pdf]

NeRF-Det++: Incorporating Semantic Cues and Perspective-aware Depth Supervision for Indoor Multi-View 3D Detection
Chenxi Huang, Yuenan Hou*, Weicai Ye, Di Huang, Xiaoshui Huang, Binbin Lin*, Deng Cai, Wanli Ouyang
arXiv preprint arXiv:2402.14464, 2024
[pdf] [code]

A Comprehensive Survey on 3D Content Generation
Jian Liu, Xiaoshui Huang, Tianyu Huang, Lu Chen, Yuenan Hou, Shixiang Tang, Ziwei Liu, Wanli Ouyang, Wangmeng Zuo, Junjun Jiang, Xianming Liu
arXiv preprint arXiv:2402.01166, 2024
[pdf] [code]

Uni3D-LLM: Unifying Point Cloud Perception, Generation and Editing with Large Language Models
Dingning Liu, Xiaoshui Huang, Yuenan Hou, Zhihui Wang, Zhenfei Yin, Yongshun Gong, Peng Gao, Wanli Ouyang
arXiv preprint arXiv:2402.03327, 2024
[pdf]

Self-Supervised Learning for SAR ATR with a Knowledge-Guided Predictive Architecture
Weijie Li, Yang Wei, Tianpeng Liu, Yuenan Hou, Yongxiang Liu, Li Liu
arXiv preprint arXiv:2311.15153, 2023
[pdf]

Clothes-Invariant Feature Learning by Causal Intervention for Clothes-Changing Person Re-identification
Xulin Li, Yan Lu, Bin Liu, Yuenan Hou, Yating Liu, Qi Chu, Wanli Ouyang, Nenghai Yu
arXiv preprint arXiv:2305.06145, 2023
[pdf]

OpenPCSeg: An Open Source Point Cloud Segmentation Codebase
Youquan Liu, Yeqi Bai, Lingdong Kong, Runnan Chen, Yuenan Hou, Botian Shi, Yikang Li
Good performance on SemanticKITTI, ScribbleKITTI and Waymo.
[code]

ICPV: Deep Fusion of Different Point Cloud Representations
Hao Tian, Yuenan Hou, Huijie Wang, Youquan Liu, Jiawei Li, Xinge Zhu, Wenkang Qin, Junchao Gong, Yang Li, Kai Li
1st Place Solution for 3D Semantic Segmentation Track in Waymo Open Dataset Challenge, 2022
[pdf] [code]

Agnostic Lane Detection
Yuenan Hou
arXiv preprint arXiv:1905.03704, 2019
[pdf] [code]

Academic Services
  • Area chair of PRCV 2023

  • Conference reviewer for CVPR, ICCV, ECCV, ICLR, NeurIPS, AAAI, IJCAI, WACV

  • Journal reviewer for TPAMI, IJCV, TIP, Neurocomputing, T-IV, RA-L, TCSVT, IET, Advanced Robotics, TGRS, JSTARS

  • Invited talk at Zhidongxi Open Classes, "Advances and Challenges of 3D Foundation Models in the Autonomous Driving Field", 2023

  • Invited talk at School of Computer Science and Technology, East China Normal University, "Advances and Challenges of 3D Foundation Models in the Autonomous Driving Field", 2023

  • Invited talk at School of Information Science and Technology, ShanghaiTech University, "Recent Advances and Challenges of 3D Perception Tasks", 2022

  • Invited talk at School of Computer Science, Wuhan University, "Improving Deep Network Performance via Model Compression", 2021 [ppt pdf]

  • Invited talk at School of Cyber Science and Engineering, Wuhan University, "Improving Deep Network Performance via Model Compression", 2021

  • Invited talk at School of Artificial Intelligence, Sun Yat-sen University, "Improving Deep Network Performance via Model Compression", 2020

Honors and Awards
  • Ranked 1st in SemanticKITTI LiDAR Semantic Segmentation Challenge (multi-scan), 1/50, Apr. 2023

  • Visiting Scholar, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Mar. 2023

  • Ranked 1st in nuScenes LiDAR Semantic Segmentation Challenge, 1/25, the "UniSeg" entry, Oct. 2022

  • Ranked 1st in Waymo 3D Object Detection Challenge, 1/100, the "LoGoNet_Ens" entry, Oct. 2022

  • Ranked 1st in SemanticKITTI Semantic Scene Completion Challenge, 1/19, the "SCPNet" entry, Oct. 2022

  • Ranked 1st in SemanticKITTI Panoptic Segmentation Challenge, 1/30, the "UniSeg" entry, Jul. 2022

  • Ranked 1st in SemanticKITTI 4D Panoptic Segmentation Challenge, 1/7, the "UniSeg" entry, Jul. 2022

  • Ranked 1st in SemanticKITTI LiDAR Semantic Segmentation Challenge (single-scan), 1/300, the "Point-Voxel-KD" entry, Jun. 2022

  • Ranked 1st in Waymo 3D Semantic Segmentation Challenge, 1/20, the "Cylinder3D" entry, May 2022

  • Shanghai Leading Talent Program, Shanghai Government, Dec. 2021

  • Shanghai Specially-Invited Expert, Shanghai Government, Dec. 2021

  • Ranked 2nd in SemanticKITTI LiDAR Semantic Segmentation Challenge (multi-scan), 2/100, the "PVKD" entry, Dec. 2021

  • Ranked 1st in ApolloScape Lane Segmentation Challenge, Baidu, 1/104, the "Codes-for-IntRA-KD" entry, Sep. 2020

  • Postgraduate Scholarship, the Chinese University of Hong Kong, 2017 - 2021

  • Third Prize of Jiangsu Province for Undergraduate Thesis, Nanjing University, 2017 [paper pdf] [awards list]

  • Zhenggang Scholarship, Nanjing University, 2017

  • Mathematical Contest in Modeling (MCM), Meritorious Winner (13%), 2016 [paper pdf]

  • Liao's Scholarship, Nanjing University, 2015

  • National Scholarship, Nanjing University, 2014