Haomeng ZHANG 张皓萌

I am a second-year Computer Science Ph.D. student at Purdue University, advised by Prof. Raymond A. Yeh.

I hold a Master of Science in Computer Science degree from University of Illinois at Urbana-Champaign (UIUC), where I worked with Mr. Yunze Man and Prof. Liangyan Gui on 3D Question Answering and Trajectory Prediction.

I received my B.S.E. degree in Data Science at University of Michigan (UMich) with a minor in Mathematics. I was fortunate to work with Dr. Junming Zhang and Prof. Matthew Johnson-Roberson on 3D point cloud recognition at the UM Ford Center for Autonomous Vehicles (FCAV).

I also hold a B.E. degree in Electrical and Computer Engineering at Shanghai Jiao Tong University (SJTU) (Dual Degree Program).

Email  /  CV  /  Google Scholar  /  Github  /  Twitter

profile photo
Research

I am broadly interested in Computer Vision, Machine Learning and Robotics, with a focus on 3D Vision and Multi-modal Learning. I currently work on problems related to affordance. Previously, I have also worked on the following fields: (i) 3D visual grounding, (ii) 3D visual question answering, (iii) point cloud completion, (iv) pedestrian trajectory prediction. My research goal is to develop computer vision techniques for real-world embodied agents and autonomous systems, enhancing their capability to perceive and reason effectively in uncertain environments.

dlisa Multi-Object 3D Grounding with Dynamic Modules and Language Informed Spatial Attention
Haomeng Zhang, Chiao-An Yang, Raymond A. Yeh
NeurIPS 2024
arXiv / Project Page / Code

Our proposed model D-LISA has a novel vision module that allows for a dynamic number of proposal boxes and extracts features from dynamic viewpoints per scene. Furthermore, we propose a fusion module that is spatially aware with explicit language conditioning.


hyperpc Hyperspherical Embedding for Point Cloud Completion
Junming Zhang, Haomeng Zhang, Ram Vasudevan, Matthew Johnson-Roberson
CVPR 2023
arXiv / Project Page / Code

We propose a hyperspherical module which could be inserted into any existing Encoder-Decoder structure and consistently improve the point cloud completion result in both single-task and multi-task learning.

Teaching

Teaching Assistant of CS 47100 Introduction to Artificial Intelligence, Purdue, Fall 2023, Spring 2024, Fall 2024.
Teaching Assistant of CS 444 Deep Learning for Computer Vision, UIUC, Spring 2023.
Teaching Assistant of CS 441 Applied Machine Learning, UIUC, Fall 2022, Spring 2022, Fall 2021.
Teaching Assistant of VP 140 Physics I, SJTU, Summer 2019.

Service

Conference Reviewer: NeurIPS, ICLR.

Awards & Honors

Top Reviewers, NeurIPS, 2024.
Outstanding Graduate, Shanghai Municipal Education Commission, 2021.
James B. Angell Scholar, University of Michigan, 2021.
National Scholarship (Top 1%), Ministry of Education of China, 2019.
Undergraduate Excellent Scholarship (Top 10%), Shanghai Jiao Tong University, 2018, 2019.

Updated at Sept. 2024. Thanks Jon Barron for this amazing template.