Zihan Wang

Robotics Institute, Carnegie Mellon University

I'm a Master's student in Computer Vision (MSCV) at the Robotics Institute, School of Computer Science, Carnegie Mellon University, advised by Prof. Deva Ramanan. My research focuses on 4D reconstruction and its applications to robotics.

Prior to CMU, I obtained my B.Eng. with Honours in Electrical and Electronic Engineering from the University of Edinburgh, where I worked on counterfactual estimation with the Causality in Healthcare AI Hub. During my undergraduate years, I also spent time at AIRLab working with Prof. Sebastian Scherer and Prof. Chen Wang.

I build machine learning, computer vision, and robotics systems that close the loop between perception and action. If you're exploring robotics projects that need strong vision capabilities, feel free to drop me a line—I'm always open to collaboration.

Email  /  Google Scholar  /  GitHub  /  LinkedIn  /  X

Zihan Wang
Research

We live in a dynamic 4D world, constantly seeing, understanding, and interacting with objects and environments. My goal is to bridge the gap between reconstructed virtual worlds and practical robotic applications. I develop methods that help robots perceive, understand, and interact with objects and environments from sparse observations.

Selected Publications
CRISP: Contact-guided Real2Sim from Monocular Video with Planar Scene Primitives
Zihan Wang*, Jiashun Wang*, Jeff Tan, Yiwen Zhao, Jessica Hodgins, Shubham Tulsiani, Deva Ramanan*
In submission
paper (coming soon) / project page / code & data (coming soon)

A real-to-sim framework that turns monocular RGB video into whole-body control across diverse, complex terrains.

MonoFusion: Sparse-View 4D Reconstruction via Monocular Fusion
Zihan Wang, Jeff Tan, Tarasha Khurana, Neehar Peri, Deva Ramanan*
International Conference on Computer Vision (ICCV), 2025
paper / project page / code

The first approach that enables free-view synthesis for 4D dynamic scene reconstruction under sparse-view capture.

AirShot: Efficient Few-Shot Detection for Autonomous Exploration
Zihan Wang, Bowen Li, Chen Wang, Sebastian Scherer*
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2024
paper / project page / code

Class-agnostic relations for few-shot detection without fine-tuning, enabling fast and efficient field deployment.

HDMI: Learning Interactive Humanoid Whole-Body Control from Human Videos
Haoyang Weng, Yitang Li, Nikhil Sobanbabu, Zihan Wang, Zhengyi Luo, Tairan He, Deva Ramanan, Guanya Shi*
In submission
paper / project page / code

A general framework that learns whole-body humanoid-object interaction skills directly from monocular RGB videos.

Learning with Noisy Foundation Models Learning with Noisy Foundation Models
Hao Chen, Zihan Wang, Jindong Wang, Ran Tao, Hongxin Wei, Xing Xie, Masashi Sugiyama, Bhiksha Raj
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
paper / project page

Characterizes noise in large-scale pre-training data and proposes strategies to mitigate its impact on downstream tasks.

Identity Preservation for Counterfactual Estimation Identity Preservation for Counterfactual Estimation
Zihan Wang
Bachelor Thesis, 2023

Cross-sectional counterfactual estimation conditioned on disentangled identity-preserving features.

ONLS ONLS: Optimal Noise Level Search in Diffusion Autoencoders Without Fine-Tuning
Zihan Wang
International Conference on Learning Representations (ICLR) Workshop, 2024

A simple, effective, fine-tuning-free strategy to pick the optimal diffusion depth for each sample.

PyPose PyPose: A Library for Robot Learning with Physics-based Optimization
PyPose Team
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022 & IEEE/RSJ IROS Workshop, 2023
paper / extension / project page / code / video

Bridges classic robotics and modern learning with a physics-aware optimization toolkit.

Projects
A Style-Preserved Motion Controller for Simulated Humanoid Characters
Robot Learning
report

Clusters motion data, trains latent variable models, and deploys a style-aware high-level controller for multi-task behaviors.

PyPose PyPose: Physics-based Optimization for Robot Learning
paper / extension / project page / code

Contributed physics-aware constrained optimization modules for trajectory planning and motion control.

MONAI MONAI: Medical Open Network for Artificial Intelligence
project page / code / video

Developed diffusion autoencoder components for medical imaging, with a focus on MRI generation and reconstruction.

Awards
  • Edinburgh Award (Research Experience), 2022
  • Edinburgh Scholarship (£4,000), 2022
  • Edinburgh Scholarship (£4,000), 2021
  • QIANJIANG ELECTRIC Scholarship (£3,000), 2020
Service

Academic Reviewer

  • International Conference on Learning Representations (ICLR)
  • European Conference on Computer Vision (ECCV)
  • IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
  • IEEE Transactions on Image Processing (TIP)
  • IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)
Advisors & Mentors
Deva Ramanan
Prof. Deva Ramanan @ CMU RI
Sebastian Scherer
Prof. Sebastian Scherer @ CMU RI
Chen Wang
Prof. Chen Wang @ University at Buffalo
Sotirios Tsaftaris
Prof. Sotirios Tsaftaris @ University of Edinburgh
Contact
  • Email: zihanwa3@cs.cmu.edu
  • Phone: +1-412-589-8046
  • Address: EDSH, Carnegie Mellon University, Pittsburgh, PA 15213, USA
Visitors

I am confident that my "boundaryless", "website-stalking" friends have contributed to at least 74.47% of the traffic here.


Template adapted from Qitao Zhao (credit to Yijia Weng & Jon Barron).
Last updated: February 2025