I build machine learning, computer vision, and robotics systems that close the loop between perception and action. If you're exploring robotics projects that need strong vision capabilities, feel free to drop me a line—I'm always open to collaboration.
We live in a dynamic 4D world, constantly seeing, understanding, and interacting with objects and environments. My goal is to bridge the gap between reconstructed virtual worlds and practical robotic applications. I develop methods that help robots perceive, understand, and interact with objects and environments from sparse observations.
The first approach that enables free-view synthesis for 4D dynamic scene reconstruction under sparse-view capture.
AirShot: Efficient Few-Shot Detection for Autonomous Exploration Zihan Wang,
Bowen Li,
Chen Wang,
Sebastian Scherer*
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2024
paper /
project page /
code
Class-agnostic relations for few-shot detection without fine-tuning, enabling fast and efficient field deployment.
A general framework that learns whole-body humanoid-object interaction skills directly from monocular RGB videos.
Learning with Noisy Foundation Models Hao Chen,
Zihan Wang,
Jindong Wang,
Ran Tao,
Hongxin Wei,
Xing Xie,
Masashi Sugiyama,
Bhiksha Raj
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
paper /
project page
Characterizes noise in large-scale pre-training data and proposes strategies to mitigate its impact on downstream tasks.
Identity Preservation for Counterfactual Estimation Zihan Wang
Bachelor Thesis, 2023
Cross-sectional counterfactual estimation conditioned on disentangled identity-preserving features.
ONLS: Optimal Noise Level Search in Diffusion Autoencoders Without Fine-Tuning Zihan Wang
International Conference on Learning Representations (ICLR) Workshop, 2024
A simple, effective, fine-tuning-free strategy to pick the optimal diffusion depth for each sample.
PyPose: A Library for Robot Learning with Physics-based Optimization PyPose Team
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022 & IEEE/RSJ IROS Workshop, 2023
paper /
extension /
project page /
code /
video
Bridges classic robotics and modern learning with a physics-aware optimization toolkit.
Projects
A Style-Preserved Motion Controller for Simulated Humanoid Characters
Robot Learning
report
Clusters motion data, trains latent variable models, and deploys a style-aware high-level controller for multi-task behaviors.