Biography

I am a Member of Technical Staff at Amazon FAR (Frontier AI & Robotics).

I received my Master’s degree from the Robotics Institute at Carnegie Mellon University advised by Guanya Shi, and Bachelor’s degree from Shanghai Jiao Tong University advised by Hesheng Wang and Zhongqiang Ren. I also interned at Tsinghua University (IIIS) advised by Huazhe Xu. During my undergraduate career, I led the programming group at SJTU VEX Robotics Club.

Current Focus: Scalable learning systems for robots to perceive, reason, and act reliably in the real world.

Robots: I have worked on many types of robots, including humanoids, mobile manipulators, and aerial robots.

News

Experience

Applied Scientist Intern May 2025 - Feb. 2026 Amazon Frontier AI & Robotics Team Advisor: Prof. Pieter Abbeel and Dr. Rocky Duan
Research Assistant Oct. 2024 - May 2026 LeCAR Lab, CMU RI Advisor: Prof. Guanya Shi
Research Assistant Jan. 2024 - Jul. 2024 Tsinghua Embodied AI Lab (TEA Lab) Advisor: Prof. Huazhe Xu

Education

Carnegie Mellon University Aug. 2024 - May 2026 M.S. in Robotic Systems Development
Shanghai Jiao Tong University Sep. 2019 - Jun. 2023 B.Eng. in Automation

Research

(* denotes equal contributions, indicates equal advising)

Humanoids

RPL: Learning Robust Humanoid Perceptive Locomotion over Challenging Terrains
Yuanhang Zhang, Younggyo Seo, Juyue Chen, Yifu Yuan, Koushil Sreenath, Pieter Abbeel, Carmelo Sferrazza, Karen Liu, Rocky Duan, Guanya Shi

In Submission

▶ TL;DR
RPL enables robust humanoid perceptive locomotion through a unified multi-depth policy that handles challenging terrains (slopes, stairs, stepping stones), multi-directional movements, payloads.
RPL: Learning Robust Humanoid Perceptive Locomotion over Challenging Terrains

Perceptive Humanoid Parkour: Chaining Dynamic Human Skills via Motion Matching
Zhen Wu*, Xiaoyu Huang*, Lujie Yang*, Yuanhang Zhang, Koushil Sreenath, Xi Chen, Pieter Abbeel, Rocky Duan, Angjoo Kanazawa, Carmelo Sferrazza, Guanya Shi, Karen Liu

RSS 2026

▶ TL;DR
PHP enables agile and long-horizon humanoid parkour with depth perception and motion matching for chaining skills.
Perceptive Humanoid Parkour: Chaining Dynamic Human Skills via Motion Matching

FALCON: Learning Force-Adaptive Humanoid Loco-Manipulation
Yuanhang Zhang, Yifu Yuan, Prajwal Gurunath, Ishita Gupta, Shayegan Omidshafiei, Ali-akbar Agha-mohammadi, Marcell Vazquez-Chanlatte, Liam Pedersen, Tairan He, Guanya Shi

L4DC 2026 (Oral)

▶ TL;DR
FALCON empowers humanoids with robust locomotion and precise manipulation under significant, unknown 3D end-effector forces, via a novel dual-agent force-adaptive RL framework.
FALCON: Learning Force-Adaptive Humanoid Loco-Manipulation

Hold My Beer: Learning Gentle Humanoid Locomotion and End-Effector Stabilization Control
Yitang Li, Yuanhang Zhang, Wenli Xiao, Chaoyi Pan, Haoyang Weng, Guanqi He, Tairan He, Guanya Shi

CoRL 2025

▶ TL;DR
A slow-fast dual-agent RL framework achieving human-level end-effector stability for humanoids.
Hold My Beer

ASAP: Aligning Simulation and Real-World Physics for Learning Agile Humanoid Whole-Body Skills
Tairan He*, Jiawei Gao*, Wenli Xiao*, Yuanhang Zhang*, Zi Wang, Jiashun Wang, Zhengyi Luo, Guanqi He, Nikhil Sobanbab, Chaoyi Pan, Zeji Yi, Guannan Qu, Kris Kitani, Jessica Hodgins, Linxi "Jim" Fan, Yuke Zhu, Changliu Liu, Guanya Shi

RSS 2025

▶ TL;DR
ASAP learns agile whole-body humanoid motions via learning a residual action model from the real world to align sim-to-real physics and achieve athletic motions.
ASAP

Mobile Manipulation

Catch It! Learning to Catch in Flight with Mobile Dexterous Hands
Yuanhang Zhang*, Tianhai Liang*, Zhenyang Chen, Yanjie Ze, Huazhe Xu

ICRA 2025CoRL 2024 Workshop (Best Paper Nomination)

▶ TL;DR
We build a mobile manipulator with a dexterous hand, and leverage reinforcement learning to train a whole-body control policy for the robot to catch diverse objects randomly thrown by humans.
Catch It! Learning to Catch in Flight with Mobile Dexterous Hands

ViTaS: Visual Tactile Soft Fusion Contrastive Learning for Reinforcement Learning
Yufeng Tian*, Shuiqi Cheng*, Tianming Wei, Tianxing Zhou, Yuanhang Zhang, Zixian Liu,
Zhecheng Yuan, Huazhe Xu

ICRA 2026

▶ TL;DR
Introduced ViTaS, a framework using using Soft Fusion Contrastive Learning and a CVAE module to fuse visual and tactile information, achieving state-of-the-art performance with high sample efficiency and low parameter count in robotic manipulation.
ViTaS: Visual Tactile Soft Fusion Contrastive Learning for Reinforcement Learning

Aerial Manipulation / Drones

Flying Hand: End-Effector-Centric Framework for Versatile Aerial Manipulation Teleoperation and Policy Learning
Guanqi He*, Xiaofeng Guo*, Luyi Tang, Yuanhang Zhang, Mohammadreza Mousaei, Jiahe Xu, Junyi Geng, Sebastian Scherer, Guanya Shi

RSS 2025

▶ TL;DR
A unified aerial manipulation framework that enhances precision and versatility through an end-effector-centric interface.
Flying Hand

Perception-constrained Visual Servoing Based NMPC for Quadrotor Flight

Undergraduate Thesis

▶ TL;DR
Incorporated quadrotor dynamics and visual feature dynamics into NMPC to enable the quadrotor to flight purely based on visual information without localization.
Perception-constrained Visual Servoing Based NMPC for Quadrotor Flight

Multi-Agent Path Planning

Multi-Agent Combinatorial Path Finding with Heterogeneous Task Duration
Yuanhang Zhang, Xuemian Wu, Hesheng Wang, Zhongqiang Ren

SoCS 2024

▶ TL;DR
Proposed two conflict-based search methods--CBSS-TPG (conflict-free) and CBSS-D (conflict-free and minimal cost) to solve the multi-agent combinatorial path finding problem with target duration.
Multi-Agent Combinatorial Path Finding with Heterogeneous Task Duration


Competitions

UAV Intelligent Perception Technology Competition Autonomous drone racing and perception
Shanghai, China
2022.09 - 2022.11
Team Lead National Third Prize (Top 10%)
UAV racing demo 1 UAV racing demo 2
National University IOT Design Competition HarClass intelligent classroom system
Shanghai, China
2022.06 - 2022.09
Team Lead National First Prize & Harmony Innovation Award (Top 1%) Video
HarClass interface HarClass system
National University ICT Competition (Innovation) Edge intelligence and autonomous vehicle deployment
Shanghai, China
2021.09 - 2022.01
Team Lead National Second Prize (Top 5%) Video
ICT competition project

Leadership

SJTU-VEX Programming Team Leader
Shanghai, China
2020.03 - 2022.07
Season Reveal Video Tournament Champions Skills Champion Excellence Award
  • 2021 National VEX Robotics Competition: Tournament Champions & Skills Champion (World Record)
  • 2021 VEX Robotics Competition Asian Open: Tournament Champions VEXU; Excellence Award
  • 2021 VEX Robotics Competition China Final: Tournament Champions VEXU; Excellence Award
SJTU VEX team photo SJTU VEX robot SJTU VEX competition

Awards

  • Outstanding Paper Nomination in LFDM Workshop at CoRL 2024
  • SJTU Outstanding Graduate (top 3%)
  • SJTU Merit Student (top 3%)
  • SJTU Academic Progress Scholarship

Reviewer Service

IROS 2026 IEEE/RSJ International Conference on Intelligent Robots and Systems
CoRL 2025 Conference on Robot Learning
ICRA 2025 IEEE International Conference on Robotics and Automation
IROS 2025 IEEE/RSJ International Conference on Intelligent Robots and Systems