NEWS

New Apple Study Teaches Robots How to Act by Watching First-Person Videos of Humans

05/22/2025

661

New Apple Study Teaches Robots How to Act by Watching First-Person Videos of Humans

 

In a new paper called “Humanoid Policy ∼ Human Policy,” Apple researchers propose an interesting way to train humanoid robots. And it involves wearing an Apple Vision Pro.

 

Robot see, robot do

 

The project is a collaboration between Apple, MIT, Carnegie Mellon, the University of Washington, and UC San Diego. It explores how first-person footage of people manipulating objects can be used to train general-purpose robot models.

 

In total, the researchers gathered over 25,000 human demonstrations and 1,500 robot demonstrations (a dataset they called PH2D), and fed them into a unified AI policy that could then control a real humanoid robot in the physical world.

 

As the authors explain:

 

  • Training manipulation policies for humanoid robots with diverse data enhances their robustness and generalization across tasks and platforms. However, learning solely from robot demonstrations is labor-intensive and requires expensive teleoperated data collection, which is difficult to scale.
  • This paper investigates a more scalable data source, egocentric human demonstrations, to serve as cross-embodiment training data for robot learning.

 

Their solution? Let humans show the way.

 

Cheaper, faster training

 

To collect the training data, the team developed an Apple Vision Pro app that captures video from the device’s bottom-left camera, and uses Apple’s ARKit to track 3D head and hand motion.

 

However, to explore a more affordable solution, they also 3D-printed a mount to attach a ZED Mini Stereo camera to other headsets, like the Meta Quest 3, offering similar 3D motion tracking at a lower cost.

 

New Apple Study Teaches Robots How to Act by Watching First-Person Videos of Humans

 

The result was a setup that let them record high-quality demonstrations in seconds, a pretty big improvement over traditional robot tele-op methods, which are slower, more expensive, and harder to scale.

 

And here’s one last interesting detail: since people move way faster than robots, the researchers slowed down the human demos by a factor of four during training, just enough for the robot to keep up without needing further adjustments.

 

The Human Action Transformer (HAT)

 

The key to the whole study is the HAT, a model trained on both human and robot demonstrations in a shared format.

 

Instead of splitting the data by source (humans v. robots), HAT learns a single policy that generalizes across both types of bodies, making the system more flexible and data-efficient.

 

In some tests, this shared training approach helped the robot handle more challenging tasks, including ones that it hadn’t seen before, when compared to more traditional methods.

 

New Apple Study Teaches Robots How to Act by Watching First-Person Videos of Humans

 

Overall, the study is pretty interesting and worth checking out if you are into robotics.

 

Is the idea of a house humanoid robot scary, exciting, or pointless to you? Let us know in the comments.

 

Source: 9to5mac

Windows
Mac OS
iOS
Linux
3uTools
Win 64-bit For this device
V9.0 2025-11-11
Download
Win 32-bit For this device
V9.0 2025-11-11
Download
3uTools V3.18
2025-09-18
Please use the 3uTools PC client to install the iOS client:
1、 Install either the Windows or Mac version of 3uTools on your computer
2、 Open the PC client and connect your device to the computer via USB cable
3、 After the connection is successful, wait for the computer to automatically install the mobile app for the device, or locate “Install Mobile App” on the computer and manually click to install.
3uTools
deb file
v3.01 2025-11-20
Download
rpm file
v3.01 2025-11-20
Download
Windows
iOS
Android
3uAirPlayer
Win 64-bit For this device
V6.0.2 2025-11-19
Download
Win 32-bit For this device
V6.0.2 2025-11-19
Download
iOS Device Mirroring (No App Required)
1、 Install 3uAirPlayer on the Windows PC
2、 Open Control Center and select Screen Mirroring
3、 From the list, choose your PC to start mirroring
4、 Or connect your iOS device to the PC via USB to begin mirroring
Scan to get "3uAirPlayer" App