Open X-Embodiment: Robotic Learning Datasets and RT-X Models

Open X-Embodiment was released on October 13, 2023 by the Open X-Embodiment Collaboration, an effort led from Google DeepMind that pulled together more than 290 authors across 21 institutions. The project tackled a structural problem in robot learning: every lab collected its own data on its own robot, so datasets could not be combined the way image and text corpora are. The collaboration standardized and pooled data from 22 different robot embodiments, covering 527 skills and over 160,000 distinct tasks, into a single open dataset.

On top of this dataset the team trained RT-X, high-capacity generalist policies adapted from the RT-1 and RT-2 architectures. The key empirical result was positive transfer across embodiments: a single model trained on the combined multi-robot data outperformed models trained on any one robot’s data alone, meaning experience gathered on one platform improved performance on others. This is the robotics analogue of the lesson from NLP and vision, where pretraining on diverse data beats narrow task-specific training.

Open X-Embodiment became core infrastructure for the field. Later open generalist models including Octo and OpenVLA were trained on it, and it gave robot learning a shared benchmark dataset comparable to ImageNet’s role in computer vision - a common substrate that many groups could build on rather than each starting from zero.

Open X-Embodiment: Robotic Learning Datasets and RT-X Models

Sources

Related