GR00T (Project GR00T)

First open humanoid VLA. GR00T N1 opened the era of open foundation models for humanoid robots.
Practical Dual-System architecture. A representative example of human cognition-inspired System 1/2 separation applied to real robot VLA.
Demonstrated synthetic data potential. Showed the viability of a training pipeline that reduces teleop dependency through Isaac Sim physics simulation + Neural Trajectory generation.

Overview

GR00T N1 Architecture

GR00T N1 Architecture: System 2 (VLM) + System 1 (DiT) Dual-System Structure

Inspired by human cognition principles (Kahneman, 2011):

System	Role	Implementation
System 2 (Slow)	Environment understanding, planning	Vision-Language Model
System 1 (Fast)	Convert plans to precise motions	Diffusion Transformer