Shuran Song
Home > People > Shuran Song
Profile
| Field | Details |
|---|---|
| Current Position | Assistant Professor, Stanford University |
| Previous | Assistant Professor, Columbia University (2019-2024) |
| PhD | Princeton University |
| Lab | Columbia Artificial Intelligence and Robotics Lab (CAIR) |
Key Contributions
- Diffusion Policy: Applied diffusion to robot action generation, new paradigm for VLA action generation
- 3D Perception: 3D perception research for robotics
- UMI (Universal Manipulation Interface): Universal data collection interface
- Columbia to Stanford Move: Strengthening robot learning research
Research Timeline
PhD & Postdoc (2013-2019)
Princeton -> Postdoc
| Year | Work | Impact |
|---|---|---|
| 2015 | 3D ShapeNets | Early 3D deep learning research |
| 2017 | Semantic Scene Completion | 3D scene understanding |
| 2018 | PhD Graduation |
Columbia University (2019-2024)
CAIR Lab Founding
| Year | Work | Impact |
|---|---|---|
| 2019 | Joined Columbia as Professor | Founded CAIR Lab |
| 2021 | Transporter Networks collaboration | Object rearrangement |
| 2023 | Diffusion Policy | Pioneering robot diffusion research |
| 2024 | UMI | Universal manipulation interface |
Stanford University (2024-present)
Moved to Stanford
| Year | Work | Impact |
|---|---|---|
| 2024 | Moved to Stanford as Professor | |
| 2024- | Continuing robot learning research |
Major Publications
Diffusion for Robotics
- Diffusion Policy (RSS 2023, IJRR 2024) - Pioneering robot diffusion research
- 3D Diffusion Policy (2024)
3D Perception
- 3D ShapeNets (CVPR 2015)
- Semantic Scene Completion (CVPR 2017)
- ScanNet (CVPR 2017)
Robot Manipulation
- UMI (Universal Manipulation Interface, 2024)
- Transporter Networks related research
Key Ideas
Diffusion Policy (2023)
Core: Model robot action generation as a denoising diffusion process
Noise -> ... -> Action sequence
(gradual denoising)
Advantages:
- Handling multimodal action distributions
- High training stability
- Suitable for high-dimensional action spaces
Impact:
- Influenced pi0 (flow matching), Octo (diffusion decoder), and others
- LeRobot default supported model
- New paradigm for robot action generation
UMI (Universal Manipulation Interface, 2024)
Core: Universal robot data collection interface
Features:
- Applicable to various robot platforms
- Low-cost data collection
- Standardized interface
Philosophy & Direction
Research Philosophy
“3D world understanding and robot manipulation are closely connected”
Research Direction Evolution
- 2013-2018: 3D deep learning, scene understanding
- 2019-2022: 3D perception for robotics
- 2023-present: Diffusion for robot learning, manipulation interfaces
Key Collaborations
- Cheng Chi: Diffusion Policy lead author, UMI collaboration
- Toyota Research Institute: Diffusion Policy collaboration
- MIT: Diffusion Policy collaboration
Awards & Recognition
- NSF CAREER Award
- Amazon Research Award
- Google Research Scholar