Gaussian Splatting as SE(3) Equivariant Features for Imitation Learning

Research Intership @ SNURPM
Sanghyun Hahn, Taekyun Ha, Inhee Lee

In this project, we achieve one-shot imitation learning of robotic manipulations with Gaussian Splatting. The key idea is that the Gaussian Features obtained from Gaussian Splatting can be as features for Iterative Closest Points. Pointclouds have 3 information: (x,y,z). Gaussian Splats, in contrast, carry the position (x,y,z), the standard deviations, and the surface normal vectors: thats 3 times more information. Using this information, we can align different objects within the same category (which was impossible without the information from the Gaussians). For example if we have two different cups with one handle, ICP matching with Gaussians would align the handle to the handle. Our task is: We have cup A, and a demonstration of grasping that cup A. We want to know how to grasp a new cup B. Using the obtained SE(3) transform from Gaussian ICP, we transform the demonstration so if we playback the transformed demonstration, it exactly grasps the cup B where it grasped cup A. (by the handle) This project was initially aiming for a submission for ICRA 2025, but before doing the real-world experiment I found out that the Gaussian ICP was already a proposed method in the paper RGBD GS ICP SLAM of ECCV 2024. While that paper used G-ICP to find correspondences between the map, the idea to treat Gaussians as feature descriptors is the shared. Therefore, I left it as a project paper for one of the graduate classes. From this experience, I learned the importance of literature reviews to my heart. Check the pdf for more information.