NeurIPS Poster H-InDex: Visual Reinforcement Learning with Hand-Informed Representations for Dexterous Manipulation

Poster

H-InDex: Visual Reinforcement Learning with Hand-Informed Representations for Dexterous Manipulation

Yanjie Ze · Yanjie Ze · Yuyao Liu · Ruizhe Shi · Jiaxin Qin · Zhecheng Yuan · Jiashun Wang · Huazhe Xu

Great Hall & Hall B1+B2 (level 1) #1311

[ Abstract ] [ Project Page ]

[ Paper] [ Poster] [ OpenReview]

Abstract: Human hands possess remarkable dexterity and have long served as a source of inspiration for robotic manipulation. In this work, we propose a human

H

$\textbf{H}$ and-

In

$\textbf{In}$ formed visual representation learning framework to solve difficult

Dex

$\textbf{Dex}$ terous manipulation tasks (

H-InDex

$\textbf{H-InDex}$ ) with reinforcement learning. Our framework consists of three stages:

(i)

$\textit{(i)}$ pre-training representations with 3D human hand pose estimation,

(ii)

$\textit{(ii)}$ offline adapting representations with self-supervised keypoint detection, and

(iii)

$\textit{(iii)}$ reinforcement learning with exponential moving average BatchNorm. The last two stages only modify

0.36

$0.36$ % parameters of the pre-trained representation in total, ensuring the knowledge from pre-training is maintained to the full extent. We empirically study

12

$\textbf{12}$ challenging dexterous manipulation tasks and find that

H-InDex

$\textbf{H-InDex}$ largely surpasses strong baseline methods and the recent visual foundation models for motor control. Code and videos are available at https://yanjieze.com/H-InDex .

Chat is not available.