Timezone: »
Modeling the world can benefit robot learning by providing a rich training signal for shaping an agent's latent state space. However, learning world models in unconstrained environments over high-dimensional observation spaces such as images is challenging. One source of difficulty is the presence of irrelevant but hard-to-model background distractions, and unimportant visual details of task-relevant entities. We address this issue by learning a recurrent latent dynamics model which contrastively predicts the next observation. This simple model leads to surprisingly robust robotic control even with simultaneous camera, background, and color distractions. We outperform alternatives such as bisimulation methods which impose state-similarity measures derived from divergence in future reward or future optimal actions. We obtain state-of-the-art results on the Distracting Control Suite, a challenging benchmark for pixel-based robotic control.
Author Information
Nitish Srivastava (Apple Inc)
Walter Talbott (Apple)
Shuangfei Zhai (Apple)
Joshua Susskind (Apple Inc.)
I was an undergraduate in Cognitive Science at UCSD from 1995-2003 (with some breaks). Then I earned a PhD from UofT in machine learning and cognitive neuroscience, with Dr. Geoff Hinton and Dr. Adam Anderson. Following grad school I moved to UCSD for a post-doctoral position. Before coming to Apple I co-founded Emotient in 2012 and led the deep learning effort for facial expression and demographics recognition. Since joining Apple, I led the Face ID neural network team responsible for face recognition, and then started a machine learning research group within the hardware organization focused on fundamental ML technology.
More from the Same Authors
-
2022 : The Slingshot Mechanism: An Empirical Study of Adaptive Optimizers and the \emph{Grokking Phenomenon} »
Vimal Thilak · Etai Littwin · Shuangfei Zhai · Omid Saremi · Roni Paiss · Joshua Susskind -
2022 Poster: GAUDI: A Neural Architect for Immersive 3D Scene Generation »
Miguel Angel Bautista · Pengsheng Guo · Samira Abnar · Walter Talbott · Alexander Toshev · Zhuoyuan Chen · Laurent Dinh · Shuangfei Zhai · Hanlin Goh · Daniel Ulbricht · Afshin Dehghan · Joshua Susskind -
2020 Poster: Collegial Ensembles »
Etai Littwin · Ben Myara · Sima Sabah · Joshua Susskind · Shuangfei Zhai · Oren Golan -
2020 Spotlight: Collegial Ensembles »
Etai Littwin · Ben Myara · Sima Sabah · Joshua Susskind · Shuangfei Zhai · Oren Golan -
2019 Poster: Adversarial Fisher Vectors for Unsupervised Representation Learning »
Shuangfei Zhai · Walter Talbott · Carlos Guestrin · Joshua Susskind -
2019 Spotlight: Adversarial Fisher Vectors for Unsupervised Representation Learning »
Shuangfei Zhai · Walter Talbott · Carlos Guestrin · Joshua Susskind -
2016 Poster: Doubly Convolutional Neural Networks »
Shuangfei Zhai · Yu Cheng · Weining Lu · Zhongfei (Mark) Zhang -
2014 Poster: Learning Generative Models with Visual Attention »
Yichuan Charlie Tang · Nitish Srivastava · Russ Salakhutdinov -
2014 Demonstration: Toronto Deep Learning »
Jamie Kiros · Russ Salakhutdinov · Nitish Srivastava · Yichuan Charlie Tang -
2014 Oral: Learning Generative Models with Visual Attention »
Yichuan Charlie Tang · Nitish Srivastava · Russ Salakhutdinov -
2013 Poster: Discriminative Transfer Learning with Tree-based Priors »
Nitish Srivastava · Russ Salakhutdinov -
2012 Poster: Multimodal Learning with Deep Boltzmann Machines »
Nitish Srivastava · Russ Salakhutdinov -
2012 Oral: Multimodal Learning with Deep Boltzmann Machines »
Nitish Srivastava · Russ Salakhutdinov