Timezone: »
This paper takes a step towards temporal reasoning in a dynamically changing video, not in the pixel space that constitutes its frames, but in a latent space that describes the non-linear dynamics of the objects in its world. We introduce the Kalman variational auto-encoder, a framework for unsupervised learning of sequential data that disentangles two latent representations: an object's representation, coming from a recognition model, and a latent state describing its dynamics. As a result, the evolution of the world can be imagined and missing data imputed, both without the need to generate high dimensional frames at each time step. The model is trained end-to-end on videos of a variety of simulated physical systems, and outperforms competing methods in generative and missing data imputation tasks.
Author Information
Marco Fraccaro (Technical University of Denmark (DTU))
Simon Kamronn (Technical University of Denmark)
Deep learning and Bayesian statistical modelling of time series from a long-term intervention study.
Ulrich Paquet
Ole Winther (Technical University of Denmark)
Related Events (a corresponding poster, oral, or spotlight)
-
2017 Spotlight: A Disentangled Recognition and Nonlinear Dynamics Model for Unsupervised Learning »
Wed. Dec 6th 07:35 -- 07:40 PM Room Hall A
More from the Same Authors
-
2020 Meetup: MeetUp: Copenhagen, Denmark »
Ole Winther -
2021 : Hierarchical Few-Shot Generative Models »
Giorgio Giannone · Ole Winther -
2022 : Identifying endogenous peptide receptors by combining structure and transmembrane topology prediction »
Felix Teufel · Jan Christian Refsgaard · Christian Toft Madsen · Carsten Stahlhut · Mads Grønborg · Dennis Madsen · Ole Winther -
2022 : Few-Shot Diffusion Models »
Giorgio Giannone · Didrik Nielsen · Ole Winther -
2019 Poster: BIVA: A Very Deep Hierarchy of Latent Variables for Generative Modeling »
Lars Maaløe · Marco Fraccaro · Valentin Liévin · Ole Winther -
2018 Poster: Recurrent Relational Networks »
Rasmus Berg Palm · Ulrich Paquet · Ole Winther -
2017 : Panel Session »
Neil Lawrence · Finale Doshi-Velez · Zoubin Ghahramani · Yann LeCun · Max Welling · Yee Whye Teh · Ole Winther -
2017 Poster: Hash Embeddings for Efficient Word Representations »
Dan Tito Svenstrup · Jonas Hansen · Ole Winther -
2016 Poster: Sequential Neural Models with Stochastic Layers »
Marco Fraccaro · Søren Kaae Sønderby · Ulrich Paquet · Ole Winther -
2016 Oral: Sequential Neural Models with Stochastic Layers »
Marco Fraccaro · Søren Kaae Sønderby · Ulrich Paquet · Ole Winther