Deep generative models have become increasingly effective at producing realistic images from randomly sampled seeds, but using such models for controllable manipulation of existing images remains challenging. We propose the Swapping Autoencoder, a deep model designed specifically for image manipulation, rather than random sampling. The key idea is to encode an image into two independent components and enforce that any swapped combination maps to a realistic image. In particular, we encourage the components to represent structure and texture, by enforcing one component to encode co-occurrent patch statistics across different parts of the image. As our method is trained with an encoder, finding the latent codes for a new input image becomes trivial, rather than cumbersome. As a result, our method enables us to manipulate real input images in various ways, including texture swapping, local and global editing, and latent code vector arithmetic. Experiments on multiple datasets show that our model produces better results and is substantially more efficient compared to recent generative models.
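The swapping operation described in the abstract can be sketched with a toy example. This is only an illustration under loud assumptions: the functions `encode`, `decode`, and `swap` are hypothetical stand-ins, not the authors' networks. Here the "texture" code is just the mean brightness of a grayscale image and the "structure" code is the per-pixel residual, whereas the actual model learns both components and additionally enforces that swapped combinations look realistic, which this sketch does not attempt.

```python
# Toy illustration of swapping two-part latent codes.
# Assumption: hand-written stand-ins replace the learned encoder and generator.

def encode(image):
    """Split a grayscale image (list of rows) into (structure, texture)."""
    pixels = [p for row in image for p in row]
    texture = sum(pixels) / len(pixels)                  # global statistic
    structure = [[p - texture for p in row] for row in image]  # spatial layout
    return structure, texture

def decode(structure, texture):
    """Recombine a structure code with a texture code into an image."""
    return [[s + texture for s in row] for row in structure]

def swap(image_a, image_b):
    """Hybrid image: structure code of A combined with texture code of B."""
    structure_a, _ = encode(image_a)
    _, texture_b = encode(image_b)
    return decode(structure_a, texture_b)

image_a = [[0.0, 1.0], [1.0, 0.0]]   # checkerboard layout
image_b = [[0.8, 0.8], [0.8, 0.8]]   # flat, brighter image
hybrid = swap(image_a, image_b)      # checkerboard layout at B's brightness
```

Because an encoder is available, obtaining the codes for a new image in this sketch is a single call to `encode`, with no per-image optimization, which mirrors the efficiency argument made in the abstract.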
Author Information
Taesung Park (UC Berkeley)
Jun-Yan Zhu (Adobe, CMU)
Oliver Wang (Adobe Research)
Jingwan Lu (Adobe Research)
Jingwan joined Adobe Research in August 2014. Her current research interests include deep-learning-based image editing and generation, sketch-based search, creative applications for AR and VR, data-driven visual content creation, computational photography, and other vision and machine learning topics. Jingwan received her Ph.D. in computer science from Princeton University, where her work focused on designing algorithms and interfaces for data-driven painting applications. During her Ph.D., she was awarded a Google Research Fellowship from 2012 to 2014 and a Siebel Scholarship from 2013 to 2014.
Eli Shechtman (Adobe Research, US)
Alexei Efros (UC Berkeley)
Richard Zhang (Adobe)
Richard Zhang is a Research Scientist at Adobe Research, with interests in computer vision, deep learning, machine learning, and graphics. He obtained his PhD in EECS, advised by Professor Alexei A. Efros, at UC Berkeley in 2018. He graduated summa cum laude with BS and MEng degrees from Cornell University in ECE. He is a recipient of the 2017 Adobe Research Fellowship. More information can be found on his webpage: http://richzhang.github.io/.
More from the Same Authors
- 2022: Videogenic: Video Highlights via Photogenic Moments
  David Chuan-En Lin · Fabian Caba · Joon-Young Lee · Oliver Wang · Nikolas Martelaro
- 2022: VideoMap: Video Editing in Latent Space
  David Chuan-En Lin · Fabian Caba · Joon-Young Lee · Oliver Wang · Nikolas Martelaro
- 2022: Studying Bias in GANs through the Lens of Race
  Vongani Maluleke · Neerja Thakkar · Tim Brooks · Ethan Weber · Trevor Darrell · Alexei Efros · Angjoo Kanazawa · Devin Guillory
- 2022 Poster: Test-Time Training with Masked Autoencoders
  Yossi Gandelsman · Yu Sun · Xinlei Chen · Alexei Efros
- 2022 Poster: Visual Prompting via Image Inpainting
  Amir Bar · Yossi Gandelsman · Trevor Darrell · Amir Globerson · Alexei Efros
- 2022 Poster: Generating Long Videos of Dynamic Scenes
  Tim Brooks · Janne Hellsten · Miika Aittala · Ting-Chun Wang · Timo Aila · Jaakko Lehtinen · Ming-Yu Liu · Alexei Efros · Tero Karras
- 2021: KDSalBox: A toolbox of efficient knowledge-distilled saliency models
  Ard Kastrati · Zoya Bylinskii · Eli Shechtman
- 2021 Poster: MarioNette: Self-Supervised Sprite Learning
  Dmitriy Smirnov · Michael Gharbi · Matthew Fisher · Vitor Guizilini · Alexei Efros · Justin Solomon
- 2020: Panel Discussion & Closing
  Yejin Choi · Alexei Efros · Chelsea Finn · Kristen Grauman · Quoc V Le · Yann LeCun · Ruslan Salakhutdinov · Eric Xing
- 2020: QA: Alexei Efros
  Alexei Efros
- 2020: Invited Talk: Alexei Efros
  Alexei Efros
- 2020 Poster: Few-shot Image Generation with Elastic Weight Consolidation
  Yijun Li · Richard Zhang · Jingwan (Cynthia) Lu · Eli Shechtman
- 2020 Poster: Space-Time Correspondence as a Contrastive Random Walk
  Allan Jabri · Andrew Owens · Alexei Efros
- 2020 Oral: Space-Time Correspondence as a Contrastive Random Walk
  Allan Jabri · Andrew Owens · Alexei Efros
- 2019: Poster Presentations
  Rahul Mehta · Andrew Lampinen · Binghong Chen · Sergio Pascual-Diaz · Jordi Grau-Moya · Aldo Faisal · Jonathan Tompson · Yiren Lu · Khimya Khetarpal · Martin Klissarov · Pierre-Luc Bacon · Doina Precup · Thanard Kurutach · Aviv Tamar · Pieter Abbeel · Jinke He · Maximilian Igl · Shimon Whiteson · Wendelin Boehmer · Raphaël Marinier · Olivier Pietquin · Karol Hausman · Sergey Levine · Chelsea Finn · Tianhe Yu · Lisa Lee · Benjamin Eysenbach · Emilio Parisotto · Eric Xing · Ruslan Salakhutdinov · Hongyu Ren · Anima Anandkumar · Deepak Pathak · Christopher Lu · Trevor Darrell · Alexei Efros · Phillip Isola · Feng Liu · Bo Han · Gang Niu · Masashi Sugiyama · Saurabh Kumar · Janith Petangoda · Johan Ferret · James McClelland · Kara Liu · Animesh Garg · Robert Lange
- 2019: Oral Presentations
  Janith Petangoda · Sergio Pascual-Diaz · Jordi Grau-Moya · Raphaël Marinier · Olivier Pietquin · Alexei Efros · Phillip Isola · Trevor Darrell · Christopher Lu · Deepak Pathak · Johan Ferret
- 2019 Poster: Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity
  Deepak Pathak · Christopher Lu · Trevor Darrell · Phillip Isola · Alexei Efros
- 2019 Spotlight: Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity
  Deepak Pathak · Christopher Lu · Trevor Darrell · Phillip Isola · Alexei Efros
- 2018 Poster: Self-Supervised Generation of Spatial Audio for 360° Video
  Pedro Morgado · Nuno Vasconcelos · Timothy Langlois · Oliver Wang
- 2017: How to stop worrying and learn to love Nearest Neighbors
  Alexei Efros
- 2017 Poster: Toward Multimodal Image-to-Image Translation
  Jun-Yan Zhu · Richard Zhang · Deepak Pathak · Trevor Darrell · Alexei Efros · Oliver Wang · Eli Shechtman
- 2016: What makes ImageNet good for Transfer Learning?
  Jacob MY Huh · Pulkit Agrawal · Alexei Efros