Timezone: »
This paper addresses the problem of manipulating images using natural language description. Our task aims to semantically modify visual attributes of an object in an image according to the text describing the new visual appearance. Although existing methods synthesize images having new attributes, they do not fully preserve text-irrelevant contents of the original image. In this paper, we propose the text-adaptive generative adversarial network (TAGAN) to generate semantically manipulated images while preserving text-irrelevant contents. The key to our method is the text-adaptive discriminator that creates word level local discriminators according to input text to classify fine-grained attributes independently. With this discriminator, the generator learns to generate images where only regions that correspond to the given text is modified. Experimental results show that our method outperforms existing methods on CUB and Oxford-102 datasets, and our results were mostly preferred on a user study. Extensive analysis shows that our method is able to effectively disentangle visual attributes and produce pleasing outputs.
Author Information
Seonghyeon Nam (Yonsei University)
Yunji Kim (Yonsei University)
Seon Joo Kim (Yonsei University)
Related Events (a corresponding poster, oral, or spotlight)
-
2018 Poster: Text-Adaptive Generative Adversarial Networks: Manipulating Images with Natural Language »
Wed. Dec 5th 03:45 -- 05:45 PM Room Room 517 AB #126
More from the Same Authors
-
2022 Poster: Mutual Information Divergence: A Unified Metric for Multimodal Generative Models »
Jin-Hwa Kim · Yunji Kim · Jiyoung Lee · Kang Min Yoo · Sang-Woo Lee -
2022 Poster: VITA: Video Instance Segmentation via Object Token Association »
Miran Heo · Sukjun Hwang · Seoung Wug Oh · Joon-Young Lee · Seon Joo Kim -
2022 Poster: Dense Interspecies Face Embedding »
Sejong Yang · Subin Jeon · Seonghyeon Nam · Seon Joo Kim -
2022 Poster: ComMU: Dataset for Combinatorial Music Generation »
Hyun Lee · Taehyun Kim · Hyolim Kang · Minjoo Ki · Hyeonchan Hwang · kwanho park · Sharang Han · Seon Joo Kim -
2021 Poster: Video Instance Segmentation using Inter-Frame Communication Transformers »
Sukjun Hwang · Miran Heo · Seoung Wug Oh · Seon Joo Kim -
2019 Poster: Unsupervised Keypoint Learning for Guiding Class-Conditional Video Prediction »
Yunji Kim · Seonghyeon Nam · In Cho · Seon Joo Kim