Timezone: »
Conditional image synthesis aims to create an image according to some multi-modal guidance in the forms of textual descriptions, reference images, and image blocks to preserve, as well as their combinations. In this paper, instead of investigating these control signals separately, we propose a new two-stage architecture, UFC-BERT, to unify any number of multi-modal controls. In UFC-BERT, both the diverse control signals and the synthesized image are uniformly represented as a sequence of discrete tokens to be processed by Transformer. Different from existing two-stage autoregressive approaches such as DALL-E and VQGAN, UFC-BERT adopts non-autoregressive generation (NAR) at the second stage to enhance the holistic consistency of the synthesized image, to support preserving specified image blocks, and to improve the synthesis speed. Further, we design a progressive algorithm that iteratively improves the non-autoregressively generated image, with the help of two estimators developed for evaluating the compliance with the controls and evaluating the fidelity of the synthesized image, respectively. Extensive experiments on a newly collected large-scale clothing dataset M2C-Fashion and a facial dataset Multi-Modal CelebA-HQ verify that UFC-BERT can synthesize high-fidelity images that comply with flexible multi-modal controls.
Author Information
Zhu Zhang (Zhejiang University)
Jianxin Ma (Alibaba Group)
Chang Zhou (Alibaba Group)
Rui Men (Alibaba Group)
Zhikang Li (Alibaba Group)
Ming Ding (Tsinghua University)
Jie Tang (Tsinghua University)
Jingren Zhou (Alibaba Group)
Hongxia Yang (Alibaba Group)
More from the Same Authors
-
2021 : Graph Robustness Benchmark: Benchmarking the Adversarial Robustness of Graph Machine Learning »
Qinkai Zheng · Xu Zou · Yuxiao Dong · Yukuo Cen · Da Yin · Jiarong Xu · Yang Yang · Jie Tang -
2022 Poster: CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers »
Ming Ding · Wendi Zheng · Wenyi Hong · Jie Tang -
2021 : Invited talk 3 »
Jie Tang -
2021 Poster: Adaptive Diffusion in Graph Neural Networks »
Jialin Zhao · Yuxiao Dong · Ming Ding · Evgeny Kharlamov · Jie Tang -
2021 Poster: CogView: Mastering Text-to-Image Generation via Transformers »
Ming Ding · Zhuoyi Yang · Wenyi Hong · Wendi Zheng · Chang Zhou · Da Yin · Junyang Lin · Xu Zou · Zhou Shao · Hongxia Yang · Jie Tang -
2021 Poster: Low-Rank Subspaces in GANs »
Jiapeng Zhu · Ruili Feng · Yujun Shen · Deli Zhao · Zheng-Jun Zha · Jingren Zhou · Qifeng Chen -
2021 Poster: A Hierarchical Reinforcement Learning Based Optimization Framework for Large-scale Dynamic Pickup and Delivery Problems »
Yi Ma · Xiaotian Hao · Jianye Hao · Jiawen Lu · Xing Liu · Tong Xialiang · Mingxuan Yuan · Zhigang Li · Jie Tang · Zhaopeng Meng -
2020 Poster: Graph Random Neural Networks for Semi-Supervised Learning on Graphs »
Wenzheng Feng · Jie Zhang · Yuxiao Dong · Yu Han · Huanbo Luan · Qian Xu · Qiang Yang · Evgeny Kharlamov · Jie Tang -
2020 Oral: Graph Random Neural Networks for Semi-Supervised Learning on Graphs »
Wenzheng Feng · Jie Zhang · Yuxiao Dong · Yu Han · Huanbo Luan · Qian Xu · Qiang Yang · Evgeny Kharlamov · Jie Tang -
2020 Poster: A Matrix Chernoff Bound for Markov Chains and Its Application to Co-occurrence Matrices »
Jiezhong Qiu · Chi Wang · Ben Liao · Richard Peng · Jie Tang -
2020 Poster: Counterfactual Prediction for Bundle Treatment »
Hao Zou · Peng Cui · Bo Li · Zheyan Shen · Jianxin Ma · Hongxia Yang · Yue He -
2020 Poster: Learning to Mutate with Hypergradient Guided Population »
Zhiqiang Tao · Yaliang Li · Bolin Ding · Ce Zhang · Jingren Zhou · Yun Fu -
2020 Poster: Counterfactual Contrastive Learning for Weakly-Supervised Vision-Language Grounding »
Zhu Zhang · Zhou Zhao · Zhijie Lin · jieming zhu · Xiuqiang He -
2020 Poster: CogLTX: Applying BERT to Long Texts »
Ming Ding · Chang Zhou · Hongxia Yang · Jie Tang -
2019 Poster: Learning Disentangled Representations for Recommendation »
Jianxin Ma · Chang Zhou · Peng Cui · Hongxia Yang · Wenwu Zhu -
2018 Poster: Bandit Learning with Implicit Feedback »
Yi Qi · Qingyun Wu · Hongning Wang · Jie Tang · Maosong Sun