BERT is incapable of processing long texts because its memory and time consumption grow quadratically with input length. The most straightforward ways to address this problem, such as slicing the text with a sliding window or simplifying the transformer, either suffer from insufficient long-range attention or need customized CUDA kernels. The limited text length of BERT reminds us of the limited capacity (5∼9 chunks) of human working memory. How, then, do human beings Cognize Long TeXts? Founded on the cognitive theory stemming from Baddeley, our CogLTX framework identifies key sentences by training a judge model, concatenates them for reasoning, and enables multi-step reasoning via rehearsal and decay. Since relevance annotations are usually unavailable, we propose to use treatment experiments to create supervision. As a general algorithm, CogLTX outperforms or achieves results comparable to state-of-the-art (SOTA) models on NewsQA, HotpotQA, and multi-class and multi-label long-text classification tasks, with memory overhead independent of the text length.
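The select-then-concatenate idea described above can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`split_into_blocks`, `overlap_judge`, `select_key_blocks`) and the word budget are hypothetical, and the trained judge model is replaced by a toy word-overlap score purely for illustration. Rehearsal, decay, and the treatment-experiment supervision are not modeled here.

```python
from typing import Callable, List


def split_into_blocks(text: str, block_words: int) -> List[str]:
    """Cut a long text into consecutive blocks of at most `block_words` words."""
    words = text.split()
    return [" ".join(words[i:i + block_words])
            for i in range(0, len(words), block_words)]


def overlap_judge(query: str, block: str) -> float:
    """Toy stand-in for a trained judge (assumption): fraction of query words in the block."""
    q = set(query.lower().split())
    b = set(block.lower().split())
    return len(q & b) / len(q) if q else 0.0


def select_key_blocks(query: str,
                      blocks: List[str],
                      judge: Callable[[str, str], float],
                      budget_words: int) -> List[str]:
    """Greedily keep the highest-scoring blocks that fit within a fixed word budget.

    The kept blocks are returned in their original document order, ready to be
    concatenated and passed to a fixed-length reasoner.
    """
    ranked = sorted(range(len(blocks)),
                    key=lambda i: judge(query, blocks[i]),
                    reverse=True)
    chosen: List[int] = []
    used = 0
    for i in ranked:
        n = len(blocks[i].split())
        if used + n <= budget_words:
            chosen.append(i)
            used += n
    return [blocks[i] for i in sorted(chosen)]


if __name__ == "__main__":
    blocks = [
        "the weather was mild that year",
        "paris is the capital of france",
        "dogs bark loudly at night",
    ]
    query = "what is the capital of france"
    print(" ".join(select_key_blocks(query, blocks, overlap_judge, budget_words=6)))
```

Because the concatenated input never exceeds `budget_words`, the size of what the reasoner sees does not depend on how long the original text is, which is the property the abstract refers to as memory overhead independent of text length.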
Author Information
Ming Ding (Tsinghua University)
Chang Zhou (Alibaba Group)
Hongxia Yang (Alibaba Group)
Jie Tang (Tsinghua University)
More from the Same Authors
- 2021: Graph Robustness Benchmark: Benchmarking the Adversarial Robustness of Graph Machine Learning
  Qinkai Zheng · Xu Zou · Yuxiao Dong · Yukuo Cen · Da Yin · Jiarong Xu · Yang Yang · Jie Tang
- 2022 Poster: CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers
  Ming Ding · Wendi Zheng · Wenyi Hong · Jie Tang
- 2021: Invited talk 3
  Jie Tang
- 2021 Poster: Adaptive Diffusion in Graph Neural Networks
  Jialin Zhao · Yuxiao Dong · Ming Ding · Evgeny Kharlamov · Jie Tang
- 2021 Poster: CogView: Mastering Text-to-Image Generation via Transformers
  Ming Ding · Zhuoyi Yang · Wenyi Hong · Wendi Zheng · Chang Zhou · Da Yin · Junyang Lin · Xu Zou · Zhou Shao · Hongxia Yang · Jie Tang
- 2021 Poster: UFC-BERT: Unifying Multi-Modal Controls for Conditional Image Synthesis
  Zhu Zhang · Jianxin Ma · Chang Zhou · Rui Men · Zhikang Li · Ming Ding · Jie Tang · Jingren Zhou · Hongxia Yang
- 2021 Poster: A Hierarchical Reinforcement Learning Based Optimization Framework for Large-scale Dynamic Pickup and Delivery Problems
  Yi Ma · Xiaotian Hao · Jianye Hao · Jiawen Lu · Xing Liu · Tong Xialiang · Mingxuan Yuan · Zhigang Li · Jie Tang · Zhaopeng Meng
- 2020 Poster: Graph Random Neural Networks for Semi-Supervised Learning on Graphs
  Wenzheng Feng · Jie Zhang · Yuxiao Dong · Yu Han · Huanbo Luan · Qian Xu · Qiang Yang · Evgeny Kharlamov · Jie Tang
- 2020 Oral: Graph Random Neural Networks for Semi-Supervised Learning on Graphs
  Wenzheng Feng · Jie Zhang · Yuxiao Dong · Yu Han · Huanbo Luan · Qian Xu · Qiang Yang · Evgeny Kharlamov · Jie Tang
- 2020 Poster: A Matrix Chernoff Bound for Markov Chains and Its Application to Co-occurrence Matrices
  Jiezhong Qiu · Chi Wang · Ben Liao · Richard Peng · Jie Tang
- 2020 Poster: Counterfactual Prediction for Bundle Treatment
  Hao Zou · Peng Cui · Bo Li · Zheyan Shen · Jianxin Ma · Hongxia Yang · Yue He
- 2019 Poster: Learning Disentangled Representations for Recommendation
  Jianxin Ma · Chang Zhou · Peng Cui · Hongxia Yang · Wenwu Zhu
- 2018 Poster: Bandit Learning with Implicit Feedback
  Yi Qi · Qingyun Wu · Hongning Wang · Jie Tang · Maosong Sun