Timezone: »
Poster
MorphTE: Injecting Morphology in Tensorized Embeddings
Guobing Gan · Peng Zhang · Sunzhu Li · Xiuqing Lu · Benyou Wang
In the era of deep learning, word embeddings are essential when dealing with text tasks. However, storing and accessing these embeddings requires a large amount of space. This is not conducive to the deployment of these models on resource-limited devices. Combining the powerful compression capability of tensor products, we propose a word embedding compression method with morphological augmentation, Morphologically-enhanced Tensorized Embeddings (MorphTE). A word consists of one or more morphemes, the smallest units that bear meaning or have a grammatical function. MorphTE represents a word embedding as an entangled form of its morpheme vectors via the tensor product, which injects prior semantic and grammatical knowledge into the learning of embeddings. Furthermore, the dimensionality of the morpheme vector and the number of morphemes are much smaller than those of words, which greatly reduces the parameters of the word embeddings. We conduct experiments on tasks such as machine translation and question answering. Experimental results on four translation datasets of different languages show that MorphTE can compress word embedding parameters by about $20$ times without performance loss and significantly outperforms related embedding compression methods.
Author Information
Guobing Gan (Tianjin University)
Peng Zhang (Tianjin University)
Sunzhu Li (Tianjin University)
Xiuqing Lu (Tianjin University)
Benyou Wang (Universita' degli studi di Padova)
More from the Same Authors
-
2023 Poster: Med-UniC: Unifying Cross-Lingual Medical Vision-Language Pre-Training by Diminishing Bias »
Zhongwei Wan · Che Liu · Mi Zhang · Jie Fu · Benyou Wang · Sibo Cheng · Lei Ma · César Quilodrán-Casas · Rossella Arcucci -
2023 Poster: All In One: A Chinese Multi-Modal Dataset for Multi-Affection Detection in Conversations »
Yazhou Zhang · Yang Yu · Qing Guo · Benyou Wang · Dongming Zhao · Sagar Uprety · Dawei Song · Jing Qin · Qiuchi Li -
2022 Spotlight: Lightning Talks 6B-4 »
Junjie Chen · Chuanxia Zheng · JINLONG LI · Yu Shi · Shichao Kan · Yu Wang · Fermín Travi · Ninh Pham · Lei Chai · Guobing Gan · Tung-Long Vuong · Gonzalo Ruarte · Tao Liu · Li Niu · Jingjing Zou · Zequn Jie · Peng Zhang · Ming LI · Yixiong Liang · Guolin Ke · Jianfei Cai · Gaston Bujia · Sunzhu Li · Siyuan Zhou · Jingyang Lin · Xu Wang · Min Li · Zhuoming Chen · Qing Ling · Xiaolin Wei · Xiuqing Lu · Shuxin Zheng · Dinh Phung · Yigang Cen · Jianlou Si · Juan Esteban Kamienkowski · Jianxin Wang · Chen Qian · Lin Ma · Benyou Wang · Yingwei Pan · Tie-Yan Liu · Liqing Zhang · Zhihai He · Ting Yao · Tao Mei -
2022 Spotlight: MorphTE: Injecting Morphology in Tensorized Embeddings »
Guobing Gan · Peng Zhang · Sunzhu Li · Xiuqing Lu · Benyou Wang -
2021 Poster: Word2Fun: Modelling Words as Functions for Diachronic Word Representation »
Benyou Wang · Emanuele Di Buccio · Massimo Melucci -
2019 Poster: A Tensorized Transformer for Language Modeling »
Xindian Ma · Peng Zhang · Shuai Zhang · Nan Duan · Yuexian Hou · Ming Zhou · Dawei Song