Timezone: »
Slot-filling and intent detection are the backbone of conversational agents such as voice assistants, and are active areas of research. Even though state-of-the-art techniques on publicly available benchmarks show impressive performance, their ability to generalize to realistic scenarios is yet to be demonstrated. In this work, we present NATURE, a set of simple spoken-language-oriented transformations, applied to the evaluation set of datasets, to introduce human spoken language variations while preserving the semantics of an utterance. We apply NATURE to common slot-filling and intent detection benchmarks and demonstrate that simple perturbations from the standard evaluation set by NATURE can deteriorate model performance significantly. Through our experiments we demonstrate that when NATURE operators are applied to evaluation set of popular benchmarks the model accuracy can drop by up to 40%.
Author Information
David Alfonso-Hermelo (Huawei Technologies Ltd.)
David Alfonso-Hermelo is an associate researcher at the Huawei Noah's Ark Lab in Montreal. Prior to joining Huawei, he worked at the RALI lab of the University of Montreal as research agent. He has obtained 3 MSc degrees: in Natural Language Processing from the Sorbonne Nouvelle University, in Language Sciences from Grenoble III University and Applied Linguistics from the University of Havana. His current research interests are Natural Language Processing, Knowledge Distillation, semantics representation for neural models and user-computer communication.
Ahmad Rashid (Huawei Technologies)
Abbas Ghaddar (Huawei Noah's Ark Lab, Montreal Research Center, Canada)
Philippe Langlais (University of Montreal)
Mehdi Rezagholizadeh (Huawei Technologies)
More from the Same Authors
-
2021 : A Short Study on Compressing Decoder-Based Language Models »
Tianda Li · Yassir El Mesbahi · Ivan Kobyzev · Ahmad Rashid · Atif Mahmud · Nithin Anchuri · Habib Hajimolahoseini · Yang Liu · Mehdi Rezagholizadeh -
2021 : Kronecker Decomposition for GPT Compression »
Ali Edalati · Marzieh Tahaei · Ahmad Rashid · Vahid Partovi Nia · James J. Clark · Mehdi Rezaghoizadeh -
2022 : Attribute Controlled Dialogue Prompting »
Runcheng Liu · Ahmad Rashid · Ivan Kobyzev · Mehdi Rezaghoizadeh · Pascal Poupart -
2022 : Attribute Controlled Dialogue Prompting »
Runcheng Liu · Ahmad Rashid · Ivan Kobyzev · Mehdi Rezaghoizadeh · Pascal Poupart -
2022 Workshop: Second Workshop on Efficient Natural Language and Speech Processing (ENLSP-II) »
Mehdi Rezagholizadeh · Peyman Passban · Yue Dong · Lili Mou · Pascal Poupart · Ali Ghodsi · Qun Liu -
2022 Poster: A new dataset for multilingual keyphrase generation »
Frédéric Piedboeuf · Philippe Langlais