Skip to yearly menu bar Skip to main content


video
in
Workshop: Data Centric AI

How should human translation coexist with NMT? Efficient tool for building high quality parallel corpus


Abstract:

This paper proposes a tool for efficiently constructing high-quality parallel corpora with minimizing human labor and making this tool publicly available. Our proposed construction process is based on neural machine translation (NMT) to allow for it to not only coexist with human translation, but also improve its efficiency by combining data quality control with human translation in a data-centric approach.