Back-translation as strategy to tackle the lack of corpus in natural language generation from semantic representations

Marco Antonio Sobrevilla Cabezudo, Simon Mille, Thiago Alexandre Salgueiro Pardo

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

8 Scopus citations

Abstract

This paper presents an exploratory study that aims to evaluate the usefulness of back-translation in Natural Language Generation (NLG) from semantic representations for non-English languages. Specifically, Abstract Meaning Representation and Brazilian Portuguese (BP) are chosen as semantic representation and language, respectively. Two methods (focused on Statistical and Neural Machine Translation) are evaluated on two datasets (one automatically generated and another one human-generated) to compare the performance in a real context. Also, several cuts according to quality measures are performed to evaluate the importance (or not) of the data quality in NLG. Results show that there are still many improvements to be made but this is a promising approach.

Original languageEnglish
Title of host publicationMSR@EMNLP-IJCNLP 2019 - 2nd Workshop on Multilingual Surface Realisation, Proceedings
PublisherAssociation for Computational Linguistics (ACL)
Pages94-103
Number of pages10
ISBN (Electronic)9781950737765
StatePublished - 2019
Externally publishedYes
Event2nd Workshop on Multilingual Surface Realisation, MSR@EMNLP-IJCNLP 2019 - Hong Kong, China
Duration: 3 Nov 2019 → …

Publication series

NameMSR@EMNLP-IJCNLP 2019 - 2nd Workshop on Multilingual Surface Realisation, Proceedings

Conference

Conference2nd Workshop on Multilingual Surface Realisation, MSR@EMNLP-IJCNLP 2019
Country/TerritoryChina
CityHong Kong
Period3/11/19 → …

Fingerprint

Dive into the research topics of 'Back-translation as strategy to tackle the lack of corpus in natural language generation from semantic representations'. Together they form a unique fingerprint.

Cite this