TY - GEN
T1 - Back-translation as strategy to tackle the lack of corpus in natural language generation from semantic representations
AU - Cabezudo, Marco Antonio Sobrevilla
AU - Mille, Simon
AU - Pardo, Thiago Alexandre Salgueiro
N1 - Publisher Copyright: © 2019 Association for Computational Linguistics.
PY - 2019
Y1 - 2019
N2 - This paper presents an exploratory study that aims to evaluate the usefulness of back-translation in Natural Language Generation (NLG) from semantic representations for non-English languages. Specifically, Abstract Meaning Representation and Brazilian Portuguese (BP) are chosen as semantic representation and language, respectively. Two methods (focused on Statistical and Neural Machine Translation) are evaluated on two datasets (one automatically generated and another one human-generated) to compare the performance in a real context. Also, several cuts according to quality measures are performed to evaluate the importance (or not) of the data quality in NLG. Results show that there are still many improvements to be made but this is a promising approach.
AB - This paper presents an exploratory study that aims to evaluate the usefulness of back-translation in Natural Language Generation (NLG) from semantic representations for non-English languages. Specifically, Abstract Meaning Representation and Brazilian Portuguese (BP) are chosen as semantic representation and language, respectively. Two methods (focused on Statistical and Neural Machine Translation) are evaluated on two datasets (one automatically generated and another one human-generated) to compare the performance in a real context. Also, several cuts according to quality measures are performed to evaluate the importance (or not) of the data quality in NLG. Results show that there are still many improvements to be made but this is a promising approach.
UR - http://www.scopus.com/inward/record.url?scp=85097971963&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85097971963
T3 - MSR@EMNLP-IJCNLP 2019 - 2nd Workshop on Multilingual Surface Realisation, Proceedings
SP - 94
EP - 103
BT - MSR@EMNLP-IJCNLP 2019 - 2nd Workshop on Multilingual Surface Realisation, Proceedings
PB - Association for Computational Linguistics (ACL)
T2 - 2nd Workshop on Multilingual Surface Realisation, MSR@EMNLP-IJCNLP 2019
Y2 - 3 November 2019
ER -