TY - GEN
T1 - Natural language inference for Portuguese using BERT and multilingual information
AU - Cabezudo, Marco Antonio Sobrevilla
AU - Inácio, Marcio
AU - Rodrigues, Ana Carolina
AU - Casanova, Edresson
AU - de Sousa, Rogério Figueredo
N1 - Publisher Copyright: © Springer Nature Switzerland AG 2020.
PY - 2020
Y1 - 2020
AB - Recognizing Textual Entailment, also known as inference recognition, aims to identify when the meaning of a piece of text contains the meaning of another fragment of text. In this work, we investigate multiple approaches for recognizing inference in the ASSIN dataset, an entailment recognition corpus for Portuguese. We also investigate the consequences of adding external data to improve training in two different forms: multilingual data and an automatically translated corpus. Using the multilingual pre-trained BERT model, our results outperform the current state of the art for the ASSIN corpus. Finally, we show that using external data either did not improve the performance of the model or yielded improvements that are not significant.
KW - BERT
KW - Cross-lingual training
KW - Multilingual training
KW - Natural Language Inference
UR - http://www.scopus.com/inward/record.url?scp=85081579496&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-41505-1_33
DO - 10.1007/978-3-030-41505-1_33
M3 - Conference contribution
AN - SCOPUS:85081579496
SN - 9783030415044
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 346
EP - 356
BT - Computational Processing of the Portuguese Language - 14th International Conference, PROPOR 2020, Proceedings
A2 - Quaresma, Paulo
A2 - Vieira, Renata
A2 - Gonçalves, Teresa
A2 - Aluísio, Sandra
A2 - Moniz, Helena
A2 - Batista, Fernando
PB - Springer
T2 - 14th International Conference on Computational Processing of the Portuguese Language, PROPOR 2020
Y2 - 2 March 2020 through 4 March 2020
ER -