Towards a general abstract meaning representation corpus for Brazilian Portuguese

Marco Antonio Sobrevilla Cabezudo, Thiago Alexandre Salgueiro Pardo

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

20 Scopus citations

Abstract

Meaning Representation (AMR) is a recent and prominent semantic representation with good acceptance and several applications in the Natural Language Processing area. For English, there is a large annotated corpus (with approximately 39K sentences) that supports the research with the representation. However, to the best of our knowledge, there is only one restricted corpus for Portuguese, which contains 1,527 sentences. In this context, this paper presents an effort to build a general purpose AMR-annotated corpus for Brazilian Portuguese by translating and adapting AMR English guidelines. Our results show that such approach is feasible, but there are some challenging phenomena to solve. More than this, efforts are necessary to increase the coverage of the corresponding lexical resource that supports the annotation.

Original languageEnglish
Title of host publicationLAW 2019 - 13th Linguistic Annotation Workshop, Proceedings of the Workshop
PublisherAssociation for Computational Linguistics (ACL)
Pages236-244
Number of pages9
ISBN (Electronic)9781950737383
StatePublished - 2019
Externally publishedYes
Event13th Linguistic Annotation Workshop, LAW 2019, held in conjunction with the Annual Meeting of the Association for Computational Linguistics, ACL 2019 - Florence, Italy
Duration: 1 Aug 2019 → …

Publication series

NameLAW 2019 - 13th Linguistic Annotation Workshop, Proceedings of the Workshop

Conference

Conference13th Linguistic Annotation Workshop, LAW 2019, held in conjunction with the Annual Meeting of the Association for Computational Linguistics, ACL 2019
Country/TerritoryItaly
CityFlorence
Period1/08/19 → …

Fingerprint

Dive into the research topics of 'Towards a general abstract meaning representation corpus for Brazilian Portuguese'. Together they form a unique fingerprint.

Cite this