Chanot: An intelligent annotation tool for indigenous and highly agglutinative languages in Peru

Rodolfo Mercado-Gonzales, José Pereira-Noriega, Marco Sobrevilla, Arturo Oncevay

Research output: Chapter in book/report/conference proceedings > Conference contribution > Peer-reviewed

1 Citation (Scopus)

Abstract

Linguistic corpus annotation is one of the most important phases in addressing Natural Language Processing (NLP) tasks, as these tasks rely heavily on corpus-based techniques. However, metadata annotation is a highly laborious manual task. A supportive alternative is the use of computational tools, which can simplify some of these operations while also being adjusted to the needs of particular language features. This paper therefore presents ChAnot, a web-based annotation tool developed for Peruvian indigenous and highly agglutinative languages, with Shipibo-Konibo as the case study. The tool supports a diverse set of linguistic annotation tasks, such as morphological segmentation markup and POS-tag markup, among others. It also includes a suggestion engine based on historical and machine learning models, and a set of statistics about previous annotations.

Original language: English
Title of host publication: LREC 2018 - 11th International Conference on Language Resources and Evaluation
Editors: Hitoshi Isahara, Bente Maegaard, Stelios Piperidis, Christopher Cieri, Thierry Declerck, Koiti Hasida, Helene Mazo, Khalid Choukri, Sara Goggi, Joseph Mariani, Asuncion Moreno, Nicoletta Calzolari, Jan Odijk, Takenobu Tokunaga
Publisher: European Language Resources Association (ELRA)
Pages: 4150-4154
Number of pages: 5
ISBN (electronic): 9791095546009
Status: Published - 2019
Event: 11th International Conference on Language Resources and Evaluation, LREC 2018 - Miyazaki, Japan
Duration: 7 May 2018 to 12 May 2018

Publication series

Name: LREC 2018 - 11th International Conference on Language Resources and Evaluation

Conference

Conference: 11th International Conference on Language Resources and Evaluation, LREC 2018
Country/Territory: Japan
City: Miyazaki
Period: 7/05/18 to 12/05/18
