Building an Endangered Language Resource in the Classroom: Universal Dependencies for Kakataibo

Roberto Zariquiey, Claudia Alvarado, Ximena Echevarria, Luisa Gomez, Rosa Gonzales, Mariana Illescas, Sabina Oporto, Frederic Blum, Arturo Oncevay, Javier Vera

Research output: Chapter in book/report/conference proceeding; Conference contribution; peer-reviewed

1 citation (Scopus)

Abstract

In this paper, we launch a new Universal Dependencies treebank for an endangered language from Amazonia: Kakataibo, a Panoan language spoken in Peru. We first discuss the collaborative methodology implemented, which proved effective for creating a treebank in the context of a Computational Linguistics course for undergraduates. Then, we describe the general details of the treebank and the language-specific considerations implemented for the proposed annotation. Finally, we conduct some experiments on part-of-speech tagging and syntactic dependency parsing. We focus on monolingual and transfer learning settings, where we study the impact of a Shipibo-Konibo treebank, another Panoan language resource.

Original language: English
Title of host publication: 2022 Language Resources and Evaluation Conference, LREC 2022
Editors: Nicoletta Calzolari, Frederic Bechet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Helene Mazo, Jan Odijk, Stelios Piperidis
Publisher: European Language Resources Association (ELRA)
Pages: 3840-3851
Number of pages: 12
ISBN (electronic): 9791095546726
Status: Published - 2022
Event: 13th International Conference on Language Resources and Evaluation Conference, LREC 2022 - Marseille, France
Duration: 20 Jun 2022 to 25 Jun 2022

Publication series

Name: 2022 Language Resources and Evaluation Conference, LREC 2022

Conference

Conference: 13th International Conference on Language Resources and Evaluation Conference, LREC 2022
Country/Territory: France
City: Marseille
Period: 20/06/22 to 25/06/22
