Neural Borrowing Detection with Monolingual Lexical Models

John E. Miller, Emanuel Pariasca, César A. Beltrán Castañón

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

2 Citas (Scopus)

Resumen

Identification of lexical borrowings, transfer of words between languages, is an essential practice of historical linguistics and a vital tool in analysis of language contact and cultural events in general. We seek to improve tools for automatic detection of lexical borrowings, focusing here on detecting borrowed words from monolingual wordlists. Starting with a recurrent neural network lexical model and competing entropies approach, we incorporate a more current Transformer based lexical model. From there we experiment with several different models and approaches including a lexical donor model with augmented wordlist. The Transformer model reduces execution time and minimally improves borrowing detection, and the augmented donor model shows some promise. A substantive change in approach or model seems necessary for significant gains in detection of lexical borrowings.

Idioma originalInglés
Título de la publicación alojadaProceedings of the 1stWorkshop on Multimodal Machine Translation for Low Resource Languages, MMTLRL 2021 in conjunction with International Conference on Recent Advances in Natural Language Processing, RANLP 2021
EditoresReinhard Rapp, Thoudam Doren Singh, Cristina Espana i Bonet, Reinhard Rapp, Sivaji Bandyopadhyay, Serge Sharoff, Josef Van Genabith, Pierre Zweigenbaum
EditorialIncoma Ltd
Páginas109-117
Número de páginas9
ISBN (versión digital)9789544520731, 9789544520762
DOI
EstadoPublicada - 2021
Evento2021 Student Research Workshop, SRW 2021 associated with the International Conference on Recent Advances in Natural Language Processing, RANLP 2021 - Virtual, Online
Duración: 1 set. 20213 set. 2021

Serie de la publicación

NombreInternational Conference Recent Advances in Natural Language Processing, RANLP
Volumen2021-September
ISSN (versión impresa)1313-8502

Conferencia

Conferencia2021 Student Research Workshop, SRW 2021 associated with the International Conference on Recent Advances in Natural Language Processing, RANLP 2021
CiudadVirtual, Online
Período1/09/213/09/21

Huella

Profundice en los temas de investigación de 'Neural Borrowing Detection with Monolingual Lexical Models'. En conjunto forman una huella única.

Citar esto