TY - GEN
T1 - Spell-checking based on syllabification and character-level graphs for a Peruvian agglutinative language
AU - Alva, Carlo
AU - Oncevay-Marcos, Arturo
N1 - Publisher Copyright: © EMNLP 2017. All rights reserved.
PY - 2017
Y1 - 2017
N2 - There are several native languages in Peru, most of which are agglutinative. These languages are transmitted from generation to generation mainly in oral form, which has led to different written forms across communities. For this reason, there are recent efforts to standardize spelling in written texts, and it would be beneficial to support these efforts with an automatic tool such as a spell-checker. To that end, a spelling corrector is being developed based on two steps: an automatic rule-based syllabification method and a character-level graph to detect the degree of error in a misspelled word. The experiments were conducted on Shipibo-konibo, a highly agglutinative Amazonian language, and the results obtained are promising on a dataset built for the purpose.
AB - There are several native languages in Peru, most of which are agglutinative. These languages are transmitted from generation to generation mainly in oral form, which has led to different written forms across communities. For this reason, there are recent efforts to standardize spelling in written texts, and it would be beneficial to support these efforts with an automatic tool such as a spell-checker. To that end, a spelling corrector is being developed based on two steps: an automatic rule-based syllabification method and a character-level graph to detect the degree of error in a misspelled word. The experiments were conducted on Shipibo-konibo, a highly agglutinative Amazonian language, and the results obtained are promising on a dataset built for the purpose.
UR - http://www.scopus.com/inward/record.url?scp=85121457073&partnerID=8YFLogxK
U2 - 10.18653/v1/w17-4116
DO - 10.18653/v1/w17-4116
M3 - Conference contribution
AN - SCOPUS:85121457073
T3 - EMNLP 2017 - 1st Workshop on Subword and Character Level Models in NLP, SCLeM 2017 - Proceedings of the Workshop
SP - 109
EP - 116
BT - EMNLP 2017 - 1st Workshop on Subword and Character Level Models in NLP, SCLeM 2017 - Proceedings of the Workshop
A2 - Faruqui, Manaal
A2 - Schütze, Hinrich
A2 - Trancoso, Isabel
A2 - Yaghoobzadeh, Yadollah
PB - Association for Computational Linguistics (ACL)
T2 - EMNLP 2017 1st Workshop on Subword and Character Level Models in NLP, SCLeM 2017
Y2 - 7 September 2017
ER -