TY - JOUR
T1 - Findings of the Second AmericasNLP Competition on Speech-to-Text Translation
AU - Ebrahimi, Abteen
AU - Mager, Manuel
AU - Wiemerslage, Adam
AU - Denisov, Pavel
AU - Oncevay, Arturo
AU - Liu, Danni
AU - Koneru, Sai
AU - Ugan, Enes Yavuz
AU - Li, Zhaolin
AU - Niehues, Jan
AU - Romero, Monica
AU - Torre, Ivan G.
AU - Alumäe, Tanel
AU - Kong, Jiaming
AU - Polezhaev, Sergey
AU - Belousov, Yury
AU - Chen, Wei Rui
AU - Sullivan, Peter
AU - Adebara, Ife
AU - Talafha, Bashar
AU - Inciarte, Alcides Alcoba
AU - Abdul-Mageed, Muhammad
AU - Chiruzzo, Luis
AU - Coto-Solano, Rolando
AU - Cruz, Hilaria
AU - Flores-Solórzano, Sofía
AU - López, Aldo Andrés Alvarez
AU - Meza-Ruiz, Ivan
AU - Ortega, John E.
AU - Palmer, Alexis
AU - Zevallos, Rodolfo
AU - Stenzel, Kristine
AU - Vu, Thang
AU - Kann, Katharina
N1 - Publisher Copyright:
© 2023 A. Ebrahimi et al.
PY - 2023
Y1 - 2023
N2 - Indigenous languages, including those from the Americas, have received very little attention from the machine learning (ML) and natural language processing (NLP) communities. To tackle the resulting lack of systems for these languages and the accompanying social inequalities affecting their speakers, we conduct the second AmericasNLP competition (and the first one in collaboration with NeurIPS), which is centered around speech-to-text translation systems for Indigenous languages of the Americas. The competition features three tasks – (1) automatic speech recognition, (2) text-based machine translation, and (3) speech-to-text translation – and two tracks: constrained and unconstrained. Five Indigenous languages are covered: Bribri, Guarani, Kotiria, Wa’ikhana, and Quechua. In this overview paper, we describe the tasks, tracks, and languages, introduce the baseline and participating systems, and end with a summary of ongoing and future challenges for the automatic translation of Indigenous languages.
AB - Indigenous languages, including those from the Americas, have received very little attention from the machine learning (ML) and natural language processing (NLP) communities. To tackle the resulting lack of systems for these languages and the accompanying social inequalities affecting their speakers, we conduct the second AmericasNLP competition (and the first one in collaboration with NeurIPS), which is centered around speech-to-text translation systems for Indigenous languages of the Americas. The competition features three tasks – (1) automatic speech recognition, (2) text-based machine translation, and (3) speech-to-text translation – and two tracks: constrained and unconstrained. Five Indigenous languages are covered: Bribri, Guarani, Kotiria, Wa’ikhana, and Quechua. In this overview paper, we describe the tasks, tracks, and languages, introduce the baseline and participating systems, and end with a summary of ongoing and future challenges for the automatic translation of Indigenous languages.
KW - Indigenous languages
KW - automatic speech recognition
KW - low-resource languages
KW - low-resource machine translation
KW - machine translation
KW - natural language processing
KW - speech-to-text translation
UR - http://www.scopus.com/inward/record.url?scp=85178551391&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85178551391
SN - 2640-3498
VL - 220
SP - 217
EP - 232
JO - Proceedings of Machine Learning Research
JF - Proceedings of Machine Learning Research
T2 - 36th Annual Conference on Neural Information Processing Systems, NeurIPS 2022
Y2 - 28 November 2022 through 9 December 2022
ER -