Abstract
Indigenous languages, including those from the Americas, have received very little attention from the machine learning (ML) and natural language processing (NLP) communities. To tackle the resulting lack of systems for these languages and the accompanying social inequalities affecting their speakers, we conduct the second AmericasNLP competition (and the first one in collaboration with NeurIPS), which is centered around speech-to-text translation systems for Indigenous languages of the Americas. The competition features three tasks – (1) automatic speech recognition, (2) text-based machine translation, and (3) speech-to-text translation – and two tracks: constrained and unconstrained. Five Indigenous languages are covered: Bribri, Guarani, Kotiria, Wa’ikhana, and Quechua. In this overview paper, we describe the tasks, tracks, and languages, introduce the baseline and participating systems, and end with a summary of ongoing and future challenges for the automatic translation of Indigenous languages.
| Original language | English |
|---|---|
| Pages (from-to) | 217-232 |
| Number of pages | 16 |
| Journal | Proceedings of Machine Learning Research |
| Volume | 220 |
| State | Published - 2023 |
| Event | 36th Annual Conference on Neural Information Processing Systems - Competition Track, NeurIPS 2022 - Virtual, Online, United States Duration: 28 Nov 2022 → 9 Dec 2022 |
Keywords
- Indigenous languages
- automatic speech recognition
- low-resource languages
- low-resource machine translation
- machine translation
- natural language processing
- speech-to-text translation
Fingerprint
Dive into the research topics of 'Findings of the Second AmericasNLP Competition on Speech-to-Text Translation'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver