Quality data extraction methodology based on the labeling of coffee leaves with nutritional deficiencies

Adolfo Jungbluth, Jon Li Yeng, Luis Vives

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

1 Cita (Scopus)

Resumen

Nutritional deficiencies detection for coffee leaves is a task which is often undertaken manually by experts on the field known as agronomists. The process they follow to carry this task is based on observation of the different characteristics of the coffee leaves while relying on their own experience. Visual fatigue and human error in this empiric approach cause leaves to be incorrectly labeled and thus affecting the quality of the data obtained. In this context, different crowdsourcing approaches can be applied to enhance the quality of the data extracted. These approaches separately propose the use of voting systems, association rule filters and evolutive learning. In this paper, we extend the use of association rule filters and evolutive approach by combining them in a methodology to enhance the quality of the data while guiding the users during the main stages of data extraction tasks. Moreover, our methodology proposes a reward component to engage users and keep them motivated during the crowdsourcing tasks. The extracted dataset by applying our proposed methodology in a case study on Peruvian coffee leaves resulted in 93.33% accuracy with 30 instances collected by 8 experts and evaluated by 2 agronomic engineers with background on coffee leaves. The accuracy of the dataset was higher than independently implementing the evolutive feedback strategy and an empiric approach which resulted in 86.67% and 70% accuracy respectively under the same conditions.

Idioma originalInglés
Título de la publicación alojadaICISDM 2018 - 2nd International Conference on Information System and Data Mining
EditorialAssociation for Computing Machinery
Páginas59-64
Número de páginas6
ISBN (versión digital)9781450363549
DOI
EstadoPublicada - 9 abr. 2018
Publicado de forma externa
Evento2nd International Conference on Information System and Data Mining, ICISDM 2018 - Lakeland, Estados Unidos
Duración: 9 abr. 201811 abr. 2018

Serie de la publicación

NombreACM International Conference Proceeding Series

Conferencia

Conferencia2nd International Conference on Information System and Data Mining, ICISDM 2018
País/TerritorioEstados Unidos
CiudadLakeland
Período9/04/1811/04/18

Huella

Profundice en los temas de investigación de 'Quality data extraction methodology based on the labeling of coffee leaves with nutritional deficiencies'. En conjunto forman una huella única.

Citar esto