Resumen
We present a Tailor Made Machine Learning (TMML) methodology combining different clustering algorithms, spatial statistical methods and cartographic tools. The methodology is currently being programmed in an R package, especially designed to handle multivariate spatial datasets. We highlight the strengths of unsupervised clustering for the management of environmental health phenomena, also pointing out several uncertainty sources affecting the results of our analysis. In particular, we acknowledge that the traditional hierarchical clustering is usually applied without performing dynamic reallocations, integrating spatial key-concepts or discussing the quality of outputs. Therefore we describe the foundations of the TMML methodology, which is applied to deal with these uncertainties, as well as with the variety of possible outputs. The R functions are applied to the spatial dataset included in the package so to illustrate the procedure to apply for identifying the most accurate clustering output, in the context of a sustainable agriculture example in Luxembourg.
Idioma original | Inglés |
---|---|
Páginas | 24-30 |
Número de páginas | 7 |
Estado | Publicada - 2016 |
Evento | 12th International Symposium on Spatial Accuracy Assessment in Natural Resources and Environmental Sciences, Accuracy 2016 - Montpellier, Francia Duración: 5 jul. 2016 → 8 jul. 2016 |
Conferencia
Conferencia | 12th International Symposium on Spatial Accuracy Assessment in Natural Resources and Environmental Sciences, Accuracy 2016 |
---|---|
País/Territorio | Francia |
Ciudad | Montpellier |
Período | 5/07/16 → 8/07/16 |