TY - JOUR
T1 - Prediction in non-sampled areas under spatial small area models
AU - Sikov, Anna
AU - Cerda-Hernandez, José
N1 - Publisher Copyright:
© The Author(s), under exclusive licence to Società Italiana di Statistica 2024.
PY - 2024
Y1 - 2024
N2 - In this article we study the prediction problem in small geographic areas in the situation where the survey data does not cover a substantial percentage of these areas. In such situation, the application of the Spatial Fay–Herriot model may involve a difficult and subtle process of determining neighboring areas. Ambiguity in definition of neighbors can potentially produce a problem of sensitivity of the conclusions to these definitions. In this article, we attempt to remedy this problem by incorporating random effects for higher level administrative divisions into the model. In this setting, only the higher-level random effects are supposed to have spatial correlations. This may potentially reduce the problem of ambiguity in the definition of spatial neighbors, provided that all higher level administrative divisions are represented in the sample. We also show that predicting in non-sampled areas is considerably more straightforward under the proposed model, as opposed to the case where the Spatial Fay–Herriot model is applied. In addition, we propose two new predictors for out-of-sample areas, under the spatial Fay–Herriot model. In order to compare the performance of the aforementioned models, we use the data from the Demographic and Family Health Survey of the year 2021, and the National Census carried out in 2017.
AB - In this article we study the prediction problem in small geographic areas in the situation where the survey data does not cover a substantial percentage of these areas. In such situation, the application of the Spatial Fay–Herriot model may involve a difficult and subtle process of determining neighboring areas. Ambiguity in definition of neighbors can potentially produce a problem of sensitivity of the conclusions to these definitions. In this article, we attempt to remedy this problem by incorporating random effects for higher level administrative divisions into the model. In this setting, only the higher-level random effects are supposed to have spatial correlations. This may potentially reduce the problem of ambiguity in the definition of spatial neighbors, provided that all higher level administrative divisions are represented in the sample. We also show that predicting in non-sampled areas is considerably more straightforward under the proposed model, as opposed to the case where the Spatial Fay–Herriot model is applied. In addition, we propose two new predictors for out-of-sample areas, under the spatial Fay–Herriot model. In order to compare the performance of the aforementioned models, we use the data from the Demographic and Family Health Survey of the year 2021, and the National Census carried out in 2017.
KW - Bootstrap
KW - Direct estimate
KW - Fay–Herriot model
KW - Mean square error
KW - Sensitivity
KW - Spatial autocorrelation
UR - http://www.scopus.com/inward/record.url?scp=85192848793&partnerID=8YFLogxK
U2 - 10.1007/s10260-024-00754-0
DO - 10.1007/s10260-024-00754-0
M3 - Article
AN - SCOPUS:85192848793
SN - 1618-2510
JO - Statistical Methods and Applications
JF - Statistical Methods and Applications
ER -