TY - JOUR
T1 - Finite mixture of regression models based on multivariate scale mixtures of skew-normal distributions
AU - Benites, Luis
AU - Lachos, Víctor H.
AU - Bolfarine, Heleno
AU - Zeller, Camila B.
N1 - Publisher Copyright:
© The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2025.
PY - 2025
Y1 - 2025
N2 - The traditional estimation of mixture regression models is based on the assumption of component normality (or symmetry), making it sensitive to outliers, heavy-tailed errors, and asymmetric errors. In this work, we propose addressing these issues simultaneously by considering a finite mixture of regression models with multivariate scale mixtures of skew-normal distributions. This approach provides greater flexibility in modeling data, accommodating both skewness and heavy tails. Additionally, the proposed model allows the use of a specific vector of regressors for each dependent variable. The main advantage of using the mixture of regression models under the class of multivariate scale mixtures of skew-normal distributions is their convenient hierarchical representation, which allows easy implementation of inference. We develop a simple expectation–maximization (EM) type algorithm to perform maximum likelihood inference for the parameters of the proposed model. The observed information matrix is derived analytically to calculate standard errors. Some simulation studies are also presented to examine the robustness of this flexible model against outlying observations. Finally, a real dataset is analyzed, demonstrating the practical value of the proposed method. The R scripts implementing our methods are available on the GitHub repository at https://bit.ly/3CLcI1W.
AB - The traditional estimation of mixture regression models is based on the assumption of component normality (or symmetry), making it sensitive to outliers, heavy-tailed errors, and asymmetric errors. In this work, we propose addressing these issues simultaneously by considering a finite mixture of regression models with multivariate scale mixtures of skew-normal distributions. This approach provides greater flexibility in modeling data, accommodating both skewness and heavy tails. Additionally, the proposed model allows the use of a specific vector of regressors for each dependent variable. The main advantage of using the mixture of regression models under the class of multivariate scale mixtures of skew-normal distributions is their convenient hierarchical representation, which allows easy implementation of inference. We develop a simple expectation–maximization (EM) type algorithm to perform maximum likelihood inference for the parameters of the proposed model. The observed information matrix is derived analytically to calculate standard errors. Some simulation studies are also presented to examine the robustness of this flexible model against outlying observations. Finally, a real dataset is analyzed, demonstrating the practical value of the proposed method. The R scripts implementing our methods are available on the GitHub repository at https://bit.ly/3CLcI1W.
KW - EM algorithm
KW - Multivariate finite mixtures
KW - Multivariate scale mixtures of skew-normal distributions
KW - Regression models
UR - https://www.scopus.com/pages/publications/105009545189
U2 - 10.1007/s00180-025-01646-x
DO - 10.1007/s00180-025-01646-x
M3 - Article
AN - SCOPUS:105009545189
SN - 0943-4062
JO - Computational Statistics
JF - Computational Statistics
ER -