Estimating non-overfitted convex production technologies: A stochastic machine learning approach

Maria D. Guillen, Vincent Charles, Juan Aparicio

Producción científica: Contribución a una revistaArtículorevisión exhaustiva

Resumen

Overfitting is a classical statistical issue that occurs when a model fits a particular observed data sample too closely, potentially limiting its generalizability. While Data Envelopment Analysis (DEA) is a powerful non-parametric method for assessing the relative efficiency of decision-making units (DMUs), its reliance on the minimal extrapolation principle can lead to concerns about overfitting, particularly when the goal extends beyond evaluating the specific DMUs in the sample to making broader inferences. In this paper, we propose an adaptation of Stochastic Gradient Boosting to estimate production possibility sets that mitigate overfitting while satisfying shape constraints such as convexity and free disposability. Our approach is not intended to replace DEA but to complement it, offering an additional tool for scenarios where generalization is important. Through simulation experiments, we demonstrate that the proposed method performs well compared to DEA, especially in high-dimensional settings. Furthermore, the new machine learning-based technique is compared to the Corrected Concave Non-parametric Least Squares (C2NLS), showing competitive performance. We also illustrate how the usual efficiency measures in DEA can be implemented under our approach. Finally, we provide an empirical example based on data from the Program for International Student Assessment (PISA) to demonstrate the applicability of the new method.

Idioma originalInglés
Páginas (desde-hasta)224-240
Número de páginas17
PublicaciónEuropean Journal of Operational Research
Volumen323
N.º1
DOI
EstadoPublicada - 16 may. 2025
Publicado de forma externa

Huella

Profundice en los temas de investigación de 'Estimating non-overfitted convex production technologies: A stochastic machine learning approach'. En conjunto forman una huella única.

Citar esto