TY - JOUR
T1 - The curse of dimensionality of decision-making units
T2 - A simple approach to increase the discriminatory power of data envelopment analysis
AU - Charles, Vincent
AU - Aparicio, Juan
AU - Zhu, Joe
N1 - Publisher Copyright:
© 2019 Elsevier B.V.
PY - 2019/12/16
Y1 - 2019/12/16
N2 - Data envelopment analysis (DEA) is a technique for identifying the best practices of a given set of decision-making units (DMUs) whose performance is categorized by multiple performance metrics that are classified as inputs and outputs. Although DEA is regarded as non-parametric, the sample size can be an issue of great importance in determining the efficiency scores for the evaluated units, empirically, when the use of too many inputs and outputs may result in a significant number of DMUs being rated as efficient. In the DEA literature, empirical rules have been established to avoid too many DMUs being rated as efficient. These empirical thresholds relate the number of variables with the number of observations. When the number of DMUs is below the empirical threshold levels, the discriminatory power among the DMUs may weaken, which leads to the data set not being suitable to apply traditional DEA models. In the literature, the lack of discrimination is often referred to as the “curse of dimensionality”. To overcome this drawback, we provide a simple approach to increase the discriminatory power between efficient and inefficient DMUs using the well-known pure DEA model, which considers either inputs only or outputs only. Three real cases, namely printed circuit boards, Greek banks, and quality of life in Fortune's best cities, have been discussed to illustrate the proposed approach.
AB - Data envelopment analysis (DEA) is a technique for identifying the best practices of a given set of decision-making units (DMUs) whose performance is categorized by multiple performance metrics that are classified as inputs and outputs. Although DEA is regarded as non-parametric, the sample size can be an issue of great importance in determining the efficiency scores for the evaluated units, empirically, when the use of too many inputs and outputs may result in a significant number of DMUs being rated as efficient. In the DEA literature, empirical rules have been established to avoid too many DMUs being rated as efficient. These empirical thresholds relate the number of variables with the number of observations. When the number of DMUs is below the empirical threshold levels, the discriminatory power among the DMUs may weaken, which leads to the data set not being suitable to apply traditional DEA models. In the literature, the lack of discrimination is often referred to as the “curse of dimensionality”. To overcome this drawback, we provide a simple approach to increase the discriminatory power between efficient and inefficient DMUs using the well-known pure DEA model, which considers either inputs only or outputs only. Three real cases, namely printed circuit boards, Greek banks, and quality of life in Fortune's best cities, have been discussed to illustrate the proposed approach.
KW - Banking
KW - Best cities
KW - Data envelopment analysis
KW - Performance
KW - Printed circuit boards
UR - http://www.scopus.com/inward/record.url?scp=85068077311&partnerID=8YFLogxK
U2 - 10.1016/j.ejor.2019.06.025
DO - 10.1016/j.ejor.2019.06.025
M3 - Article
AN - SCOPUS:85068077311
SN - 0377-2217
VL - 279
SP - 929
EP - 940
JO - European Journal of Operational Research
JF - European Journal of Operational Research
IS - 3
ER -