Please use this identifier to cite or link to this item: https://hdl.handle.net/11000/38617

An adaptation of Random Forest to estimate convex non-parametric production technologies: an empirical illustration of efficiency measurement in education



Title:
An adaptation of Random Forest to estimate convex non-parametric production technologies: an empirical illustration of efficiency measurement in education
Authors:
España Roch, Victor Javier
Aparicio, Juan
Barber i Vallés, Josep Xavier
Publisher:
Wiley
Department:
UMH Departments::Statistics, Mathematics and Computer Science
Publication date:
2025
URI:
https://hdl.handle.net/11000/38617
Abstract:
This paper presents a novel approach to the non-parametric estimation of production technologies that satisfy the basic axioms of production theory, including free disposability of inputs and outputs and convexity. The methodology adapts the machine learning techniques associated with Random Forest and combines them with splines. The new method yields a piecewise linear estimator analogous to data envelopment analysis (DEA), but it addresses DEA's overfitting and lack of robustness by randomizing the data and the input variables used to build the models. The paper also underscores the value of machine learning techniques for assessing the efficiency of public services, particularly educational institutions. The new approach can predict outputs from inputs, even for units not included in the observed sample, and it identifies the inputs most relevant to output production. To demonstrate the advantages of the method, the educational production function is estimated for Spanish regions using data from the Programme for International Student Assessment (PISA).
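As a rough illustration of the idea in the abstract, the sketch below averages DEA-style frontiers fitted on bootstrap resamples. It is not the authors' algorithm: it assumes a single input and a single output, uses plain bagging only (with one input there are no input variables to randomize), omits the spline step, and uses hypothetical data and the hypothetical function names `dea_frontier` and `bagged_frontier`.

```python
import random

def dea_frontier(xs, ys, x):
    """Largest output attainable at input level x under the convex,
    free-disposal hull of the observed (input, output) points."""
    best = None
    n = len(xs)
    for i in range(n):
        if xs[i] <= x:
            # Free disposability: a unit using no more input than x is attainable.
            if best is None or ys[i] > best:
                best = ys[i]
            for j in range(n):
                if xs[j] > x:
                    # Convexity: interpolate between units i and j at input x.
                    t = (x - xs[i]) / (xs[j] - xs[i])
                    y = ys[i] + t * (ys[j] - ys[i])
                    if y > best:
                        best = y
    return best  # None when x lies below every observed input level

def bagged_frontier(xs, ys, x, n_models=200, seed=0):
    """Average the frontier prediction over bootstrap resamples (assumed scheme)."""
    rng = random.Random(seed)
    n = len(xs)
    preds = []
    for _ in range(n_models):
        idx = [rng.randrange(n) for _ in range(n)]
        p = dea_frontier([xs[k] for k in idx], [ys[k] for k in idx], x)
        if p is not None:  # skip resamples that cannot reach input level x
            preds.append(p)
    return sum(preds) / len(preds) if preds else None

# Hypothetical data: five units, one input and one output each.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [1.0, 3.0, 2.0, 5.0, 4.0]
full = dea_frontier(xs, ys, 3.0)
bagged = bagged_frontier(xs, ys, 3.0)
print(full, bagged)
```

Each resample contains a subset of the observed units, so the averaged prediction never exceeds the full-sample frontier. In this toy setting that makes the estimate less sensitive to any single extreme unit.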
Keywords/Subjects:
data envelopment analysis
machine learning
random forest
prediction
importance of variables
Knowledge area:
UDC: Social sciences: Demography. Sociology. Statistics: Statistics
UDC: Pure and natural sciences: Mathematics: Analysis
Document type:
info:eu-repo/semantics/article
Access rights:
info:eu-repo/semantics/closedAccess
Attribution-NonCommercial-NoDerivatives 4.0 International
DOI:
https://doi.org/10.1111/itor.13561
Published in:
International Transactions in Operational Research
Appears in collections:
Articles - Statistics, Mathematics and Computer Science



Creative Commons: the license is described as Attribution-NonCommercial-NoDerivatives 4.0 International.