Please use this identifier to cite or link to this item: https://hdl.handle.net/11000/38879

Regression-Based Normative Data in Neuropsychology: Using Raw Scores as Observed Response Variable Outperforms Transforming for Normality

Title:
Regression-Based Normative Data in Neuropsychology: Using Raw Scores as Observed Response Variable Outperforms Transforming for Normality
Authors:
Oltra Cucarella, Javier
Pérez Elvira, Rubén
Bonete López, Beatriz
Iñesta Mena, Clara
Sitges Maciá, Esther
de Andrade Moral, Rafael
Editor:
American Psychological Association
Department:
Departamentos de la UMH::Psicología de la Salud
Issue Date:
2025
URI:
https://hdl.handle.net/11000/38879
Abstract:
Regression-based normative data for neuropsychological variables are increasing in popularity over the last years. However, some use raw data while others use transformation when the observed response variable is skewed. This work analyzes how well the linear models fit for each type of variable. We used real data from a sample of n = 163 cognitively healthy individuals and compared the fit of linear regression models for raw scores and for corrected scaled scores. We then simulated a population of 1,000,000 individuals and drew 1,000 random samples of different sizes (n = 100, 200, 5,000, 1,000, 10,000) for seven different scenarios, analyzed the percentage of individuals scoring in the lowest 5%, and analyzed the agreement between models with the Cohen’s κ statistic. Linear models for raw scores and for scaled scores were similar when the model included all the covariates, but barely identified low scores when scaled scores were corrected with covariates taken from different regressions (κ = 0.58). Models with raw scores showed that the expected number of individuals scoring low was close to the expected 5%, whereas models with scaled scores with covariates taken from different regressions were close to 0%. The two models agreed only when the response variable was random symmetrical and uncorrelated with the covariates. When calculating normative data using linear regressions, raw scores should be the preferred choice. If residuals analysis shows that the model does not fit the data well, researchers should consider using nonlinear models. Transforming data for normality of the observed response is discouraged
Keywords/Subjects:
generalized linear model
linear regression
neuropsychological assessment
normative data
residuals
Knowledge area:
CDU: Filosofía y psicología: Psicología
Type of document:
info:eu-repo/semantics/article
Access rights:
info:eu-repo/semantics/openAccess
Attribution-NonCommercial-NoDerivatives 4.0 Internacional
DOI:
https://doi.org/10.1037/pas0001398
Published in:
Psychological Assessment
Aparece en las colecciones:
Artículos- Psicología de la Salud



Creative Commons La licencia se describe como: Atribución-NonComercial-NoDerivada 4.0 Internacional.