Scientific article
English

Robust Linear Model Selection by Cross-Validation

Published inJournal of the American Statistical Association, vol. 92, no. 439, p. 1017-1023
Publication date1997
Abstract

This article gives a robust technique for model selection in regression models, an important aspect of any data analysis involving regression. There is a danger that outliers will have an undue influence on the model chosen and distort any subsequent analysis. We provide a robust algorithm for model selection using Shao's cross-validation methods for choice of variables as a starting point. Because Shao's techniques are based on least squares, they are sensitive to outliers. We develop our robust procedure using the same ideas of cross-validation as Shao but using estimators that are optimal bounded influence for prediction. We demonstrate the effectiveness of our robust procedure in providing protection against outliers both in a simulation study and in a real example. We contrast the results with those obtained by Shao's method, demonstrating a substantial improvement in choosing the correct model in the presence of outliers with little loss of efficiency at the normal model.

Keywords
  • Bounded influence
  • Construction sample
  • Outliers
  • Prediction error
  • Robust prediction
  • Validation sample
Citation (ISO format)
RONCHETTI, Elvezio, FIELD, Christopher, BLANCHARD, Wade. Robust Linear Model Selection by Cross-Validation. In: Journal of the American Statistical Association, 1997, vol. 92, n° 439, p. 1017–1023. doi: 10.1080/01621459.1997.10474057
Identifiers
750views
0downloads

Technical informations

Creation01/10/2012 10:44:00
First validation01/10/2012 10:44:00
Update14/03/2023 17:42:08
Status update14/03/2023 17:42:08
Last indexation29/10/2024 20:43:11
All rights reserved by Archive ouverte UNIGE and the University of GenevaunigeBlack