Screening and Selection Methods in High-Dimensional Linear Regression Model
Shinpei Imori, Shota Katayama and Hirofumi Wakaki
Abstract
In the present paper, we propose a new variable selection procedure for a high-dimensional
linear model from two perspectives of the true and risk minimizing model selection. The
proposed method consists of two factors: screening and selection. Both parts are based on
the residual sum of squares, which can be easily understood. Our objective is to select a
model consisting of indices of all nonzero regression coefficients, which is known as the true
model when the true mean structure is included in the full model. Moreover, it minimizes the
risk function under a restriction of explanatory variables. Even when the space of the target
model is large, our selection method is consistent under mild conditions, i.e., the selection
probability of the objective model goes to 1. Additionally, we reveal that consistency is
retained when the true mean structure cannot be constructed from all available explanatory
variables. Through simulation studies, we illustrate that our screening and selection methods
are more effective than previous methods in various situations.
Preprint
http://www.math.sci.hiroshima-u.ac.jp/stat/TR/TR14/TR14-01.pdf