Abstract:
|
In this article, we propose a new efficient sparse estimate (ESE) in sufficient dimension reduction utilizing distance covariance. Our method is model-free and does not need any kernel function and bandwidth or slicing selection. Moreover, it can naturally deal with multivariate response scenarios, making it appealing in a modied sequential algorithm that targets the large p small n problems. Compared with screening procedures which only use marginal utility, our method can extract more useful information from the data and is capable of determining the size of the selected sub-model automatically while most of screening procedures cannot. Under mild conditions, based on manifold theories and techniques, it can be shown that our method would perform asymptotically as if the true irrelevant predictors were known, which is referred to as the oracle property. Extensive simulation studies and two real-data examples demonstrate the effectiveness and efficiency of the proposed approach. It is remarkable that the analysis in cardiomyopathy microarray data reveals distinct and interesting findings.
|