Efficient robust doubly adaptive regularized regression with applications

Abstract

We consider the problem of estimation and variable selection for general linear regression models. Regularized regression procedures have been widely used for variable selection, but most existing methods perform poorly in the presence of outliers. We construct a new penalized procedure that simultaneously attains full efficiency and maximum robustness. Furthermore, the proposed procedure satisfies the oracle properties. The new procedure is designed to achieve sparse and robust solutions by imposing adaptive weights on both the decision loss and the penalty function. The proposed method of estimation and variable selection attains full efficiency when the model is correct and, at the same time, achieves maximum robustness when outliers are present. We examine the robustness properties using the finite-sample breakdown point and an influence function. We show that the proposed estimator attains the maximum breakdown point. Furthermore, there is no loss in efficiency when there are no outliers or the error distribution is normal. For practical implementation of the proposed method, we present a computational algorithm. We examine the finite-sample and robustness properties using Monte Carlo studies. Two datasets are also analyzed.

Keywords

Regularized regression variable selection efficiency robustness

Get full access to this article

View all access options for this article.

References

Frank

Friedman

. A statistical view of some chemometrics regression tools. Technometrics 1993; 35: 109–135.

Tibshirani

. Regression shrinkage and selection via the lasso. J Royal Stat Soc Ser B 1996; 58: 267–288.

Fan

. Variable selection via nonconcave penalized likelihood and its oracle properties. J Am Stat Assoc 2001; 96: 1348–1360.

Zou

. The adaptive lasso and its oracle properties. J Am Stat Assoc 2006; 101: 1418–1429.

Zou

Hastie

. Regularization and variable selection via the elastic net. J Royal Stat Soc Ser B 2005; 67: 301–320.

Zou

Zhang

. On the adaptive elastic-net with a diverging number of parameters. Ann Stat 2009; 37: 1733–1751.

Zhang

. Nearly unbiased variable selection under minimax concave penalty. Ann Stat 2010; 38: 894–942.

Huber

. Robust statistics, New York, NY: Wiley, 1981.

Wang

Jiang

. Robust regression shrinkage and consistent variable selection through the LAD-Lasso. J Business Econ Stat 2007; 25: 347–355.

10.

Peng

Zhu

. Nonconcave penalized m-estimation with a diverging number of parameters. Statistica Sinica 2011; 21: 391–419.

11.

Lambert-Lacroix

Zwald

. Robust regression through the Huber’s criterion and adaptive lasso penalty. Electronic J Stat 2011; 5: 1015–1053.

12.

Arslan

. Weighted LAD-LASSO method for robust parameter estimation and variable selection in regression. Computat Stat Data Analys 2012; 56: 1952–1965.

13.

Liu

. Variable selection in quantile regression. Stat Sinica 2009; 19: 801–817.

14.

Wang

. Quantile regression for analyzing heterogeneity in ultra-high dimension. J Am Stat Assoc 2012; 107: 214–222.

15.

Kai

Zou

. New efficient estimation and variable selection methods for semiparametric varying-coefficient partially linear models. Ann Stat 2011; 39: 305–332.

16.

Zou

Yuan

. Composite quantile regression and the oracle model selection theory. Ann Stat 2008; 36: 1108–1126.

17.

Johnson

Peng

. Rank-based variable selection. J Nonparametric Stat 2008; 20: 241–252.

18.

Wang

. Weighted Wilcoxon-type smoothly clipped absolute deviation method. Biometrics 2009; 65: 564–571.

19.

Leng

. Variable selection and coefficient estimation via regularized rank regression. Stat Sinica 2010; 20: 167–181.

20.

Chen

Wang

McKeown

. Asymptotic analysis of robust LASSOs in the presence of noise with large variance. IEEE Transact Inform Theory 2010; 56: 5131–5149.

21.

Bradic

Fan

Wang

. Penalized composite quasi-likelihood for ultrahigh dimensional variable selection. J Royal Stat Soc Ser B 2011; 73: 325–349.

22.

Wang

Jiang

Huang

et al.

Robust variable selection with exponential squared loss. J Am Stat Assoc 2013; 108: 632–643.

23.

Fan

Barut

. Adaptive robust variable selection. Ann Stat 2014; 42: 324–351.

24.

Alfons

Croux

Gelper

. Sparse least trimmed squares regression for analyzing high-dimensional large data sets. Ann Appl Stat 2013; 7: 226–248.

25.

Öllerer

Croux

Alfons

. The influence function of penalized regression estimators. Statistics 2015; 49: 741–765.

26.

Smucler

Yohai

. Robust and sparse estimators for linear regression model. Computat Stat Data Analys 2017; 111: 116–130.

27.

Yohai

. High breakdown-point and high efficiency robust estimates for regression. Ann Stat 1987; 15: 642–656.

28.

Loh

P-L

. Statistical consistency and asymptotic normality for high-dimensional robust M-estimators. Ann Stat 2017; 45: 866–896.

29.

Hample

. The influence curve and its role in robust estimation. J Am Stat Assoc 1974; 69: 383–393.

30.

Huber

. Final sample breakdown of M- and P-estimators. Ann Stat 1984; 12: 119–126.

31.

Maronna

Martin

Yohai

. Robust statistics: theory and methods, New York, NY: Wiley, 2006.

32.

Gervini

Yohai

. A class of robust and fully efficient regression estimators. Ann Stat 2002; 30: 583–616.

33.

Maronna

Yohai

. Robust and efficient estimation of multivariate scatter and location. Computat Stat Data Analys 2017; 109: 64–75.

34.

Christensen

Sun

. Alternative goodness-of-fit tests for linear models. J Am Stat Assoc 2010; 105: 291–301.

35.

Le Cam

. Théorie Asymptotique de la Décision Statistique, Les Presses de l’Université de Montréal, 1969.

36.

Bickel

Klaassen

Ritov

et al.

Efficient and adaptive estimation for semiparametric models, New York, NY: Springer, 1998.

37.

van der Vaart

. Asymptotic statistics, Cambridge, UK: Cambridge University Press, 2000.

38.

Zou

. One-step sparse estimates in nonconcave penalized likelihood models. Ann Stat 2008; 36: 1509–1533.

39.

Efron

Hastie

Johnstone

et al.

Least angle regression. Ann Stat 2004; 32: 407–499.

40.

Friedman

Hastie

Höfling

et al.

Pathwise coordinate optimization. Ann Appl Stat 2007; 1: 302–332.

41.

Boyd

Parikh

Chu

et al.

Distributed optimization and statistical learning via the alternating direction method of multipliers. Foundations Trends Mach Learn 2011; 3: 1–122.

42.

Arnold

Tibshirani

. Efficient implementations of the generalized lasso dual path algorithm. J Computat Graphic Stat 2016; 25: 1–27.

43.

Zhu

. An augmented ADMM algorithm with application to the generalized lasso problem. J Computat Graph Stat 2017; 26: 195–204.

44.

Hampel

Ronchetti

Rousseeuw

et al.

Robust statistics: the approach based on influence functions, New York, NY: Wiley, 1986.

45.

Hampel

. A global qualitative definition of robustness. Ann Math Stat 1971; 42: 1887–1895.

46.

Donoho

Huber

The notion of breakdown point. In: Bickel

Doksum

Hodges

JL, Jr.

(eds). A Festschrift for E. L. Lehmann, Belmont, CA: Wadsworth, 1983, pp. 157–184.

47.

Rousseeuw

Yohai

. Robust regression by means of S-estimators. In: Robust and nonlinear time series analysis. Lecture Notes in Statistics, New York, NY: Springer, 1984, pp. 256–272.

48.

Gervini

. A robust and efficient adaptive reweighted estimator of multivariate location and scatter. J Multivariate Analys 2003; 84: 116–144.

49.

Rousseeuw

Leroy

. Robust regression and outlier detection, New York, NY: Wiley, 1987.

50.

Martin

Yohai

Zamar

. Min-max bias regression. Ann Stat 1989; 17: 1608–1630.

51.

Yohai

Zamar

. A minimax-bias property of the least-quantile estimates. Ann Stat 1993; 21: 1824–1842.

52.

Rousseeuw

Driessen

. A fast algorithm for the minimum covariance determinant estimator. Technometrics 1999; 41: 212–223.

53.

She

Owen

. Outlier detection using nonconvex penalized regression. J Am Stat Assoc 2011; 106: 626–639.

54.

Wei L. Simultaneous variable selection and outlier detection using LASSO with applications to aircraft landing data analysis. PhD Thesis: Rutgers University, NJ, 2012.

55.

Linden

. The challenges and promise of neuroimaging in psychiatry. Neuron 2012; 73: 8–22.

56.

Nestler

Hyman

. Animal models of neuropsychiatric disorders. Nat Neurosci 2010; 13: 1161–1169.

57.

Wang

Zang

et al.

Changes in hippocampal connectivity in the early stages of Alzheimer’s disease: evidence from resting state fMRI. Neuroimage 2006; 31: 496–504.

58.

Greicius

Flores

Menon

et al.

Resting-state functional connectivity in major depression: abnormally increased contributions from subgenual cingulate cortex and thalamus. Biol Psychiatr 2007; 62: 429–437.

59.

Rombouts

Damoiseaux

Goekoop

et al.

Model free group analysis shows altered BOLD FMRI networks in dementia. Human Brain Map 2009; 30: 256–266.

60.

ADHD-200

Consortium

. The ADHD-200 consortium: a model to advance the translationa l potential of neuroimaging in clinical neuroscience. Frontiers Syst Neurosci 2012; 6: 62–62.

61.

Bellec

Chu

Chouinard-Decorte

et al.

The Neuro bureau ADHD-200 preprocessed repository. Neuroimage 2017; 144: 275–286.

62.

Tzourio-Mazoyer

Landeau

Papathanassiou

et al.

Automated anatomical labeling of activations in SPM using a macroscopic anatomical parcellation of the MNI MRI single-subject brain. Neuroimage 2002; 15: 273–289.

63.

Tian

Jiang

Wang

et al.

Altered resting-state functional connectivity patterns of anterior cingulate cortex in adolescents with attention deficit hyperactivity disorder. Neurosci Lett 2006; 400: 39–43.

64.

Castellanos

Margulies

Kelly

et al.

Cingulate-precuneus interactions: a new locus of dysfunction in adult attention-deficit/hyperactivity disorder. Biol Psychiatr 2008; 63: 332–337.

65.

McDonald

Schwing

. Instabilities of regression estimates relating air pollution to mortality. Technometrics 1973; 15: 463–482.

66.

McCullagh

Nelder

. Generalized linear models., 2nd ed. London: Chapman and Hall, 1989.

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.00 MB

0.33 MB