0
RobustStatistics
PartialOverview,GeneralPrinciples
WernerA.Stahel,ETHZurich
FallSemester2012
1
1 In tr o d u c tio n
1.1
W h a t is R o b u st S ta tis tic s?
Example
Regression:Waterflowofariverat2locations.Simpleregression,13observations
2
WaterFlow
05101520253035404550
0 10 20 30 40 50
Libby
Newgate
3
RegressionLine(LeastSquares)
05101520253035404550
0 10 20 30 40 50
Libby
Newgate
4
RobustlyEstimatedRegressionLine
05101520253035404550
0 10 20 30 40 50
Libby
Newgate
5
RegressionLineafterdroppingtheoutlier,LeastSquares
05101520253035404550
0 10 20 30 40 50
Libby
Newgate
Least SquaresRobust MethodLSQ without outlier
6
MathematicalStatistics:
•Modeldefinesdistributionofvariables,parameterstobeestimated/tested
•Theorydeterminesoptimalproceduresundertheconditionsdefinedbythemodel
•Inpractice,conditionsarenotsatisfied,butmay“holdapproximately”.
Robuststatisticsisthestatisticsofapproximateparametricmodels.
7
1.2
H is to ry 1
Outliers
Rejectionofoutliers:DanielBernoulli(1770),astronomers
Trimmedmean:1821:Drop(k)largest&smallestobs.,averagetheremainingones.
Rejectionrules:Pierce(1852),...:Testextremeobservations(largest&smallest)forbeinganoutlier,Rejectiftestissignificant,averagetheremainingones.
8
RobustnessofTests
Fisher’sexacttheoryforthenormaldistr.−→“monopoly”.E.S.Pearson(1931):Non-robustnessofF-testforvariancesNon-parametrictestsFocusonrobustnessofthelevel,sometimesneglectingpower(butWilcoxontestisgood!)
Robustestimation
Non-robustnessofthemean:Tukey(1960)Minimaxtheory:Huber(1964)InfluenceFunctionandBreakdownpoint:Hampel(1968)Otherapproaches,seelater
9
Books
Huber(1981)−→Huber,Ronchetti(2009)Hampel,Ronchetti,Rousseeuw,Stahel(1986)Maronna,Martin,Yohai(2006)
Rousseeuw,Leroy(1987)Regression