Technical Papers
Mar 29, 2016

Multiple Outlier Detection: Hypothesis Tests versus Model Selection by Information Criteria

Publication: Journal of Surveying Engineering
Volume 142, Issue 4

Abstract

The detection of multiple outliers can be interpreted as a model selection problem. Models that can be selected are the null model, which indicates an outlier free set of observations, or a class of alternative models, which contain a set of additional bias parameters. A common way to select the right model is by using a statistical hypothesis test. In geodesy data snooping is most popular. Another approach arises from information theory. Here, the Akaike information criterion (AIC) is used to select an appropriate model for a given set of observations. The AIC is based on the Kullback-Leibler divergence, which describes the discrepancy between the model candidates. Both approaches are discussed and applied to test problems: the fitting of a straight line and a geodetic network. Some relationships between data snooping and information criteria are discussed. When compared, it turns out that the information criteria approach is more simple and elegant. Along with AIC there are many alternative information criteria for selecting different outliers, and it is not clear which one is optimal.

Get full access to this article

View all available purchase options and get full access to this article.

References

Abdi, H. (2007). “The Bonferroni and Šidák corrections for multiple comparisons.” Encyclopedia of measurement and statistics, N. Salkind, ed., Sage, Thousand Oaks, CA.
Akaike, H. (1974). “A new look at the statistical model identification.” IEEE Trans. Autom. Control, 19(6), 716–723.
Atkinson, A. C., and Riani, M. (2008). “A robust and diagnostic information criterion for selecting regression models.” J. Japan Stat. Soc., 38(1), 3–14.
Baarda, W. (1968). A testing procedure for use in geodetic networks, Vol. 2, Number 5, Netherlands Geodetic Commission, Publication on Geodesy, Delft, Netherlands.
Baselga, S. (2011). “Nonexistence of rigorous tests for multiple outlier detection in least-squares adjustment.” J Surv Eng, 109–112.
Beckman, R. J., and Cook, R. D. (1983). “Outlier….s.” Technometrics, 25(2), 119–149.
Blais, J. A. R. (1991). “On some model identification strategies using information theory.” Manuscr. Geodaet., 16(5), 326–332.
Burnham, K. P., and Anderson, D. R. (2002). Model selection and multimodel inference: A practical information-theoretic approach, Springer, Berlin.
Burnham, K. P., and Anderson, D. R. (2004). “Multimodel inference: Understanding AIC and BIC in model selection.” Sociol. Methods Res., 33(2), 261–304.
Ding, X., and Coleman, R. (1996). “Multiple outlier detection by evaluating redundancy contributions of observations.” J. Geod., 70(8), 489–498.
Felus, Y. A., and Felus, M. (2009). “On choosing the right coordinate transformation method.” Proc., FIG Working Week 2009, Eilat, Israel.
Fung, W. K. (1993). “A quasi-Bayesian analysis of regression outliers using Akaike's predictive likelihood.” Stat. Pap., 34(1), 133–141.
Hahn, M., Heck, B., Jäger, R., and Scheuring, R. (1989). “Ein Verfahren zur Abstimmung der Signifikanzniveaus für allgemeine Fm,n -verteilte Teststatistiken.” Teil I: Theorie, ZfV, 114, 234–248.
Hahn, M., Heck, B., Jäger, R., and Scheuring, R. (1991). “Ein Verfahren zur Abstimmung der Signifikanzniveaus für allgemeine Fm,n-verteilte Teststatistiken.” Teil II: Anwendungen, ZfV, 116, 15–26.
Heck, B. (1981). “Der Einfluß einzelner Beobachtungen auf das Ergebnis einer Ausgleichung und die Suche nach Ausreißern in den Beobachtungen.” AVN, 88(1), 17–34.
Hurvich, C. M., and Tsai, C. L. (1989). “Regression and time series model selection in small samples.” Biometrika, 76(2), 297–307.
Kitagawa, G. (1979). “On the use of AIC for the detection of outliers.” Technometrics, 21(2), 193–199.
Klees, R., Ditmar, P., and Broersen, P. (2002). “How to handle colored observation noise in large least-squares problems.” J. Geod., 76(11), 629–640.
Knight, N. L., Jinling, W. J., and Rizos, C. (2010). “Generalised measures of reliability for multiple outliers.” J. Geod., 84(10), 625–635.
Koch, K. R. (1999). Parameter estimation and hypothesis testing in linear models, 2nd Ed., Springer, Heidelberg.
Kok, J. J. (1984). “On data snooping and multiple outlier testing.” NOAA Tech. Rep. NOS NGS 30, National Geodetic Information Center, NOS/NOAA, Rockville, MD.
Kornacki, A. (2014). “Application of the Akaike criterion to detect outliers for the analysis of ash content in barley straw.” Int. Agrophys., 28(2), 257–260.
Kotsakis, C. (2012). “Reference frame stability and nonlinear distortion in minimum-constrained network adjustment.” J. Geod., 86(9), 755–774.
Lehmann, R. (2012). “Improved critical values for extreme normalized and studentized residuals in Gauss–Markov models.” J. Geod., 86(16), 1137–1146.
Lehmann, R. (2013a). “On the formulation of the alternative hypothesis for geodetic outlier detection.” J. Geod., 87(4), 373–386.
Lehmann, R. (2013b). “The 3σ-rule for outlier detection from the viewpoint of geodetic adjustment.” J. Surv. Eng., 157–165.
Lehmann, R. (2014). “Transformation model selection by multiple hypotheses testing.” J. Geod., 88(12), 1117–1130.
Lehmann, R. (2015). “Observation error model selection by information criteria vs. normality testing.” Stud. Geophys. Geod., 59(4), 489–504.
Lehmann, R., and Scheffler, T. (2011). “Monte Carlo based data snooping with application to a geodetic network.” J. Appl. Geod., 5(3–4), 123–134.
Lösler, M., Haas, R., and Eschelbach, C. (2016). “Terrestrial monitoring of a radio telescope reference point using comprehensive uncertainty budgeting–Investigations during CONT14 at the Onsala Space Observatory.” J. Geod., in press.
Lösler, M., Neidhardt, A., Mähler, S. (2013). “Impact of different observation strategies on reference point determination–Evaluations from a campaign at the Geodetic Observatory Wettzell.” Proc., 21th European VLBI for Geodesy and Astrometry (EVGA) Working Meeting, Espoo, Finland, N. Zubko and M. Poutanen, eds. Finnish Geodetic Institute, Helsinki, Finland.
Luo, X., Mayer, M., and Heck, B. (2011). “Verification of ARMA identification for modelling temporal correlations of GNSS observations using the ARMASA toolbox.” Stud. Geophys. Geod., 55, 537–556.
Luo, X., Mayer, M., and Heck, B. (2012). “Analysing time series of GNSS residuals by means of AR(I)MA processes.” VII Hotine-Marussi Symp. on Mathematical Geodesy, Int. Association of Geodesy Symposia 137, N. Sneeuw et al., eds., Springer-Verlag, Berlin.
Mierlo, J. (1983). “Problems of computing costs in decision problems.” Mathematical models of geodetic/photogrammetric point determination with regard to outliers and systematic errors, F. E. Ackermann, ed., DGK Reihe A, München, Germany.
Pope, A. J. (1976). “The statistics of residuals and the detection of outliers.” NOAA Tech. Rep. NOS65 NGS1, U.S. Dept. of Commerce, National Geodetic Survey, Rockville, MD.
Pynnönen, S. (1992). “Detection of outliers in regression analysis by information criteria.” Proc., Univ. of Vaasa/Discussion Papers, Vol. 146, Vaasa, Finland.
Rousseeuw, P. J., and Leroy, A. M. (1987). Robust regression and outlier detection, John Wiley & Sons, New York.
Teunissen, P. J. G. (2000). Testing theory: An introduction, 2nd Ed., Series on Mathematical Geodesy and Positioning, Delft Univ. of Technology, Delft, Netherlands.
Teunissen, P. J. G., and de Bakker, P. F. (2013). “Single-receiver single-channel multi-frequency GNSS integrity: Outliers, slips, and ionospheric disturbances.” J. Geod., 87(2), 161–177.
Ueda, T. (2009). “A simple method for the detection of outliers. EJASA, Electron.” J. App. Stat. Anal., 2(1), 67–76.
Wang, J., Almagbile, A., Wu, Y., and Tsujii, T. (2012). “Correlation analysis for fault detection statistics in integrated GNSS/INS systems.” J. Global Positioning Syst., 11(2), 89–99.
Wang, J., and Knight, N. (2012). “New outlier separability test and its application in GNSS positioning.” J. Global Positioning Syst., 11(1), 46–57.
Yang, L., Wang, J., Knight, N. L., and Shen, Y. (2013). “Outlier separability analysis with a multiple alternative hypotheses test.” J. Geod., 87(6), 591–604.

Information & Authors

Information

Published In

Go to Journal of Surveying Engineering
Journal of Surveying Engineering
Volume 142Issue 4November 2016

History

Received: Aug 28, 2015
Accepted: Feb 16, 2016
Published online: Mar 29, 2016
Discussion open until: Aug 29, 2016
Published in print: Nov 1, 2016

Permissions

Request permissions for this article.

Authors

Affiliations

Rüdiger Lehmann [email protected]
Lecturer, Faculty of Spatial Information, Univ. of Applied Sciences Dresden, Friedrich-List-Platz 1, Dresden D-01069, Germany (corresponding author). E-mail: [email protected]
Michael Lösler [email protected]
Scientific Coworker, Faculty of Architecture, Civil Engineering and Geomatics, Laboratory for Industrial Metrology, Frankfurt Univ. of Applied Sciences, Nibelungenplatz 1, Frankfurt am Main D-60318, Germany. E-mail: [email protected]

Metrics & Citations

Metrics

Citations

Download citation

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.

Cited by

View Options

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Media

Figures

Other

Tables

Share

Share

Copy the content Link

Share with email

Email a colleague

Share