IDEAS home Printed from https://ideas.repec.org/a/taf/japsta/v33y2006i10p1131-1147.html
   My bibliography  Save this article

A Simple Diagnostic Plot Connecting Robust Estimation, Outlier Detection, and False Discovery Rates

Author

Listed:
  • Kenneth Rice
  • David Spiegelhalter

Abstract

Robust estimation of parameters, and identification of specific data points that are discordant with an assumed model, are often treated as different statistical problems. The two aims are, however, closely inter-related and in many cases the two analyses are required simultaneously. We present a simple diagnostic plot that connects existing robust estimators with simultaneous outlier detection, and uses the concept of false discovery rates to allow for the multiple comparisons induced by considering each point as a potential outlier. It is straightforward to implement, and applicable in any situation for which robust estimation procedures exist. Several examples are given.

Suggested Citation

  • Kenneth Rice & David Spiegelhalter, 2006. "A Simple Diagnostic Plot Connecting Robust Estimation, Outlier Detection, and False Discovery Rates," Journal of Applied Statistics, Taylor & Francis Journals, vol. 33(10), pages 1131-1147.
  • Handle: RePEc:taf:japsta:v:33:y:2006:i:10:p:1131-1147
    DOI: 10.1080/02664760600747002
    as

    Download full text from publisher

    File URL: http://www.tandfonline.com/doi/abs/10.1080/02664760600747002
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1080/02664760600747002?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Li, Bing & Zamar, Ruben H., 1991. "Min-max asymptotic variance of M-estimates of location when scale is unknown," Statistics & Probability Letters, Elsevier, vol. 11(2), pages 139-145, February.
    2. Efron B. & Tibshirani R. & Storey J.D. & Tusher V., 2001. "Empirical Bayes Analysis of a Microarray Experiment," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1151-1160, December.
    3. John D. Storey, 2002. "A direct approach to false discovery rates," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 64(3), pages 479-498, August.
    4. John D. Storey & Jonathan E. Taylor & David Siegmund, 2004. "Strong control, conservative point estimation and simultaneous conservative consistency of false discovery rates: a unified approach," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 66(1), pages 187-205, February.
    5. Tzeng J-Y. & Byerley W. & Devlin B. & Roeder K. & Wasserman L., 2003. "Outlier Detection and False Discovery Rates for Whole-Genome DNA Matching," Journal of the American Statistical Association, American Statistical Association, vol. 98, pages 236-246, January.
    6. Chen-An Tsai & Huey-miin Hsueh & James J. Chen, 2003. "Estimation of False Discovery Rates in Multiple Testing: Application to Gene Microarray Data," Biometrics, The International Biometric Society, vol. 59(4), pages 1071-1081, December.
    7. Adimari, Gianfranco & Ventura, Laura, 2001. "Robust inference for generalized linear models with application to logistic regression," Statistics & Probability Letters, Elsevier, vol. 55(4), pages 413-419, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Lourenço, V.M. & Pires, A.M., 2014. "M-regression, false discovery rates and outlier detection with application to genetic association studies," Computational Statistics & Data Analysis, Elsevier, vol. 78(C), pages 33-42.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Guo Wenge & Peddada Shyamal, 2008. "Adaptive Choice of the Number of Bootstrap Samples in Large Scale Multiple Testing," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 7(1), pages 1-21, March.
    2. Alessio Farcomeni, 2006. "More Powerful Control of the False Discovery Rate Under Dependence," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 15(1), pages 43-73, May.
    3. Nik Tuzov & Frederi Viens, 2011. "Mutual fund performance: false discoveries, bias, and power," Annals of Finance, Springer, vol. 7(2), pages 137-169, May.
    4. Leek Jeffrey T & Storey John D., 2011. "The Joint Null Criterion for Multiple Hypothesis Tests," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 10(1), pages 1-22, June.
    5. Laurent Barras & Olivier Scaillet & Russ Wermers, 2010. "False Discoveries in Mutual Fund Performance: Measuring Luck in Estimated Alphas," Journal of Finance, American Finance Association, vol. 65(1), pages 179-216, February.
    6. Habiger, Joshua D. & Peña, Edsel A., 2014. "Compound p-value statistics for multiple testing procedures," Journal of Multivariate Analysis, Elsevier, vol. 126(C), pages 153-166.
    7. Xiaoquan Wen, 2017. "Robust Bayesian FDR Control Using Bayes Factors, with Applications to Multi-tissue eQTL Discovery," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 9(1), pages 28-49, June.
    8. Alejandro Ochoa & John D Storey & Manuel Llinás & Mona Singh, 2015. "Beyond the E-Value: Stratified Statistics for Protein Domain Prediction," PLOS Computational Biology, Public Library of Science, vol. 11(11), pages 1-21, November.
    9. Chen, Xiongzhi, 2019. "Uniformly consistently estimating the proportion of false null hypotheses via Lebesgue–Stieltjes integral equations," Journal of Multivariate Analysis, Elsevier, vol. 173(C), pages 724-744.
    10. Debashis Ghosh & Wei Chen & Trivellore Raghuanthan, 2004. "The false discovery rate: a variable selection perspective," The University of Michigan Department of Biostatistics Working Paper Series 1040, Berkeley Electronic Press.
    11. Hunt, Daniel L. & Cheng, Cheng & Pounds, Stanley, 2009. "The beta-binomial distribution for estimating the number of false rejections in microarray gene expression studies," Computational Statistics & Data Analysis, Elsevier, vol. 53(5), pages 1688-1700, March.
    12. Kim, Donggyu & Zhang, Chunming, 2014. "Adaptive linear step-up multiple testing procedure with the bias-reduced estimator," Statistics & Probability Letters, Elsevier, vol. 87(C), pages 31-39.
    13. Andrew Y. Chen, 2022. "Do t-Statistic Hurdles Need to be Raised?," Papers 2204.10275, arXiv.org, revised Apr 2024.
    14. Wen Shi & Xi Chen & Jennifer Shang, 2019. "An Efficient Morris Method-Based Framework for Simulation Factor Screening," INFORMS Journal on Computing, INFORMS, vol. 31(4), pages 745-770, October.
    15. Jianqing Fan & Xu Han, 2017. "Estimation of the false discovery proportion with unknown dependence," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(4), pages 1143-1164, September.
    16. Shigeyuki Matsui & Hisashi Noma, 2011. "Estimating Effect Sizes of Differentially Expressed Genes for Power and Sample-Size Assessments in Microarray Experiments," Biometrics, The International Biometric Society, vol. 67(4), pages 1225-1235, December.
    17. Lianming Wang & David B. Dunson, 2010. "Semiparametric Bayes Multiple Testing: Applications to Tumor Data," Biometrics, The International Biometric Society, vol. 66(2), pages 493-501, June.
    18. Ghosh Debashis, 2012. "Incorporating the Empirical Null Hypothesis into the Benjamini-Hochberg Procedure," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(4), pages 1-21, July.
    19. Psaradellis, Ioannis & Laws, Jason & Pantelous, Athanasios A. & Sermpinis, Georgios, 2023. "Technical analysis, spread trading, and data snooping control," International Journal of Forecasting, Elsevier, vol. 39(1), pages 178-191.
    20. Wang Chamont & Gevertz Jana L., 2016. "Finding causative genes from high-dimensional data: an appraisal of statistical and machine learning approaches," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 15(4), pages 321-347, August.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:taf:japsta:v:33:y:2006:i:10:p:1131-1147. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Longhurst (email available below). General contact details of provider: http://www.tandfonline.com/CJAS20 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.