IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v78y2014icp33-42.html
   My bibliography  Save this article

M-regression, false discovery rates and outlier detection with application to genetic association studies

Author

Listed:
  • Lourenço, V.M.
  • Pires, A.M.

Abstract

Robust multiple linear regression methods are valuable tools when underlying classical assumptions are not completely fulfilled. In this setting, robust methods ensure that the analysis is not significantly disturbed by any outlying observation. However, knowledge of these observations may be important to assess the underlying mechanisms of the data. Therefore, a robust outlier test is discussed, together with an adequate false discovery rate correction measure, to be used in the context of multiple linear regression with categorical explanatory variables. The methodology focuses on genetic association studies of quantitative traits, though it has much broader applications. The method is also compared to a benchmark rule from the literature and its good performance is validated by a simulation study and a real data example from a candidate gene study.

Suggested Citation

  • Lourenço, V.M. & Pires, A.M., 2014. "M-regression, false discovery rates and outlier detection with application to genetic association studies," Computational Statistics & Data Analysis, Elsevier, vol. 78(C), pages 33-42.
  • Handle: RePEc:eee:csdana:v:78:y:2014:i:c:p:33-42
    DOI: 10.1016/j.csda.2014.03.019
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947314000966
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2014.03.019?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Yoav Benjamini & Abba M. Krieger & Daniel Yekutieli, 2006. "Adaptive linear step-up procedures that control the false discovery rate," Biometrika, Biometrika Trust, vol. 93(3), pages 491-507, September.
    2. Cerioli, Andrea & Farcomeni, Alessio & Riani, Marco, 2013. "Robust distances for outlier-free goodness-of-fit testing," Computational Statistics & Data Analysis, Elsevier, vol. 65(C), pages 29-45.
    3. Peristera Paschou & Petros Drineas & Jamey Lewis & Caroline M Nievergelt & Deborah A Nickerson & Joshua D Smith & Paul M Ridker & Daniel I Chasman & Ronald M Krauss & Elad Ziv, 2008. "Tracing Sub-Structure in the European American Population with PCA-Informative Markers," PLOS Genetics, Public Library of Science, vol. 4(7), pages 1-13, July.
    4. Cerioli, Andrea & Farcomeni, Alessio, 2011. "Error rates for multivariate outlier detection," Computational Statistics & Data Analysis, Elsevier, vol. 55(1), pages 544-553, January.
    5. Tzeng J-Y. & Byerley W. & Devlin B. & Roeder K. & Wasserman L., 2003. "Outlier Detection and False Discovery Rates for Whole-Genome DNA Matching," Journal of the American Statistical Association, American Statistical Association, vol. 98, pages 236-246, January.
    6. Kenneth Rice & David Spiegelhalter, 2006. "A Simple Diagnostic Plot Connecting Robust Estimation, Outlier Detection, and False Discovery Rates," Journal of Applied Statistics, Taylor & Francis Journals, vol. 33(10), pages 1131-1147.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Axel Gandy & Georg Hahn & Dong Ding, 2020. "Implementing Monte Carlo tests with p‐value buckets," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 47(3), pages 950-967, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Francesco Dotto & Alessio Farcomeni & Luis Angel García-Escudero & Agustín Mayo-Iscar, 2017. "A fuzzy approach to robust regression clustering," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 11(4), pages 691-710, December.
    2. Alessio Farcomeni & Luca Greco, 2015. "S-estimation of hidden Markov models," Computational Statistics, Springer, vol. 30(1), pages 57-80, March.
    3. Pedro Galeano & Daniel Peña, 2019. "Data science, big data and statistics," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 28(2), pages 289-329, June.
    4. Claudio Agostinelli & Luca Greco, 2019. "Weighted likelihood estimation of multivariate location and scatter," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 28(3), pages 756-784, September.
    5. Michael L. Anderson & Fangwen Lu, 2017. "Learning to Manage and Managing to Learn: The Effects of Student Leadership Service," Management Science, INFORMS, vol. 63(10), pages 3246-3261, October.
    6. Daniel Yekutieli, 2008. "Comments on: Control of the false discovery rate under dependence using the bootstrap and subsampling," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 17(3), pages 458-460, November.
    7. Emilio Depetris-Chauvin & Ömer Özak, 2020. "The origins of the division of labor in pre-industrial times," Journal of Economic Growth, Springer, vol. 25(3), pages 297-340, September.
    8. Fernández Guerrico, Sofía, 2021. "The effects of trade-induced worker displacement on health and mortality in Mexico," Journal of Health Economics, Elsevier, vol. 80(C).
    9. Daniel Bjorkegren & Joshua Blumenstock & Omowunmi Folajimi-Senjobi & Jacqueline Mauro & Suraj R. Nair, 2022. "Instant Loans Can Lift Subjective Well-Being: A Randomized Evaluation of Digital Credit in Nigeria," Papers 2202.13540, arXiv.org.
    10. Cattaneo, Cristina & Grieco, Daniela, 2021. "Turning opposition into support to immigration: The role of narratives," Journal of Economic Behavior & Organization, Elsevier, vol. 190(C), pages 785-801.
    11. Blimpo, Moussa P. & Pugatch, Todd, 2021. "Entrepreneurship education and teacher training in Rwanda," Journal of Development Economics, Elsevier, vol. 149(C).
    12. Davide Nicola Continanza & Andrea del Monaco & Marco di Lucido & Daniele Figoli & Pasquale Maddaloni & Filippo Quarta & Giuseppe Turturiello, 2023. "Stacking machine learning models for anomaly detection: comparing AnaCredit to other banking data sets," IFC Bulletins chapters, in: Bank for International Settlements (ed.), Data science in central banking: applications and tools, volume 59, Bank for International Settlements.
    13. Girum Abebe & A Stefano Caria & Marcel Fafchamps & Paolo Falco & Simon Franklin & Simon Quinn, 2021. "Anonymity or Distance? Job Search and Labour Market Exclusion in a Growing African City [Endogenous Stratification in Randomized Experiments]," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 88(3), pages 1279-1310.
    14. Hanck, Christoph, 2011. "Now, whose schools are really better (or weaker) than Germany's? A multiple testing approach," Economic Modelling, Elsevier, vol. 28(4), pages 1739-1746, July.
    15. Luca Barbaglia & Sergio Consoli & Sebastiano Manzan, 2024. "Forecasting GDP in Europe with textual data," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 39(2), pages 338-355, March.
    16. Ghosh Debashis, 2012. "Incorporating the Empirical Null Hypothesis into the Benjamini-Hochberg Procedure," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(4), pages 1-21, July.
    17. Derksen, Laura & Kerwin, Jason Theodore & Reynoso, Natalia Ordaz & Sterck, Olivier, 2021. "Appointments: A More Effective Commitment Device for Health Behaviors," SocArXiv y8gh7, Center for Open Science.
    18. Fehr, Dietmar & Sutter, Matthias, 2019. "Gossip and the efficiency of interactions," Games and Economic Behavior, Elsevier, vol. 113(C), pages 448-460.
    19. Liana Fraenkel & Marilyn Stolar & Jonathan R. Bates & Richard L. Street Jr & Harjinder Chowdhary & Sarah Swift & Ellen Peters, 2018. "Variability in Affect and Willingness to Take Medication," Medical Decision Making, , vol. 38(1), pages 34-43, January.
    20. Marcel Fafchamps & Julien Labonne, 2016. "Using Split Samples to Improve Inference about Causal Effects," NBER Working Papers 21842, National Bureau of Economic Research, Inc.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:78:y:2014:i:c:p:33-42. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.