IDEAS home Printed from https://ideas.repec.org/a/bla/biomet/v76y2020i4p1374-1382.html
   My bibliography  Save this article

Simple outlier detection for a multi‐environmental field trial

Author

Listed:
  • Emi Tanaka

Abstract

The aim of plant breeding trials is often to identify crop variety that are well adapt to target environments. These varieties are identified through genomic prediction from the analysis of multi‐environmental field trial (MET) using linear mixed models. The occurrence of outliers in MET is common and known to adversely impact the accuracy of genomic prediction yet the detection of outliers are often neglected. A number of reasons stand for this—first, complex data such as a MET give rise to distinct levels of residuals (eg, at a trial level or individual observation level). This complexity offers additional challenges for an outlier detection method. Second, many linear mixed model software packages that cater for complex variance structures needed in the analysis of MET are not well streamlined for diagnostics by practitioners. We demonstrate outlier detection methods that are simple to implement in any linear mixed model software packages and computationally fast. Although these methods are not optimal methods in outlier detection, they offer practical value for ease of application in the analysis pipeline of regularly collected data. These are demonstrated using simulation based on two real bread wheat yield METs. In particular, models that consider analysis of yield trials either independently or jointly (thus borrowing strength across trials) are considered. Case studies are presented to highlight benefit of joint analysis for outlier detection.

Suggested Citation

  • Emi Tanaka, 2020. "Simple outlier detection for a multi‐environmental field trial," Biometrics, The International Biometric Society, vol. 76(4), pages 1374-1382, December.
  • Handle: RePEc:bla:biomet:v:76:y:2020:i:4:p:1374-1382
    DOI: 10.1111/biom.13216
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/biom.13216
    Download Restriction: no

    File URL: https://libkey.io/10.1111/biom.13216?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Gilmour, Arthur & Cullis, Brian & Welham, Sue & Gogel, Beverley & Thompson, Robin, 2004. "An efficient computing strategy for prediction in mixed linear models," Computational Statistics & Data Analysis, Elsevier, vol. 44(4), pages 571-586, January.
    2. Gumedze, Freedom N. & Welham, Sue J. & Gogel, Beverley J. & Thompson, Robin, 2010. "A variance shift model for detection of outliers in the linear mixed model," Computational Statistics & Data Analysis, Elsevier, vol. 54(9), pages 2128-2144, September.
    3. Schützenmeister, André & Piepho, Hans-Peter, 2012. "Residual analysis of linear mixed models using a simulation approach," Computational Statistics & Data Analysis, Elsevier, vol. 56(6), pages 1405-1416.
    4. Alison Smith & Brian Cullis & Robin Thompson, 2001. "Analyzing Variety by Environment Data Using Multiplicative Mixed Models and Adjustments for Spatial Field Trend," Biometrics, The International Biometric Society, vol. 57(4), pages 1138-1147, December.
    5. G. N. Wilkinson & C. E. Rogers, 1973. "Symbolic Description of Factorial Models for Analysis of Variance," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 22(3), pages 392-399, November.
    6. Koller, Manuel, 2016. "robustlmm: An R Package for Robust Estimation of Linear Mixed-Effects Models," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 75(i06).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Brian R. Cullis & Alison B. Smith & Nicole A. Cocks & David G. Butler, 2020. "The Design of Early-Stage Plant Breeding Trials Using Genetic Relatedness," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 25(4), pages 553-578, December.
    2. Payne, Roger W., 1998. "Design keys, pseudo-factors and general balance," Computational Statistics & Data Analysis, Elsevier, vol. 29(2), pages 217-229, December.
    3. Pinho, Luis Gustavo B. & Nobre, Juvêncio S. & Singer, Julio M., 2015. "Cook’s distance for generalized linear mixed models," Computational Statistics & Data Analysis, Elsevier, vol. 82(C), pages 126-136.
    4. Francis K. C. Hui & Samuel Müller & Alan H. Welsh, 2021. "Random Effects Misspecification Can Have Severe Consequences for Random Effects Inference in Linear Mixed Models," International Statistical Review, International Statistical Institute, vol. 89(1), pages 186-206, April.
    5. Bent Nielsen, 2014. "Deviance analysis of age-period-cohort models," Economics Papers 2014-W03, Economics Group, Nuffield College, University of Oxford.
    6. T. Caliński & S. Czajka & Z. Kaczmarek & P. Krajewski & W. Pilarczyk, 2005. "Analyzing Multi-environment Variety Trials Using Randomization-Derived Mixed Models," Biometrics, The International Biometric Society, vol. 61(2), pages 448-455, June.
    7. Rüdiger Lehmann & Michael Lösler & Frank Neitzel, 2020. "Mean Shift versus Variance Inflation Approach for Outlier Detection—A Comparative Study," Mathematics, MDPI, vol. 8(6), pages 1-21, June.
    8. Julio M. Singer & Francisco M.M. Rocha & Juvêncio S. Nobre, 2017. "Graphical Tools for Detecting Departures from Linear Mixed Model Assumptions and Some Remedial Measures," International Statistical Review, International Statistical Institute, vol. 85(2), pages 290-324, August.
    9. Quirin Gehmacher & Juliane Schubert & Fabian Schmidt & Thomas Hartmann & Patrick Reisinger & Sebastian Rösch & Konrad Schwarz & Tzvetan Popov & Maria Chait & Nathan Weisz, 2024. "Eye movements track prioritized auditory features in selective attention to natural speech," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    10. Riehl, Kevin & Kiesel, Florian & Schiereck, Dirk, 2022. "Political and Socioeconomic Factors That Determine the Financial Outcome of Successful Green Innovation," Publications of Darmstadt Technical University, Institute for Business Studies (BWL) 132099, Darmstadt Technical University, Department of Business Administration, Economics and Law, Institute for Business Studies (BWL).
    11. Xiaowen Dai & Libin Jin & Anqi Shi & Lei Shi, 2016. "Outlier detection and accommodation in general spatial models," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 25(3), pages 453-475, August.
    12. Jukka Sundvall & Benjamin James Dyson, 2022. "Breaking the bonds of reinforcement: Effects of trial outcome, rule consistency and rule complexity against exploitable and unexploitable opponents," PLOS ONE, Public Library of Science, vol. 17(2), pages 1-19, February.
    13. Lee, Dae-Jin & Durbán, María, 2012. "Seasonal modulation mixed models for time series forecasting," DES - Working Papers. Statistics and Econometrics. WS ws122519, Universidad Carlos III de Madrid. Departamento de Estadística.
    14. repec:jss:jstsof:23:i07 is not listed on IDEAS
    15. Joel Jorge Nuvunga & Carlos Pereira da Silva & Luciano Antonio de Oliveira & Renato Ribeiro de Lima & Marcio Balestre, 2019. "Bayesian factor analytic model: An approach in multiple environment trials," PLOS ONE, Public Library of Science, vol. 14(8), pages 1-26, August.
    16. Wei Pan & Xianbin Wang & Wenwei Zhou & Bowen Hang & Liwen Guo, 2023. "Linguistic Analysis for Identifying Depression and Subsequent Suicidal Ideation on Weibo: Machine Learning Approaches," IJERPH, MDPI, vol. 20(3), pages 1-12, February.
    17. Alexander Robitzsch, 2020. "L p Loss Functions in Invariance Alignment and Haberman Linking with Few or Many Groups," Stats, MDPI, vol. 3(3), pages 1-38, August.
    18. Sudipto Banerjee & Gregg A. Johnson, 2006. "Coregionalized Single- and Multiresolution Spatially Varying Growth Curve Modeling with Application to Weed Growth," Biometrics, The International Biometric Society, vol. 62(3), pages 864-876, September.
    19. Ferran Orga & Andrew Mitchell & Marc Freixes & Francesco Aletta & Rosa Ma Alsina-Pagès & Maria Foraster, 2021. "Multilevel Annoyance Modelling of Short Environmental Sound Recordings," Sustainability, MDPI, vol. 13(11), pages 1-13, May.
    20. Lee, Dae-Jin, 2017. "A general framework for prediction in penalized regression," DES - Working Papers. Statistics and Econometrics. WS 24607, Universidad Carlos III de Madrid. Departamento de Estadística.
    21. Boby Mathew & Jens Léon & Mikko J Sillanpää, 2018. "Impact of residual covariance structures on genomic prediction ability in multi-environment trials," PLOS ONE, Public Library of Science, vol. 13(7), pages 1-11, July.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:biomet:v:76:y:2020:i:4:p:1374-1382. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0006-341X .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.