IDEAS home Printed from https://ideas.repec.org/a/bla/jorssb/v61y1999i3p661-680.html
   My bibliography  Save this article

Biased Bootstrap Methods for Reducing the Effects of Contamination

Author

Listed:
  • Peter Hall
  • Brett Presnell

Abstract

Contamination of a sampled distribution, for example by a heavy‐tailed distribution, can degrade the performance of a statistical estimator. We suggest a general approach to alleviating this problem, using a version of the weighted bootstrap. The idea is to ‘tilt’ away from the contaminated distribution by a given (but arbitrary) amount, in a direction that minimizes a measure of the new distribution's dispersion. This theoretical proposal has a simple empirical version, which results in each data value being assigned a weight according to an assessment of its influence on dispersion. Importantly, distance can be measured directly in terms of the likely level of contamination, without reference to an empirical measure of scale. This makes the procedure particularly attractive for use in multivariate problems. It has several forms, depending on the definitions taken for dispersion and for distance between distributions. Examples of dispersion measures include variance and generalizations based on high order moments. Practicable measures of the distance between distributions may be based on power divergence, which includes Hellinger and Kullback–Leibler distances. The resulting location estimator has a smooth, redescending influence curve and appears to avoid computational difficulties that are typically associated with redescending estimators. Its breakdown point can be located at any desired value ε∈ (0, ½) simply by ‘trimming’ to a known distance (depending only on ε and the choice of distance measure) from the empirical distribution. The estimator has an affine equivariant multivariate form. Further, the general method is applicable to a range of statistical problems, including regression.

Suggested Citation

  • Peter Hall & Brett Presnell, 1999. "Biased Bootstrap Methods for Reducing the Effects of Contamination," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 61(3), pages 661-680.
  • Handle: RePEc:bla:jorssb:v:61:y:1999:i:3:p:661-680
    DOI: 10.1111/1467-9868.00199
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/1467-9868.00199
    Download Restriction: no

    File URL: https://libkey.io/10.1111/1467-9868.00199?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Čížek, Pavel, 2012. "Semiparametric robust estimation of truncated and censored regression models," Journal of Econometrics, Elsevier, vol. 168(2), pages 347-366.
    2. Cizek, Pavel, 2008. "Robust and Efficient Adaptive Estimation of Binary-Choice Regression Models," Journal of the American Statistical Association, American Statistical Association, vol. 103, pages 687-696, June.
    3. Hong-Jiang Wu & Ying-Ying Zhang & Han-Yu Li, 2023. "Expectation identities from integration by parts for univariate continuous random variables with applications to high-order moments," Statistical Papers, Springer, vol. 64(2), pages 477-496, April.
    4. Madeleine Cule & Richard Samworth & Michael Stewart, 2010. "Maximum likelihood estimation of a multi‐dimensional log‐concave density," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 72(5), pages 545-607, November.
    5. Marc G. Genton & Peter Hall, 2016. "A tilting approach to ranking influence," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 78(1), pages 77-97, January.
    6. G. Alastair Young, 2003. "Better bootstrapping by constrained prepivoting," Metron - International Journal of Statistics, Dipartimento di Statistica, Probabilità e Statistiche Applicate - University of Rome, vol. 0(2), pages 227-242.
    7. Ferrari, Davide & Zheng, Chao, 2016. "Reliable inference for complex models by discriminative composite likelihood estimation," Journal of Multivariate Analysis, Elsevier, vol. 144(C), pages 68-80.
    8. Peter Hall & D. M. Titterington & Jing‐Hao Xue, 2009. "Tilting methods for assessing the influence of components in a classifier," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 71(4), pages 783-803, September.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jorssb:v:61:y:1999:i:3:p:661-680. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/rssssea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.