IDEAS home Printed from https://ideas.repec.org/p/chf/rpseri/rp2448.html
   My bibliography  Save this paper

Multiple Outlier Detection in Samples with Exponential & Pareto Tails

Author

Listed:
  • Didier Sornette

    (Risks-X, Southern University of Science and Technology (SUSTech); Swiss Finance Institute)

  • Ran Wei

    (ETH Zürich)

Abstract

We introduce two ratio-based robust test statistics, max-robust-sum (MRS) and sum-robust-sum (SRS), designed to enhance the robustness of outlier detection in samples with exponential or Pareto tails. We also reintroduce the inward sequential testing method-formerly relegated since the introduction of outward testing-and show that MRS and SRS tests reduce susceptibility of the inward approach to masking, making the inward test as powerful as, and potentially less error-prone than, outward tests. Moreover, inward testing does not require the complicated type I error control of outward tests. A comprehensive comparison of the test statistics is done, considering performance of the proposed tests in both block and sequential tests, and contrasting their performance with classical test statistics across various data scenarios. In five case studies-financial crashes, nuclear power generation accidents, stock market returns, epidemic fatalities, and city sizes-significant outliers are detected and related to the concept of 'Dragon King' events, defined as meaningful outliers that arise from a unique generating mechanism.

Suggested Citation

  • Didier Sornette & Ran Wei, 2024. "Multiple Outlier Detection in Samples with Exponential & Pareto Tails," Swiss Finance Institute Research Paper Series 24-48, Swiss Finance Institute.
  • Handle: RePEc:chf:rpseri:rp2448
    as

    Download full text from publisher

    File URL: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4957557
    Download Restriction: no
    ---><---

    More about this item

    Keywords

    Outlier detection; Exponential sample; Pareto sample; Dragon King; Extreme Value Theory;
    All these keywords.

    JEL classification:

    • C10 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - General
    • C46 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics - - - Specific Distributions
    • C49 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics - - - Other
    • G01 - Financial Economics - - General - - - Financial Crises

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:chf:rpseri:rp2448. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Ridima Mittal (email available below). General contact details of provider: https://edirc.repec.org/data/fameech.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.