
How Much Data Do You Need? An Operational, Pre-Asymptotic Metric for Fat-tailedness

Author

Listed:
  • Nassim Nicholas Taleb

Abstract

This note presents an operational measure of fat-tailedness for univariate probability distributions, taking values in $[0,1]$, where 0 is maximally thin-tailed (Gaussian) and 1 is maximally fat-tailed. Among other uses, it 1) helps assess the comparative sample size $n$ needed for statistical significance, 2) allows practical comparisons across classes of fat-tailed distributions, and 3) helps understand some inconsistent attributes of the lognormal, depending on the parametrization of its scale parameter. The literature is rich concerning asymptotic behavior, but there is a large void for finite values of $n$, those needed for operational purposes. Conventional measures of fat-tailedness, namely 1) the tail index for the power law class and 2) kurtosis for finite-moment distributions, fail to apply to some distributions and do not allow comparisons across classes and parametrizations, that is, between power laws outside the Levy-Stable basin, between power laws and distributions in other classes, or between power laws for different numbers of summands. How can one compare a sum of 100 Student T distributed random variables with 3 degrees of freedom to one in a Levy-Stable or a Lognormal class? How can one compare a sum of 100 Student T with 3 degrees of freedom to a single Student T with 2 degrees of freedom? We propose an operational and heuristic measure that allows us to compare $n$-summed independent variables under all distributions with finite first moment. The method is based on the rate of convergence of the law of large numbers for finite sums, $n$-summands specifically. We get either explicit expressions or simulation results and bounds for the lognormal, exponential, Pareto, and Student T distributions in their various calibrations, in addition to the general Pearson classes.
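
The abstract describes the measure only in words: it is built from how fast the law of large numbers takes hold as the number of summands grows. The Python sketch below illustrates one way such a quantity could be estimated by simulation. It assumes the metric has the form kappa(n0, n) = 2 - (log n - log n0) / log(M(n) / M(n0)), where M(n) is the mean absolute deviation of a sum of n independent draws. That formula is not given on this page and should be checked against the full text. The function names, the Pareto tail exponent of 1.5, the trial count, and the seed are all illustrative choices.

```python
import math
import random


def mean_abs_dev_of_sum(sampler, n, trials, rng):
    """Monte Carlo estimate of M(n) = E|S_n - E[S_n]|, with S_n a sum of n draws.

    The unknown mean of S_n is replaced by the sample mean of the simulated sums.
    """
    sums = [sum(sampler(rng) for _ in range(n)) for _ in range(trials)]
    center = sum(sums) / trials
    return sum(abs(s - center) for s in sums) / trials


def kappa(sampler, n0, n, trials=20000, seed=0):
    """Assumed form of the metric: 2 - (log n - log n0) / log(M(n) / M(n0)).

    Near 0 when M(n) grows like sqrt(n) (Gaussian-like behavior),
    near 1 when M(n) grows like n (no averaging benefit from more data).
    """
    rng = random.Random(seed)
    m_n0 = mean_abs_dev_of_sum(sampler, n0, trials, rng)
    m_n = mean_abs_dev_of_sum(sampler, n, trials, rng)
    return 2.0 - (math.log(n) - math.log(n0)) / math.log(m_n / m_n0)


def gaussian(rng):
    # Standard normal draw: the thin-tailed benchmark.
    return rng.gauss(0.0, 1.0)


def pareto_15(rng):
    # Pareto draw with tail exponent 1.5: finite mean, infinite variance.
    return rng.paretovariate(1.5)


kappa_gauss = kappa(gaussian, 1, 10)
kappa_pareto = kappa(pareto_15, 1, 10)
print("Gaussian kappa(1, 10):", round(kappa_gauss, 3))
print("Pareto(1.5) kappa(1, 10):", round(kappa_pareto, 3))
```

Under these assumptions the Gaussian estimate should sit close to 0 and the Pareto estimate clearly above it. Estimates for very fat-tailed samplers are themselves noisy, so a single run with a modest trial count is only indicative.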

Suggested Citation

  • Nassim Nicholas Taleb, 2018. "How Much Data Do You Need? An Operational, Pre-Asymptotic Metric for Fat-tailedness," Papers 1802.05495, arXiv.org, revised Nov 2018.
  • Handle: RePEc:arx:papers:1802.05495

    Download full text from publisher

    File URL: http://arxiv.org/pdf/1802.05495
    File Function: Latest version
    Download Restriction: no

    References listed on IDEAS

    1. Bouchaud,Jean-Philippe & Potters,Marc, 2003. "Theory of Financial Risk and Derivative Pricing," Cambridge Books, Cambridge University Press, number 9780521819169, January.
    2. Pinelis, Iosif, 2015. "Characteristic function of the positive part of a random variable and related results, with applications," Statistics & Probability Letters, Elsevier, vol. 106(C), pages 281-286.
    3. Xavier Gabaix, 2009. "Power Laws in Economics and Finance," Annual Review of Economics, Annual Reviews, vol. 1(1), pages 255-294, May.
    4. Dagum, Camilo, 1980. "Inequality Measures between Income Distributions with Applications," Econometrica, Econometric Society, vol. 48(7), pages 1791-1803, November.

    Citations


    Cited by:

    1. Nassim Nicholas Taleb, 2019. "On the Statistical Differences between Binary Forecasts and Real World Payoffs," Papers 1907.11162, arXiv.org, revised Dec 2019.
    2. Taleb, Nassim Nicholas, 2020. "On the statistical differences between binary forecasts and real-world payoffs," International Journal of Forecasting, Elsevier, vol. 36(4), pages 1228-1240.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Taleb, Nassim Nicholas, 2019. "How much data do you need? An operational, pre-asymptotic metric for fat-tailedness," International Journal of Forecasting, Elsevier, vol. 35(2), pages 677-686.
    2. Paulo Ferreira & Éder J.A.L. Pereira & Hernane B.B. Pereira, 2020. "From Big Data to Econophysics and Its Use to Explain Complex Phenomena," JRFM, MDPI, vol. 13(7), pages 1-10, July.
    3. Taleb, Nassim Nicholas & Douady, Raphael, 2015. "On the super-additivity and estimation biases of quantile contributions," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 429(C), pages 252-260.
    4. Toda, Alexis Akira, 2012. "The double power law in income distribution: Explanations and evidence," Journal of Economic Behavior & Organization, Elsevier, vol. 84(1), pages 364-381.
    5. Giacomo Bormetti & Sofia Cazzaniga, 2014. "Multiplicative noise, fast convolution and pricing," Quantitative Finance, Taylor & Francis Journals, vol. 14(3), pages 481-494, March.
    6. Federica De Domenico & Giacomo Livan & Guido Montagna & Oreste Nicrosini, 2023. "Modeling and Simulation of Financial Returns under Non-Gaussian Distributions," Papers 2302.02769, arXiv.org.
    7. Abduraimova, Kumushoy, 2022. "Contagion and tail risk in complex financial networks," Journal of Banking & Finance, Elsevier, vol. 143(C).
    8. SAITO Yukiko, 2013. "Role of Hub Firms in Geographical Transaction Network," Discussion papers 13080, Research Institute of Economy, Trade and Industry (RIETI).
    9. Fabrizio Pomponio & Frédéric Abergel, 2013. "Multiple-limit trades : empirical facts and application to lead-lag measures," Post-Print hal-00745317, HAL.
    10. Foellmi, Reto & Martínez, Isabel Z., 2014. "Volatile Top Income Shares in Switzerland? Reassessing the Evolution Between 1981 and 2009," CEPR Discussion Papers 10006, C.E.P.R. Discussion Papers.
    11. Dominik Prochniewicz & Jacek Kudrys & Kamil Maciuk, 2022. "Noises in Double-Differenced GNSS Observations," Energies, MDPI, vol. 15(5), pages 1-18, February.
    12. Lubashevsky, Ihor & Friedrich, Rudolf & Heuer, Andreas & Ushakov, Andrey, 2009. "Generalized superstatistics of nonequilibrium Markovian systems," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 388(21), pages 4535-4550.
    13. Assaf Almog & Ferry Besamusca & Mel MacMahon & Diego Garlaschelli, 2015. "Mesoscopic Community Structure of Financial Markets Revealed by Price and Sign Fluctuations," PLOS ONE, Public Library of Science, vol. 10(7), pages 1-16, July.
    14. Sebastiano Michele Zema & Giorgio Fagiolo & Tiziano Squartini & Diego Garlaschelli, 2021. "Mesoscopic Structure of the Stock Market and Portfolio Optimization," Papers 2112.06544, arXiv.org.
    15. Ross Richardson & Matteo G. Richiardi & Michael Wolfson, 2015. "We ran one billion agents. Scaling in simulation models," LABORatorio R. Revelli Working Papers Series 142, LABORatorio R. Revelli, Centre for Employment Studies.
    16. Igor Fedotenkov, 2020. "A Review of More than One Hundred Pareto-Tail Index Estimators," Statistica, Department of Statistics, University of Bologna, vol. 80(3), pages 245-299.
    17. Harmenberg, Karl, 2024. "A simple theory of Pareto-distributed earnings," Economics Letters, Elsevier, vol. 234(C).
    18. Da Silva, Sergio, 2009. "Does Macroeconomics Need Microeconomic Foundations?," Economics - The Open-Access, Open-Assessment E-Journal (2007-2020), Kiel Institute for the World Economy (IfW Kiel), vol. 3, pages 1-11.
    19. S. Reimann, 2007. "Price dynamics from a simple multiplicative random process model," The European Physical Journal B: Condensed Matter and Complex Systems, Springer;EDP Sciences, vol. 56(4), pages 381-394, April.
    20. Chen, Zhimin & Ibragimov, Rustam, 2019. "One country, two systems? The heavy-tailedness of Chinese A- and H- share markets," Emerging Markets Review, Elsevier, vol. 38(C), pages 115-141.
