IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0129767.html
   My bibliography  Save this article

Computed ABC Analysis for Rational Selection of Most Informative Variables in Multivariate Data

Author

Listed:
  • Alfred Ultsch
  • Jörn Lötsch

Abstract

Objective: Multivariate data sets often differ in several factors or derived statistical parameters, which have to be selected for a valid interpretation. Basing this selection on traditional statistical limits leads occasionally to the perception of losing information from a data set. This paper proposes a novel method for calculating precise limits for the selection of parameter sets. Methods: The algorithm is based on an ABC analysis and calculates these limits on the basis of the mathematical properties of the distribution of the analyzed items. The limits im-plement the aim of any ABC analysis, i.e., comparing the increase in yield to the required additional effort. In particular, the limit for set A, the “important few”, is optimized in a way that both, the effort and the yield for the other sets (B and C), are minimized and the additional gain is optimized. Results: As a typical example from biomedical research, the feasibility of the ABC analysis as an objective replacement for classical subjective limits to select highly relevant variance components of pain thresholds is presented. The proposed method improved the biological inter-pretation of the results and increased the fraction of valid information that was obtained from the experimental data. Conclusions: The method is applicable to many further biomedical problems in-cluding the creation of diagnostic complex biomarkers or short screening tests from comprehensive test batteries. Thus, the ABC analysis can be proposed as a mathematically valid replacement for traditional limits to maximize the information obtained from multivariate research data.

Suggested Citation

  • Alfred Ultsch & Jörn Lötsch, 2015. "Computed ABC Analysis for Rational Selection of Most Informative Variables in Multivariate Data," PLOS ONE, Public Library of Science, vol. 10(6), pages 1-15, June.
  • Handle: RePEc:plo:pone00:0129767
    DOI: 10.1371/journal.pone.0129767
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0129767
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0129767&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0129767?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Gastwirth, Joseph L & Glauberman, Marcia, 1976. "The Interpolation of the Lorenz Curve and Gini Index from Grouped Data," Econometrica, Econometric Society, vol. 44(3), pages 479-483, May.
    2. Gastwirth, Joseph L, 1971. "A General Definition of the Lorenz Curve," Econometrica, Econometric Society, vol. 39(6), pages 1037-1039, November.
    3. Atkinson, Anthony B., 1970. "On the measurement of inequality," Journal of Economic Theory, Elsevier, vol. 2(3), pages 244-263, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jan Michael Spoor, 2023. "Improving customer segmentation via classification of key accounts as outliers," Journal of Marketing Analytics, Palgrave Macmillan, vol. 11(4), pages 747-760, December.
    2. Natálie Veselá & Volodymyr Rodchenko & David Hampel, 2022. "On the Investment Attractiveness of Ukrainian Companies," European Journal of Business Science and Technology, Mendel University in Brno, Faculty of Business and Economics, vol. 8(1), pages 54-71.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Beach, Charles M., 1979. "Model-Free Statistical Inference with Lorenz Curves, Income Shares, and Gini Coefficients," Queen's Institute for Economic Research Discussion Papers 275154, Queen's University - Department of Economics.
    2. Fabio Clementi & Mauro Gallegati & Giorgio Kaniadakis, 2010. "A model of personal income distribution with application to Italian data," Empirical Economics, Springer, vol. 39(2), pages 559-591, October.
    3. Masato Okamoto, 2014. "Interpolating the Lorenz Curve: Methods to Preserve Shape and Remain Consistent with the Concentration Curves for Components," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 60(2), pages 349-384, June.
    4. Belzunce, Félix & Pinar, José F. & Ruiz, José M. & Sordo, Miguel A., 2013. "Comparison of concentration for several families of income distributions," Statistics & Probability Letters, Elsevier, vol. 83(4), pages 1036-1045.
    5. Satya Paul & Sriram Shankar, 2020. "An alternative single parameter functional form for Lorenz curve," Empirical Economics, Springer, vol. 59(3), pages 1393-1402, September.
    6. Fontanari Andrea & Cirillo Pasquale & Oosterlee Cornelis W., 2020. "Lorenz-generated bivariate Archimedean copulas," Dependence Modeling, De Gruyter, vol. 8(1), pages 186-209, January.
    7. Pogorelskiy, Kirill & Seidl, Christian & Traub, Stefan, 2010. "Tax progression: International and intertemporal comparison using LIS data," Economics Working Papers 2010-08, Christian-Albrechts-University of Kiel, Department of Economics.
    8. Francesco Andreoli & Claudio Zoli, 2020. "From unidimensional to multidimensional inequality: a review," METRON, Springer;Sapienza Università di Roma, vol. 78(1), pages 5-42, April.
    9. Edwin Fourrier-Nicolai & Michel Lubrano, 2019. "The Effect of Aspirations on Inequality: Evidence from the German Reunification using Bayesian Growth Incidence Curves," Working Papers halshs-02122371, HAL.
    10. Gengsheng Qin & Baoying Yang & Nelly Belinga-Hall, 2013. "Empirical likelihood-based inferences for the Lorenz curve," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 65(1), pages 1-21, February.
    11. Jasso, Guillermina & Kotz, Samuel, 2007. "Two Types of Inequality: Inequality Between Persons and Inequality Between Subgroups," IZA Discussion Papers 2749, Institute of Labor Economics (IZA).
    12. Zhao, Puying & Haziza, David & Wu, Changbao, 2020. "Survey weighted estimating equation inference with nuisance functionals," Journal of Econometrics, Elsevier, vol. 216(2), pages 516-536.
    13. Claudio Zoli, 2002. "Inverse stochastic dominance, inequality measurement and Gini indices," Journal of Economics, Springer, vol. 77(1), pages 119-161, December.
    14. ANDREOLI Francesco & HAVNES Tarjei & LEFRANC Arnaud, 2014. "Equalization of opportunity: Definitions, implementable conditions and application to early-childhood policy evaluation," LISER Working Paper Series 2014-12, Luxembourg Institute of Socio-Economic Research (LISER).
    15. Sarabia Alegría, J.M & Pascual Sáez, Marta, 2001. "Rankings de distribuciones de renta basados en curvas de Lorenz ordenadas: un estudio empírico1," Estudios de Economia Aplicada, Estudios de Economia Aplicada, vol. 19, pages 151-169, Diciembre.
    16. Giovanni Giorgi, 2014. "A couple of good reasons to translate papers of the Italian statistical tradition," METRON, Springer;Sapienza Università di Roma, vol. 72(1), pages 1-3, April.
    17. Greselin, Francesca & Zitikis, Ricardas, 2015. "Measuring economic inequality and risk: a unifying approach based on personal gambles, societal preferences and references," MPRA Paper 65892, University Library of Munich, Germany.
    18. Christopher Bennett & Ričardas Zitikis, 2015. "Ignorance, lotteries, and measures of economic inequality," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 13(2), pages 309-316, June.
    19. Miguel Sordo & Jorge Navarro & José Sarabia, 2014. "Distorted Lorenz curves: models and comparisons," Social Choice and Welfare, Springer;The Society for Social Choice and Welfare, vol. 42(4), pages 761-780, April.
    20. Francesco Andreoli & Tarjei Havnes & Arnaud Lefranc, 2019. "Robust Inequality of Opportunity Comparisons: Theory and Application to Early Childhood Policy Evaluation," The Review of Economics and Statistics, MIT Press, vol. 101(2), pages 355-369, May.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0129767. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.