IDEAS home Printed from https://ideas.repec.org/a/spr/advdac/v9y2015i3p287-314.html
   My bibliography  Save this article

Classifying real-world data with the $${ DD}\alpha $$ D D α -procedure

Author

Listed:
  • Pavlo Mozharovskyi
  • Karl Mosler
  • Tatjana Lange

Abstract

The $${ DD}\alpha $$ D D α -classifier, a nonparametric fast and very robust procedure, is described and applied to fifty classification problems regarding a broad spectrum of real-world data. The procedure first transforms the data from their original property space into a depth space, which is a low-dimensional unit cube, and then separates them by a projective invariant procedure, called $$\alpha $$ α -procedure. To each data point the transformation assigns its depth values with respect to the given classes. Several alternative depth notions (spatial depth, Mahalanobis depth, projection depth, and Tukey depth, the latter two being approximated by univariate projections) are used in the procedure, and compared regarding their average error rates. With the Tukey depth, which fits the distributions’ shape best and is most robust, ‘outsiders’, that is data points having zero depth in all classes, appear. They need an additional treatment for classification. Evidence is also given about the dimension of the extended feature space needed for linear separation. The $${ DD}\alpha $$ D D α -procedure is available as an R-package. Copyright Springer-Verlag Berlin Heidelberg 2015

Suggested Citation

  • Pavlo Mozharovskyi & Karl Mosler & Tatjana Lange, 2015. "Classifying real-world data with the $${ DD}\alpha $$ D D α -procedure," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 9(3), pages 287-314, September.
  • Handle: RePEc:spr:advdac:v:9:y:2015:i:3:p:287-314
    DOI: 10.1007/s11634-014-0180-8
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1007/s11634-014-0180-8
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1007/s11634-014-0180-8?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Christmann, Andreas & Rousseeuw, Peter J., 2001. "Measuring overlap in binary regression," Computational Statistics & Data Analysis, Elsevier, vol. 37(1), pages 65-75, July.
    2. Davy Paindaveine & Germain Van Bever, 2012. "Nonparametrically Consistent Depth-Based Classifiers," Working Papers ECARES ECARES 2012-014, ULB -- Universite Libre de Bruxelles.
    3. Andreas Christmann & Paul Fischer & Thorsten Joachims, 2002. "Comparison between various regression depth methods and the support vector machine to approximate the minimum number of misclassifications," Computational Statistics, Springer, vol. 17(2), pages 273-287, July.
    4. Karl Mosler, 2003. "Central Regions and Dependency," Methodology and Computing in Applied Probability, Springer, vol. 5(1), pages 5-21, March.
    5. Hubert, Mia & Van Driessen, Katrien, 2004. "Fast and robust discriminant analysis," Computational Statistics & Data Analysis, Elsevier, vol. 45(2), pages 301-320, March.
    6. Cuesta-Albertos, J.A. & Nieto-Reyes, A., 2008. "The random Tukey depth," Computational Statistics & Data Analysis, Elsevier, vol. 52(11), pages 4979-4988, July.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Kotík, Lukáš & Hlubinka, Daniel, 2017. "A weighted localization of halfspace depth and its properties," Journal of Multivariate Analysis, Elsevier, vol. 157(C), pages 53-69.
    2. Mengta Yang & Reza Modarres, 2017. "Multivariate tests of uniformity," Statistical Papers, Springer, vol. 58(3), pages 627-639, September.
    3. Raymaekers, Jakob & Rousseeuw, Peter, 2019. "A generalized spatial sign covariance matrix," Journal of Multivariate Analysis, Elsevier, vol. 171(C), pages 94-111.
    4. Ondrej Vencalek & Olusola Samuel Makinde, 2021. "RR-classifier: a nonparametric classification procedure in multidimensional space based on relative ranks," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 105(4), pages 675-693, December.
    5. Panagiotis Tzirakis & Christos Tjortjis, 2017. "T3C: improving a decision tree classification algorithm’s interval splits on continuous attributes," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 11(2), pages 353-370, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Tatjana Lange & Karl Mosler & Pavlo Mozharovskyi, 2014. "Fast nonparametric classification based on data depth," Statistical Papers, Springer, vol. 55(1), pages 49-69, February.
    2. Nedret Billor & Asheber Abebe & Asuman Turkmen & Sai Nudurupati, 2008. "Classification Based on Depth Transvariations," Journal of Classification, Springer;The Classification Society, vol. 25(2), pages 249-260, November.
    3. Victor Chernozhukov & Alfred Galichon & Marc Hallin & Marc Henry, 2014. "Monge-Kantorovich Depth, Quantiles, Ranks, and Signs," Papers 1412.8434, arXiv.org, revised Sep 2015.
    4. repec:spo:wpmain:info:hdl:2441/3qnaslliat80pbqa8t90240unj is not listed on IDEAS
    5. Davy Paindaveine & Germain Van Bever, 2017. "Halfspace Depths for Scatter, Concentration and Shape Matrices," Working Papers ECARES ECARES 2017-19, ULB -- Universite Libre de Bruxelles.
    6. Mia Hubert & Peter Rousseeuw & Pieter Segaert, 2017. "Multivariate and functional classification using depth and distance," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 11(3), pages 445-466, September.
    7. repec:hal:spmain:info:hdl:2441/3qnaslliat80pbqa8t90240unj is not listed on IDEAS
    8. Dyckerhoff, Rainer & Mozharovskyi, Pavlo & Nagy, Stanislav, 2021. "Approximate computation of projection depths," Computational Statistics & Data Analysis, Elsevier, vol. 157(C).
    9. Mengta Yang & Reza Modarres, 2017. "Multivariate tests of uniformity," Statistical Papers, Springer, vol. 58(3), pages 627-639, September.
    10. Anirvan Chakraborty & Probal Chaudhuri, 2014. "On data depth in infinite dimensional spaces," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 66(2), pages 303-324, April.
    11. Gleb A. Koshevoy & Karl Mosler, 2007. "Multivariate Lorenz dominance based on zonoids," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 91(1), pages 57-76, March.
    12. Elías Fernández, Antonio & Jiménez Recaredo, Raúl José, 2017. "Prediction Bands for Functional Data Based on Depth Measures," DES - Working Papers. Statistics and Econometrics. WS 24606, Universidad Carlos III de Madrid. Departamento de Estadística.
    13. Cizek, Pavel, 2008. "Robust and Efficient Adaptive Estimation of Binary-Choice Regression Models," Journal of the American Statistical Association, American Statistical Association, vol. 103, pages 687-696, June.
    14. Carlo Sguera & Pedro Galeano & Rosa Lillo, 2014. "Spatial depth-based classification for functional data," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 23(4), pages 725-750, December.
    15. Nieto-Reyes, Alicia & Battey, Heather, 2021. "A topologically valid construction of depth for functional data," Journal of Multivariate Analysis, Elsevier, vol. 184(C).
    16. Arie Beresteanu & Francesca Molinari, 2008. "Asymptotic Properties for a Class of Partially Identified Models," Econometrica, Econometric Society, vol. 76(4), pages 763-814, July.
    17. Victor Chernozhukov & Alfred Galichon & Marc Hallin & Marc Henry, 2014. "Monge-Kantorovich Depth, Quantiles, Ranks, and Signs," Papers 1412.8434, arXiv.org, revised Sep 2015.
    18. Vencalek, Ondrej & Pokotylo, Oleksii, 2018. "Depth-weighted Bayes classification," Computational Statistics & Data Analysis, Elsevier, vol. 123(C), pages 1-12.
    19. Dyckerhoff, Rainer & Mozharovskyi, Pavlo, 2016. "Exact computation of the halfspace depth," Computational Statistics & Data Analysis, Elsevier, vol. 98(C), pages 19-30.
    20. Alper Sinan & B. Barıs Alkan, 2015. "A useful approach to identify the multicollinearity in the presence of outliers," Journal of Applied Statistics, Taylor & Francis Journals, vol. 42(5), pages 986-993, May.
    21. Miguel Flores & Salvador Naya & Rubén Fernández-Casal & Sonia Zaragoza & Paula Raña & Javier Tarrío-Saavedra, 2020. "Constructing a Control Chart Using Functional Data," Mathematics, MDPI, vol. 8(1), pages 1-26, January.
    22. Alba M. Franco-Pereira & Rosa E. Lillo, 2020. "Rank tests for functional data based on the epigraph, the hypograph and associated graphical representations," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 14(3), pages 651-676, September.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:advdac:v:9:y:2015:i:3:p:287-314. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.