IDEAS home Printed from https://ideas.repec.org/p/ies/wpaper/e202302.html
   My bibliography  Save this paper

Convex and Nonconvex Nonparametric Frontier-based Classification Methods for Anomaly Detection

Author

Listed:
  • Qianying JIN

    (College of Economics and Management, Nanjing University of Aeronautics and Astronautics, Nanjing, 211106, China)

  • Kristiaan KERSTENS

    (Univ. Lille, CNRS, IESEG School of Management, UMR 9221 - LEM - Lille E´conomie Management, Lille, France)

  • Ignace VAN DE WOESTYNE

    (KU Leuven, Research Centre for Operations Research and Statistics (ORSTAT), Brussels Campus, War- moesberg 26, B-1000 Brussels, Belgium)

Abstract

Effective methods for determining the boundary of the normal class are very useful for detecting anomalies in commercial or security applications - a problem known as anomaly detection. This contribution proposes a nonparametric frontier-based clas- sification (NPFC) method for anomaly detection. By relaxing the commonly used convexity assumption in the literature, a nonconvex NPFC method is constructed and the nonconvex nonparametric frontier turns out to provide a more conservative bound- ary enveloping the normal class. By reflecting on the monotonic relation between the characteristic variables and the membership, the proposed NPFC method is in a more general form since both input-type and output-type characteristic variables are incor- porated. A biomedical data set is used to test the performance of the proposed NPFC methods. The results show that the proposed NPFC methods have competitive clas- sification performance and have consistent advantages in detecting abnormal samples, especially the nonconvex NPFC method

Suggested Citation

  • Qianying JIN & Kristiaan KERSTENS & Ignace VAN DE WOESTYNE, 2023. "Convex and Nonconvex Nonparametric Frontier-based Classification Methods for Anomaly Detection," Working Papers 2023-EQM-01, IESEG School of Management.
  • Handle: RePEc:ies:wpaper:e202302
    as

    Download full text from publisher

    File URL: https://www.ieseg.fr/wp-content/uploads/2023/02/2023-EQM-01.pdf
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Kerstens, Kristiaan & Sadeghi, Jafar & Toloo, Mehdi & Van de Woestyne, Ignace, 2022. "Procedures for ranking technical and cost efficient units: With a focus on nonconvexity," European Journal of Operational Research, Elsevier, vol. 300(1), pages 269-281.
    2. Per Andersen & Niels Christian Petersen, 1993. "A Procedure for Ranking Efficient Units in Data Envelopment Analysis," Management Science, INFORMS, vol. 39(10), pages 1261-1264, October.
    3. Walter Briec & Kristiaan Kerstens & Ignace Van de Woestyne, 2018. "Hypercongestion in production correspondences: an empirical exploration," Applied Economics, Taylor & Francis Journals, vol. 50(27), pages 2938-2956, June.
    4. Lovell, C. A. Knox & Pastor, Jesus T., 1999. "Radial DEA models without inputs or without outputs," European Journal of Operational Research, Elsevier, vol. 118(1), pages 46-51, October.
    5. K Kerstens & I Van de Woestyne, 2011. "Negative data in DEA: a simple proportional distance function approach," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 62(7), pages 1413-1419, July.
    6. Bruce G. Marcot & Anca M. Hanea, 2021. "What is an optimal value of k in k-fold cross-validation in discrete Bayesian network analysis?," Computational Statistics, Springer, vol. 36(3), pages 2009-2031, September.
    7. Valero-Carreras, Daniel & Aparicio, Juan & Guerrero, Nadia M., 2021. "Support vector frontiers: A new approach for estimating production functions through support vector machines," Omega, Elsevier, vol. 104(C).
    8. W. Briec, 1997. "A Graph-Type Extension of Farrell Technical Efficiency Measure," Journal of Productivity Analysis, Springer, vol. 8(1), pages 95-110, March.
    9. Sueyoshi, Toshiyuki, 2006. "DEA-Discriminant Analysis: Methodological comparison among eight discriminant analysis approaches," European Journal of Operational Research, Elsevier, vol. 169(1), pages 247-272, February.
    10. R. G. Chambers & Y. Chung & R. Färe, 1998. "Profit, Directional Distance Functions, and Nerlovian Efficiency," Journal of Optimization Theory and Applications, Springer, vol. 98(2), pages 351-364, August.
    11. Juan Aparicio & Miriam Esteve & Jesus J. Rodriguez-Sala & Jose L. Zofio, 2021. "The Estimation of Productive Efficiency Through Machine Learning Techniques: Efficiency Analysis Trees," International Series in Operations Research & Management Science, in: Joe Zhu & Vincent Charles (ed.), Data-Enabled Analytics, pages 51-92, Springer.
    12. Chiwoo Park & Jianhua Z. Huang & Yu Ding, 2010. "A Computable Plug-In Estimator of Minimum Volume Sets for Novelty Detection," Operations Research, INFORMS, vol. 58(5), pages 1469-1480, October.
    13. Esteve, Miriam & Aparicio, Juan & Rodriguez-Sala, Jesus J. & Zhu, Joe, 2023. "Random Forests and the measurement of super-efficiency in the context of Free Disposal Hull," European Journal of Operational Research, Elsevier, vol. 304(2), pages 729-744.
    14. Pendharkar, Parag C., 2002. "A potential use of data envelopment analysis for the inverse classification problem," Omega, Elsevier, vol. 30(3), pages 243-248, June.
    15. Laurens Cherchye & Timo Kuosmanen & Thierry Post, 2001. "FDH Directional Distance Functions with an Application to European Commercial Banks," Journal of Productivity Analysis, Springer, vol. 15(3), pages 201-215, January.
    16. R. D. Banker & A. Charnes & W. W. Cooper, 1984. "Some Models for Estimating Technical and Scale Inefficiencies in Data Envelopment Analysis," Management Science, INFORMS, vol. 30(9), pages 1078-1092, September.
    17. C F Leon & F Palacios, 2009. "Evaluation of rejected cases in an acceptance system with data envelopment analysis and goal programming," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 60(10), pages 1411-1420, October.
    18. Kaffash, Sepideh & Azizi, Roza & Huang, Ying & Zhu, Joe, 2020. "A survey of data envelopment analysis applications in the insurance industry 1993–2018," European Journal of Operational Research, Elsevier, vol. 284(3), pages 801-813.
    19. Kim, Ji-Hyun, 2009. "Estimating classification error rate: Repeated cross-validation, repeated hold-out and bootstrap," Computational Statistics & Data Analysis, Elsevier, vol. 53(11), pages 3735-3745, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Esteve, Miriam & Aparicio, Juan & Rodriguez-Sala, Jesus J. & Zhu, Joe, 2023. "Random Forests and the measurement of super-efficiency in the context of Free Disposal Hull," European Journal of Operational Research, Elsevier, vol. 304(2), pages 729-744.
    2. Ravelojaona, Paola, 2019. "On constant elasticity of substitution – Constant elasticity of transformation Directional Distance Functions," European Journal of Operational Research, Elsevier, vol. 272(2), pages 780-791.
    3. Guerrero, Nadia M. & Moragues, Raul & Aparicio, Juan & Valero-Carreras, Daniel, 2024. "Support Vector Frontiers with kernel splines," Omega, Elsevier, vol. 128(C).
    4. Cova-Alonso, David José & Díaz-Hernández, Juan José & Martínez-Budría, Eduardo, 2021. "A strong efficiency measure for CCR/BCC models," European Journal of Operational Research, Elsevier, vol. 291(1), pages 284-295.
    5. Mahmood Mehdiloo & Jafar Sadeghi & Kristiaan Kerstens, 2024. "Top Down Axiomatic Modeling of Metatechnologies and Evaluating Directional Economic Efficiency," Working Papers 2024-EQM-03, IESEG School of Management.
    6. Sahoo, Biresh K. & Singh, Ramadhar & Mishra, Bineet & Sankaran, Krithiga, 2017. "Research productivity in management schools of India during 1968-2015: A directional benefit-of-doubt model analysis," Omega, Elsevier, vol. 66(PA), pages 118-139.
    7. Guillen, Maria D. & Aparicio, Juan & Kapelko, Magdalena & Esteve, Miriam, 2025. "Measuring environmental inefficiency through machine learning: An approach based on efficiency analysis trees and by-production technology," European Journal of Operational Research, Elsevier, vol. 321(2), pages 529-542.
    8. Aparicio, Juan & Pastor, Jesus T. & Vidal, Fernando, 2016. "The directional distance function and the translation invariance property," Omega, Elsevier, vol. 58(C), pages 1-3.
    9. Halická, Margaréta & Trnovská, Mária & Černý, Aleš, 2024. "A unified approach to radial, hyperbolic, and directional efficiency measurement in data envelopment analysis," European Journal of Operational Research, Elsevier, vol. 312(1), pages 298-314.
    10. Pastor, Jesus T. & Lovell, C.A. Knox & Aparicio, Juan, 2020. "Defining a new graph inefficiency measure for the proportional directional distance function and introducing a new Malmquist productivity index," European Journal of Operational Research, Elsevier, vol. 281(1), pages 222-230.
    11. Maria Silva Portela & Pedro Borges & Emmanuel Thanassoulis, 2003. "Finding Closest Targets in Non-Oriented DEA Models: The Case of Convex and Non-Convex Technologies," Journal of Productivity Analysis, Springer, vol. 19(2), pages 251-269, April.
    12. Kao, Chiang, 2020. "Measuring efficiency in a general production possibility set allowing for negative data," European Journal of Operational Research, Elsevier, vol. 282(3), pages 980-988.
    13. Mercedes Beltrán-Esteve & José Gómez-Limón & Andrés Picazo-Tadeo & Ernest Reig-Martínez, 2014. "A metafrontier directional distance function approach to assessing eco-efficiency," Journal of Productivity Analysis, Springer, vol. 41(1), pages 69-83, February.
    14. Timo Kuosmanen, 2007. "Performance measurement and best-practice benchmarking of mutual funds: combining stochastic dominance criteria with data envelopment analysis," Journal of Productivity Analysis, Springer, vol. 28(1), pages 71-86, October.
    15. Papaioannou, Grammatoula & Podinovski, Victor V., 2023. "Production technologies with ratio inputs and outputs," European Journal of Operational Research, Elsevier, vol. 310(3), pages 1164-1178.
    16. Halická, Margaréta & Trnovská, Mária & Černý, Aleš, 2025. "On indication, strict monotonicity, and efficiency of projections in a general class of path-based data envelopment analysis models," European Journal of Operational Research, Elsevier, vol. 320(1), pages 175-187.
    17. Aparicio, Juan & Mahlberg, Bernhard & Pastor, Jesus T. & Sahoo, Biresh K., 2014. "Decomposing technical inefficiency using the principle of least action," European Journal of Operational Research, Elsevier, vol. 239(3), pages 776-785.
    18. Arnaud Abad, 2020. "Environmental Efficiency and Productivity Analysis," Working Papers hal-03032038, HAL.
    19. Juan Aparicio & Magdalena Kapelko, 2019. "Enhancing the Measurement of Composite Indicators of Corporate Social Performance," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 144(2), pages 807-826, July.
    20. Mustapha D. Ibrahim & Sahand Daneshvar & Mevhibe B. Hocaoğlu & Olasehinde-Williams G. Oluseye, 2019. "An Estimation of the Efficiency and Productivity of Healthcare Systems in Sub-Saharan Africa: Health-Centred Millennium Development Goal-Based Evidence," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 143(1), pages 371-389, May.

    More about this item

    Keywords

    : Nonparametric Frontier; Convex; Nonconvex; Anomaly Detection;
    All these keywords.

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ies:wpaper:e202302. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Lies BOUTEN (email available below). General contact details of provider: https://edirc.repec.org/data/iesegfr.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.