IDEAS home Printed from https://ideas.repec.org/a/hin/jnlmpe/1899225.html
   My bibliography  Save this article

A Novel Approach for Outlier Detection in Multivariate Data

Author

Listed:
  • Saima Afzal
  • Ayesha Afzal
  • Muhammad Amin
  • Sehar Saleem
  • Nouman Ali
  • Muhammad Sajid

Abstract

Outlier detection is a challenging task especially when outliers are defined by rare combinations of multiple variables. In this paper, we develop and evaluate a new method for the detection of outliers in multivariate data that relies on Principal Components Analysis (PCA) and three-sigma limits. The proposed approach employs PCA to effectively perform dimension reduction by regenerating variables, i.e., fitted points from the original observations. The observations lying outside the three-sigma limits are identified as the outliers. This proposed method has been successfully employed to two real life and several artificially generated datasets. The performance of the proposed method is compared with some of the existing methods using different performance evaluation criteria including the percentage of correct classification, precision, recall, and F -measure. The supremacy of the proposed method is confirmed by abovementioned criteria and datasets. The F -measure for the first real life dataset is the highest, i.e., 0.6667 for the proposed method and 0.3333 and 0.4000 for the two existing approaches. Similarly, for the second real dataset, this measure is 0.8000 for the proposed approach and 0.5263 and 0.6315 for the two existing approaches. It is also observed by the simulation experiments that the performance of the proposed approach got better with increasing sample size.

Suggested Citation

  • Saima Afzal & Ayesha Afzal & Muhammad Amin & Sehar Saleem & Nouman Ali & Muhammad Sajid, 2021. "A Novel Approach for Outlier Detection in Multivariate Data," Mathematical Problems in Engineering, Hindawi, vol. 2021, pages 1-12, October.
  • Handle: RePEc:hin:jnlmpe:1899225
    DOI: 10.1155/2021/1899225
    as

    Download full text from publisher

    File URL: http://downloads.hindawi.com/journals/MPE/2021/1899225.pdf
    Download Restriction: no

    File URL: http://downloads.hindawi.com/journals/MPE/2021/1899225.xml
    Download Restriction: no

    File URL: https://libkey.io/10.1155/2021/1899225?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hin:jnlmpe:1899225. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Mohamed Abdelhakeem (email available below). General contact details of provider: https://www.hindawi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.