IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v11y2023i12p2631-d1167067.html
   My bibliography  Save this article

Method for the Detection of Functional Outliers Applied to Quality Monitoring Samples in the Vicinity of El Musel Seaport in the Metropolitan Area of Gijón (Northern Spain)

Author

Listed:
  • Luis Alfonso Menéndez-García

    (Department of Mining Technology, Topography and Structures, Higher and Technical School of Mining Engineering, University of León, Campus de Vegazana s/n, 24071 León, Spain)

  • Paulino José García-Nieto

    (Department of Mathematics, Faculty of Sciences, University of Oviedo, 33007 Oviedo, Spain)

  • Esperanza García-Gonzalo

    (Department of Mathematics, Faculty of Sciences, University of Oviedo, 33007 Oviedo, Spain)

  • Fernando Sánchez Lasheras

    (Department of Mathematics, Faculty of Sciences, University of Oviedo, 33007 Oviedo, Spain)

  • Laura Álvarez-de-Prado

    (Department of Mining Technology, Topography and Structures, Higher and Technical School of Mining Engineering, University of León, Campus de Vegazana s/n, 24071 León, Spain)

  • Antonio Bernardo-Sánchez

    (Department of Mining Technology, Topography and Structures, Higher and Technical School of Mining Engineering, University of León, Campus de Vegazana s/n, 24071 León, Spain)

Abstract

Air pollution affects human health and is one of the main problems in the world, including in coastal cities with industrial seaports. In this sense, the city of Gijón (northern Spain) stands out as one of the 20 Spanish cities with the worst air quality. The study aims to identify outliers in air quality observations near the El Musel seaport, resulting from the emissions of six pollutants over an eight-year period (2014–2021). It compares methods based on the functional data analysis (FDA) approach and vector methods to determine the optimal approach for detecting outliers and supporting air quality control. Our approach involves analyzing air pollutant observations as a set of curves rather than vectors. Therefore, in the FDA approach, curves are constructed to provide the best fit to isolated data points, resulting in a collection of continuous functions. These functions capture the behavior of the data in a continuous domain. Two FDA approach methodologies were used here: the functional bagplot and the high-density region (HDR) boxplot. Finally, outlier detection using the FDA approach was found to be more powerful than the vector methods and the functional bagplot method detected more outliers than the HDR boxplot.

Suggested Citation

  • Luis Alfonso Menéndez-García & Paulino José García-Nieto & Esperanza García-Gonzalo & Fernando Sánchez Lasheras & Laura Álvarez-de-Prado & Antonio Bernardo-Sánchez, 2023. "Method for the Detection of Functional Outliers Applied to Quality Monitoring Samples in the Vicinity of El Musel Seaport in the Metropolitan Area of Gijón (Northern Spain)," Mathematics, MDPI, vol. 11(12), pages 1-23, June.
  • Handle: RePEc:gam:jmathe:v:11:y:2023:i:12:p:2631-:d:1167067
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/11/12/2631/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/11/12/2631/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Filzmoser, Peter & Maronna, Ricardo & Werner, Mark, 2008. "Outlier identification in high dimensions," Computational Statistics & Data Analysis, Elsevier, vol. 52(3), pages 1694-1711, January.
    2. Oluwasegun Taiwo Ojo & Antonio Fernández Anta & Rosa E. Lillo & Carlo Sguera, 2022. "Detecting and classifying outliers in big functional data," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 16(3), pages 725-760, September.
    3. Mingzhe Zou & Sasa Z. Djokic, 2020. "A Review of Approaches for the Detection and Treatment of Outliers in Processing Wind Turbine and Wind Farm Measurements," Energies, MDPI, vol. 13(16), pages 1-30, August.
    4. Hyndman, Rob J. & Shahid Ullah, Md., 2007. "Robust forecasting of mortality and fertility rates: A functional data approach," Computational Statistics & Data Analysis, Elsevier, vol. 51(10), pages 4942-4956, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Rob J. Hyndman & Han Lin Shang, 2008. "Rainbow plots, Bagplots and Boxplots for Functional Data," Monash Econometrics and Business Statistics Working Papers 9/08, Monash University, Department of Econometrics and Business Statistics.
    2. Montes, Francisco & Sala, Ramón, 2012. "Equilibrio competitivo en Liga española de futbol de Primera División: Un test de Montecarlo basado en datos funcionales/Competitive Balance in the First Division Spanish Soccer League: A Montecarlo T," Estudios de Economia Aplicada, Estudios de Economia Aplicada, vol. 30, pages 513-526, Agosto.
    3. Booth, Heather, 2006. "Demographic forecasting: 1980 to 2005 in review," International Journal of Forecasting, Elsevier, vol. 22(3), pages 547-581.
    4. Rob Hyndman & Heather Booth & Farah Yasmeen, 2013. "Coherent Mortality Forecasting: The Product-Ratio Method With Functional Time Series Models," Demography, Springer;Population Association of America (PAA), vol. 50(1), pages 261-283, February.
    5. G. Zioutas & C. Chatzinakos & T. D. Nguyen & L. Pitsoulis, 2017. "Optimization techniques for multivariate least trimmed absolute deviation estimation," Journal of Combinatorial Optimization, Springer, vol. 34(3), pages 781-797, October.
    6. Niels Haldrup & Carsten P. T. Rosenskjold, 2019. "A Parametric Factor Model of the Term Structure of Mortality," Econometrics, MDPI, vol. 7(1), pages 1-22, March.
    7. Fang, Lei & Härdle, Wolfgang Karl, 2015. "Stochastic population analysis: A functional data approach," SFB 649 Discussion Papers 2015-007, Humboldt University Berlin, Collaborative Research Center 649: Economic Risk.
    8. Mestekemper, Thomas & Windmann, Michael & Kauermann, Göran, 2010. "Functional hourly forecasting of water temperature," International Journal of Forecasting, Elsevier, vol. 26(4), pages 684-699, October.
    9. Li, Han & Hyndman, Rob J., 2021. "Assessing mortality inequality in the U.S.: What can be said about the future?," Insurance: Mathematics and Economics, Elsevier, vol. 99(C), pages 152-162.
    10. Ahbab Mohammad Fazle Rabbi & Stefano Mazzuco, 2021. "Mortality Forecasting with the Lee–Carter Method: Adjusting for Smoothing and Lifespan Disparity," European Journal of Population, Springer;European Association for Population Studies, vol. 37(1), pages 97-120, March.
    11. Katrien Antonio & Anastasios Bardoutsos & Wilbert Ouburg, 2015. "Bayesian Poisson log-bilinear models for mortality projections with multiple populations," Working Papers Department of Accountancy, Finance and Insurance (AFI), Leuven 485564, KU Leuven, Faculty of Economics and Business (FEB), Department of Accountancy, Finance and Insurance (AFI), Leuven.
    12. D’Amato, Valeria & Di Lorenzo, Emilia & Haberman, Steven & Sagoo, Pretty & Sibillo, Marilena, 2018. "De-risking strategy: Longevity spread buy-in," Insurance: Mathematics and Economics, Elsevier, vol. 79(C), pages 124-136.
    13. Abdulelah Alkesaiberi & Fouzi Harrou & Ying Sun, 2022. "Efficient Wind Power Prediction Using Machine Learning Methods: A Comparative Study," Energies, MDPI, vol. 15(7), pages 1-24, March.
    14. D’Amato, Valeria & Haberman, Steven & Piscopo, Gabriella & Russolillo, Maria, 2012. "Modelling dependent data for longevity projections," Insurance: Mathematics and Economics, Elsevier, vol. 51(3), pages 694-701.
    15. Thomas Triebs & Subal C. Kumbhakar, 2012. "Management Practice in Production," ifo Working Paper Series 129, ifo Institute - Leibniz Institute for Economic Research at the University of Munich.
    16. Boente, Graciela & Parada, Daniela, 2023. "Robust estimation for functional quadratic regression models," Computational Statistics & Data Analysis, Elsevier, vol. 187(C).
    17. Hong Li & Johnny Siu-Hang Li, 2017. "Optimizing the Lee-Carter Approach in the Presence of Structural Changes in Time and Age Patterns of Mortality Improvements," Demography, Springer;Population Association of America (PAA), vol. 54(3), pages 1073-1095, June.
    18. Kosiorowski Daniel & Mielczarek Dominik & Rydlewski Jerzy P. & Snarska Małgorzata, 2018. "Generalized Exponential Smoothing In Prediction Of Hierarchical Time Series," Statistics in Transition New Series, Statistics Poland, vol. 19(2), pages 331-350, June.
    19. Heather Booth & Rob Hyndman & Leonie Tickle & Piet de Jong, 2006. "Lee-Carter mortality forecasting: a multi-country comparison of variants and extensions," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 15(9), pages 289-310.
    20. David E. Tyler & Frank Critchley & Lutz Dümbgen & Hannu Oja, 2009. "Invariant co‐ordinate selection," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 71(3), pages 549-592, June.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:11:y:2023:i:12:p:2631-:d:1167067. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.