IDEAS home Printed from https://ideas.repec.org/a/gam/jeners/v9y2016i9p752-d78211.html
   My bibliography  Save this article

A New Methodology Based on Imbalanced Classification for Predicting Outliers in Electricity Demand Time Series

Author

Listed:
  • Francisco Javier Duque-Pintor

    (Division of Computer Science, Universidad Pablo de Olavide, ES-41013 Seville, Spain)

  • Manuel Jesús Fernández-Gómez

    (Division of Computer Science, Universidad Pablo de Olavide, ES-41013 Seville, Spain)

  • Alicia Troncoso

    (Division of Computer Science, Universidad Pablo de Olavide, ES-41013 Seville, Spain)

  • Francisco Martínez-Álvarez

    (Division of Computer Science, Universidad Pablo de Olavide, ES-41013 Seville, Spain)

Abstract

The occurrence of outliers in real-world phenomena is quite usual. If these anomalous data are not properly treated, unreliable models can be generated. Many approaches in the literature are focused on a posteriori detection of outliers. However, a new methodology to a priori predict the occurrence of such data is proposed here. Thus, the main goal of this work is to predict the occurrence of outliers in time series, by using, for the first time, imbalanced classification techniques. In this sense, the problem of forecasting outlying data has been transformed into a binary classification problem, in which the positive class represents the occurrence of outliers. Given that the number of outliers is much lower than the number of common values, the resultant classification problem is imbalanced. To create training and test sets, robust statistical methods have been used to detect outliers in both sets. Once the outliers have been detected, the instances of the dataset are labeled accordingly. Namely, if any of the samples composing the next instance are detected as an outlier, the label is set to one. As a study case, the methodology has been tested on electricity demand time series in the Spanish electricity market, in which most of the outliers were properly forecast.

Suggested Citation

  • Francisco Javier Duque-Pintor & Manuel Jesús Fernández-Gómez & Alicia Troncoso & Francisco Martínez-Álvarez, 2016. "A New Methodology Based on Imbalanced Classification for Predicting Outliers in Electricity Demand Time Series," Energies, MDPI, vol. 9(9), pages 1-10, September.
  • Handle: RePEc:gam:jeners:v:9:y:2016:i:9:p:752-:d:78211
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1996-1073/9/9/752/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1996-1073/9/9/752/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Croux, Christophe & Gelper, Sarah & Mahieu, Koen, 2010. "Robust exponential smoothing of multivariate time series," Computational Statistics & Data Analysis, Elsevier, vol. 54(12), pages 2999-3006, December.
    2. Nowotarski, Jakub & Tomczyk, Jakub & Weron, Rafał, 2013. "Robust estimation and forecasting of the long-term seasonal component of electricity spot prices," Energy Economics, Elsevier, vol. 39(C), pages 13-27.
    3. M. Angeles Carnero & Daniel Peña & Esther Ruiz, 2007. "Effects of outliers on the identification and estimation of GARCH models," Journal of Time Series Analysis, Wiley Blackwell, vol. 28(4), pages 471-497, July.
    4. Sarah Gelper & Roland Fried & Christophe Croux, 2010. "Robust forecasting with exponential and Holt-Winters smoothing," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 29(3), pages 285-300.
    5. Galeano, Pedro & Pena, Daniel & Tsay, Ruey S., 2006. "Outlier Detection in Multivariate Time Series by Projection Pursuit," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 654-669, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. J. R. S. Iruela & L. G. B. Ruiz & M. I. Capel & M. C. Pegalajar, 2021. "A TensorFlow Approach to Data Analysis for Time Series Forecasting in the Energy-Efficiency Realm," Energies, MDPI, vol. 14(13), pages 1-22, July.
    2. Francisco Martínez-Álvarez & Alicia Troncoso & José C. Riquelme, 2017. "Recent Advances in Energy Time Series Forecasting," Energies, MDPI, vol. 10(6), pages 1-3, June.
    3. Paul Anton Verwiebe & Stephan Seim & Simon Burges & Lennart Schulz & Joachim Müller-Kirchenbauer, 2021. "Modeling Energy Demand—A Systematic Literature Review," Energies, MDPI, vol. 14(23), pages 1-58, November.
    4. L. Cabezón & L. G. B. Ruiz & D. Criado-Ramón & E. J. Gago & M. C. Pegalajar, 2022. "Photovoltaic Energy Production Forecasting through Machine Learning Methods: A Scottish Solar Farm Case Study," Energies, MDPI, vol. 15(22), pages 1-14, November.
    5. Xiaoyu Zhang & Rui Wang & Tao Zhang & Yajie Liu & Yabing Zha, 2018. "Short-Term Load Forecasting Using a Novel Deep Learning Framework," Energies, MDPI, vol. 11(6), pages 1-15, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Grané, Aurea & Veiga, Helena, 2010. "Outliers in Garch models and the estimation of risk measures," DES - Working Papers. Statistics and Econometrics. WS ws100502, Universidad Carlos III de Madrid. Departamento de Estadística.
    2. Veiga, Helena, 2009. "Wavelet-based detection of outliers in volatility models," DES - Working Papers. Statistics and Econometrics. WS ws090403, Universidad Carlos III de Madrid. Departamento de Estadística.
    3. Grané, Aurea & Martín-Barragán, Belén & Veiga, Helena, 2014. "Outliers in multivariate Garch models," DES - Working Papers. Statistics and Econometrics. WS ws140503, Universidad Carlos III de Madrid. Departamento de Estadística.
    4. Muler, Nora & Yohai, V´ictor J., 2013. "Robust estimation for vector autoregressive models," Computational Statistics & Data Analysis, Elsevier, vol. 65(C), pages 68-79.
    5. Gambacciani, Marco & Paolella, Marc S., 2017. "Robust normal mixtures for financial portfolio allocation," Econometrics and Statistics, Elsevier, vol. 3(C), pages 91-111.
    6. Grané, Aurea & Veiga, Helena, 2010. "Wavelet-based detection of outliers in financial time series," Computational Statistics & Data Analysis, Elsevier, vol. 54(11), pages 2580-2593, November.
    7. Barrow, Devon & Kourentzes, Nikolaos, 2018. "The impact of special days in call arrivals forecasting: A neural network approach to modelling special days," European Journal of Operational Research, Elsevier, vol. 264(3), pages 967-977.
    8. Jakub Nowotarski, 2013. "Short-term forecasting of electricity spot prices using model averaging (Krótkoterminowe prognozowanie spotowych cen energii elektrycznej z wykorzystaniem uśredniania modeli)," HSC Research Reports HSC/13/17, Hugo Steinhaus Center, Wroclaw University of Technology.
    9. Doornik, Jurgen A. & Ooms, Marius, 2008. "Multimodality in GARCH regression models," International Journal of Forecasting, Elsevier, vol. 24(3), pages 432-448.
    10. Marie Bessec & Julien Fouquau & Sophie Meritet, 2016. "Forecasting electricity spot prices using time-series models with a double temporal segmentation," Applied Economics, Taylor & Francis Journals, vol. 48(5), pages 361-378, January.
    11. Afanasyev, Dmitriy O. & Fedorova, Elena A., 2019. "On the impact of outlier filtering on the electricity price forecasting accuracy," Applied Energy, Elsevier, vol. 236(C), pages 196-210.
    12. Sucarrat, Genaro & Grønneberg, Steffen & Escribano, Alvaro, 2016. "Estimation and inference in univariate and multivariate log-GARCH-X models when the conditional density is unknown," Computational Statistics & Data Analysis, Elsevier, vol. 100(C), pages 582-594.
    13. Marcjasz, Grzegorz & Uniejewski, Bartosz & Weron, Rafał, 2019. "On the importance of the long-term seasonal component in day-ahead electricity price forecasting with NARX neural networks," International Journal of Forecasting, Elsevier, vol. 35(4), pages 1520-1532.
    14. Trucíos, Carlos & Mazzeu, João H.G. & Hotta, Luiz K. & Valls Pereira, Pedro L. & Hallin, Marc, 2021. "Robustness and the general dynamic factor model with infinite-dimensional space: Identification, estimation, and forecasting," International Journal of Forecasting, Elsevier, vol. 37(4), pages 1520-1534.
    15. Escribano, Alvaro & Sucarrat, Genaro, 2018. "Equation-by-equation estimation of multivariate periodic electricity price volatility," Energy Economics, Elsevier, vol. 74(C), pages 287-298.
    16. Weron, Rafał & Zator, Michał, 2015. "A note on using the Hodrick–Prescott filter in electricity markets," Energy Economics, Elsevier, vol. 48(C), pages 1-6.
    17. M. Angeles Carnero & Daniel Peña & Esther Ruiz, 2008. "Estimating and Forecasting GARCH Volatility in the Presence of Outiers," Working Papers. Serie AD 2008-13, Instituto Valenciano de Investigaciones Económicas, S.A. (Ivie).
    18. Behmiri, Niaz Bashiri & Manera, Matteo, 2015. "The role of outliers and oil price shocks on volatility of metal prices," Resources Policy, Elsevier, vol. 46(P2), pages 139-150.
    19. Guo-hua Ye & Mirxat Alim & Peng Guan & De-sheng Huang & Bao-sen Zhou & Wei Wu, 2021. "Improving the precision of modeling the incidence of hemorrhagic fever with renal syndrome in mainland China with an ensemble machine learning approach," PLOS ONE, Public Library of Science, vol. 16(3), pages 1-13, March.
    20. Nowotarski, Jakub & Weron, Rafał, 2018. "Recent advances in electricity price forecasting: A review of probabilistic forecasting," Renewable and Sustainable Energy Reviews, Elsevier, vol. 81(P1), pages 1548-1568.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jeners:v:9:y:2016:i:9:p:752-:d:78211. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.