IDEAS home Printed from https://ideas.repec.org/a/jcs/journl/v3y2019i1p1-12.html
   My bibliography  Save this article

Predicting Pollution Level Using Random Forest: A Case Study of Marilao River in Bulacan Province, Philippines

Author

Listed:
  • Jayson M. Victoriano

    (AMA University, Bulacan State University)

  • Manuel Luis C. Delos Santos

    (Asian Institute of Computer Studies)

  • Albert A. Vinluan

    (New Era University)

  • Jennifer T. Carpio

    (University of the East)

Abstract

Purpose – This study aims to predict the pollution level that threatens the Marilao River, located in the province of Bulacan, Philippines. The inhabitants of this area are now being exposed to pollution. Contamination of this waterway comes from both formal and informal industries, such as a used lead-acid battery, open dumpsites metal refining, and other toxic metals. Using various water quality parameters like Dissolved Oxygen (DO), Potential of Hydrogen (pH), Biochemical Oxygen Demand (BOD) and Total Suspended Solids (TSS) were the basis for predicting the pollution level. Method – This study used the Data Mining technique based on the sample data collected from January of 2013 to November of 2017. These were used as a training data and test results to predict the river condition with its corresponding pollution level classification indicated with the used of colors such as “Green†for “Normal†, “Yellow†for “Average†, “Orange†for “Polluted†and “Red†for “Highly Polluted†. The model got an accuracy of 91.75% with a Kappa value of 0.8115, interpreted as “Strong†in terms of the level of agreement. Results – The predicted model using the Random Forest have scored 91.75% in terms of correctly classified instances and were able to generate 0.8115 Kappa values which indicate that the model used to produce a strong level of agreement. Conclusion – From 2013 to 2017 based on the data sampling provided by the Environmental Management Bureau (EMB), an attached agency of the Department of Environment and Natural Resources (DENR) in the Philippines mandated to protect and restore the environment, shows that the river is highly polluted. Several issues like, underestimation of the water parameter results have been identified, issues which can be addressed by incorporating more observations to the training process and by validating the resulting model on the different training set. The discretion on decisions about the prediction of the model is attributed to DENR-EMB unit as they have more hands-on experience with regards to monitoring, restoring, protecting the environment.

Suggested Citation

  • Jayson M. Victoriano & Manuel Luis C. Delos Santos & Albert A. Vinluan & Jennifer T. Carpio, 2019. "Predicting Pollution Level Using Random Forest: A Case Study of Marilao River in Bulacan Province, Philippines," International Journal of Computing Sciences Research, Step Academic, vol. 3(1), pages 1-12, April.
  • Handle: RePEc:jcs:journl:v:3:y:2019:i:1:p:1-12
    DOI: 10.25147/ijcsr.2017.001.1.30
    as

    Download full text from publisher

    File URL: https://www.stepacademic.net/ijcsr/article/view/97/51
    Download Restriction: no

    File URL: https://libkey.io/10.25147/ijcsr.2017.001.1.30?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:jcs:journl:v:3:y:2019:i:1:p:1-12. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Liam Demafelix (email available below). General contact details of provider: https://www.stepacademic.net .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.