IDEAS home Printed from https://ideas.repec.org/a/gam/jlands/v14y2025i1p124-d1563163.html
   My bibliography  Save this article

Machine Learning-Based Prediction of Ecosystem-Scale CO 2 Flux Measurements

Author

Listed:
  • Jeffrey Uyekawa

    (Department of Mathematics and Statistics, Northern Arizona University, Flagstaff, AZ 86011, USA)

  • John Leland

    (Department of Mathematics and Statistics, Northern Arizona University, Flagstaff, AZ 86011, USA)

  • Darby Bergl

    (Center for Ecosystem Science and Society, Northern Arizona University, Flagstaff, AZ 86011, USA
    Department of Biology, Northern Arizona University, Flagstaff, AZ 86011, USA)

  • Yujie Liu

    (Center for Ecosystem Science and Society, Northern Arizona University, Flagstaff, AZ 86011, USA
    School of Informatics, Computing & Cyber Systems, Northern Arizona University, Flagstaff, AZ 86011, USA)

  • Andrew D. Richardson

    (Center for Ecosystem Science and Society, Northern Arizona University, Flagstaff, AZ 86011, USA
    School of Informatics, Computing & Cyber Systems, Northern Arizona University, Flagstaff, AZ 86011, USA)

  • Benjamin Lucas

    (Department of Mathematics and Statistics, Northern Arizona University, Flagstaff, AZ 86011, USA)

Abstract

AmeriFlux is a network of hundreds of sites across the contiguous United States providing tower-based ecosystem-scale carbon dioxide flux measurements at 30 min temporal resolution. While geographically wide-ranging, over its existence the network has suffered from multiple issues including towers regularly ceasing operation for extended periods and a lack of standardization of measurements between sites. In this study, we use machine learning algorithms to predict CO 2 flux measurements at NEON sites (a subset of Ameriflux sites), creating a model to gap-fill measurements when sites are down or replace measurements when they are incorrect. Machine learning algorithms also have the ability to generalize to new sites, potentially even those without a flux tower. We compared the performance of seven machine learning algorithms using 35 environmental drivers and site-specific variables as predictors. We found that Extreme Gradient Boosting (XGBoost) consistently produced the most accurate predictions (Root Mean Squared Error of 1.81 μmolm −2 s −1 , R 2 of 0.86). The model showed excellent performance testing on sites that are ecologically similar to other sites (the Mid Atlantic, New England, and the Rocky Mountains), but poorer performance at sites with fewer ecological similarities to other sites in the data (Pacific Northwest, Florida, and Puerto Rico). The results show strong potential for machine learning-based models to make more skillful predictions than state-of-the-art process-based models, being able to estimate the multi-year mean carbon balance to within an error ±50 gCm −2 y −1 for 29 of our 44 test sites. These results have significant implications for being able to accurately predict the carbon flux or gap-fill an extended outage at any AmeriFlux site, and for being able to quantify carbon flux in support of natural climate solutions.

Suggested Citation

  • Jeffrey Uyekawa & John Leland & Darby Bergl & Yujie Liu & Andrew D. Richardson & Benjamin Lucas, 2025. "Machine Learning-Based Prediction of Ecosystem-Scale CO 2 Flux Measurements," Land, MDPI, vol. 14(1), pages 1-27, January.
  • Handle: RePEc:gam:jlands:v:14:y:2025:i:1:p:124-:d:1563163
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2073-445X/14/1/124/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2073-445X/14/1/124/
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jlands:v:14:y:2025:i:1:p:124-:d:1563163. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.