Skip to main content
Machine Learning–Based Prediction of Atmospheric CO₂ Concentration: A Year– Month Trend analysis

Machine Learning–Based Prediction of Atmospheric CO₂ Concentration: A Year– Month Trend analysis

This is a Preprint and has not been peer reviewed. This is version 1 of this Preprint.

Add a Comment

You must log in to post a comment.


Comments

There are no comments or no comments have been made public for this article.

Downloads

Download Preprint

Supplementary Files

Authors

Israt Jahan Powsi, Rayhan Miah, Md Khorshed Alam

Abstract

Atmospheric carbon dioxide (CO₂) remains the principal
driver of contemporary climate change, yet accurately
forecasting its temporal evolution requires models
capable of capturing complex nonlinear and seasonal
dynamics. In this study, we conduct a comprehensive
evaluation of thirteen supervised machine-learning
algorithms to model and predict long-term atmospheric
CO₂ concentrations using multi-decadal monthly and
annual observational datasets. The dataset encompasses
several decades of global CO₂ measurements, enabling a
detailed investigation of both persistent climatological
trends and short-term oscillatory variations.
All models were trained under a unified workflow and
assessed using a standardized performance matrix
comprising R², RMSE, and MAE. Among the tested
algorithms, Random Forest Regression and a multilayer
Artificial Neural Network (ANN) consistently
outperformed other classical and ensemble methods,
achieving R² values greater than 0.95 and demonstrating
exceptional robustness against noise and seasonal
irregularities. Time-series diagnostics further reveal a
sustained, near-exponential increase in global CO₂
levels, reflecting intensified anthropogenic influence,
reduced carbon-sink efficiency, and accelerating
feedback mechanisms in the Earth system.
The results highlight the utility of machine-learning
techniques as reliable and scalable tools for atmospheric
CO₂ forecasting, offering improved sensitivity to
nonlinearities compared to traditional statistical
approaches. Importantly, the analytical framework
developed in this work is extensible and can readily
integrate additional environmental variables such as
oceanic carbon-flux parameters, biospheric exchange
indices, remote-sensing products, or even emerging
acoustic-signature datasets to construct more holistic,
multimodal prediction systems. By establishing a strong
methodological baseline, this study contributes to the
advancement of next-generation climate-monitoring and
predictive-analytics systems, supporting ev

DOI

https://doi.org/10.31223/X5XX84

Subjects

Environmental Education

Keywords

machine learning, CO2, XGBoost, Deep learning, climate, ocean, Environtment

Dates

Published: 2025-12-12 17:44

Last Updated: 2025-12-12 17:44

License

CC BY Attribution 4.0 International

Additional Metadata

Data Availability (Reason not available):
https://doi.org/10.7910/DVN/C11P7K