Skip to main content
Multi-Model Machine Learning Analysis of Urban Temperature Trends: A Comparative Study on Climate Change Impacts in U.S. Cities of Midwest KANI Region

Multi-Model Machine Learning Analysis of Urban Temperature Trends: A Comparative Study on Climate Change Impacts in U.S. Cities of Midwest KANI Region

This is a Preprint and has not been peer reviewed. This is version 1 of this Preprint.

Add a Comment

You must log in to post a comment.


Comments

There are no comments or no comments have been made public for this article.

Downloads

Download Preprint

Authors

Mirza Md Tasnim Mukarram, Quazi Umme Rukiya, Marc Linderman

Abstract

Urban temperature prediction is critical for regional climate planning, environmental monitoring, and thermal hazard mitigation. This study employs a multi-model supervised machine learning framework to predict and forecast daily urban air temperatures and evaluate model performance across key counties in the U.S. Midwest KANI region: Polk (IA), Pulaski (AR), Lancaster (NE), and Johnson (KS), encompassing 38 urban centres. Using ERA5-Land reanalysis data (2000-2024) from the cloud-based Google Earth Engine platform, this study compares six regression-based ML models: Linear Regression, Random Forest, XGBoost, Support Vector Regression (SVR), K-Nearest Neighbors (KNN), and Decision Tree to evaluate their predictive efficacy in forecasting urban temperature changes. Using 288 months of temperature data across 4 major economically important counties, we trained and evaluated each model using R², RMSE, and MAE metrics. Ensemble tree-based models XGBoost and Random Forest achieved the strongest performance in daily temperature forecasting across all counties from 2020 to 2024, with R² values around 0.91, RMSE between 2.60°C and 3.54°C, and MAE as low as 1.88°C. These models successfully captured seasonal dynamics, with forecasted daily temperatures ranging from –15°C during winter extremes to over 31°C in summer peaks. A Friedman test followed by Nemenyi post-hoc analysis confirmed that Decision Tree significantly underperformed compared to XGBoost (p = 0.04) and SVR (p = 0.03), while XGBoost, RF, SVR, and KNN formed a statistically indistinguishable high-performance cluster (p > 0.05). Linear Regression and Decision Tree were both outside this group, exhibiting poorer accuracy and greater bias, particularly in extreme conditions. These findings emphasize the superior reliability of ensemble methods for operational climate forecasting and highlight the practical forecast range of –15°C to +32°C, enabling precise early warning systems for climate adaptation and heat risk planning across vulnerable counties in the U.S. Midwest.

DOI

https://doi.org/10.31223/X5B44D

Subjects

Education, Engineering, Life Sciences

Keywords

Urban Temperature Prediction, trend analysis, Early Warning System, Environmental Modelling, environmental modelling, Early warning system, Trend Analysis, machine learning models

Dates

Published: 2025-07-02 04:28

Last Updated: 2025-07-02 23:26

License

CC-By Attribution-NonCommercial-NoDerivatives 4.0 International

Additional Metadata

Conflict of interest statement:
The authors declare no conflict of interest.

Data Availability (Reason not available):
All data will be made available upon request