A Novel Approach for Predicting Large Wildfires Using Machine Learning Towards Environmental Justice via Environmental Remote Sensing and Atmospheric Reanalysis Data across the United States

This is a Preprint and has not been peer reviewed. The published version of this Preprint is available: https://doi.org/10.3390/rs15235501. This is version 4 of this Preprint.

Add a Comment

You must log in to post a comment.


Comments

There are no comments or no comments have been made public for this article.

Downloads

Download Preprint

Authors

Nikita Agrawal , Peder Nelson, Russanne Low

Abstract

Large wildfires (>125 hectares) in the United States account for over 95% of the burned area each year. Predicting large wildfires is imperative, however, current wildfire predictive models are region-based and computationally intensive. Using a scalable model based on easily available environmental and atmospheric data, this research aims to accurately predict across the United States whether large wildfires will develop. The data used in this study includes 2,109 wildfires over 20 years, representing 14 million hectares burned. Remote sensing environmental data (Normalized Difference Vegetation Index - NDVI, Enhanced Vegetation Index - EVI, Leaf Area Index - LAI, Fraction of Photosynthetically Active Radiation - FPAR, Land Surface Temperature during the Day - LST Day, and Land Surface Temperature during the Night - LST Night) consisting of 1.3 billion satellite observations was used. Atmospheric reanalysis data (u component of wind, v component of wind, relative humidity, temperature, and geopotential) at four pressure levels (300, 500, 700, and 850 Ha) were also factored. Six machine learning classification models (Logistic Regression, Decision Tree, Random Forest, eXtreme Gradient Boosting, K-Nearest-Neighbors, and Support Vector Machine) were created and tested on the resulting dataset to determine their accuracy in predicting large wildfires. Model validation tests and variable importance analysis were performed. The eXtreme Gradient Boosting (XGBoost) Classification model performed the best in predicting large wildfires, with 90.44% accuracy, a True Positive Rate of 0.92, and a True Negative Rate of 0.88. Furthermore, towards environmental justice, an analysis was performed to identify disadvantaged communities that are also vulnerable to wildfires. This model can be used by wildfire safety organizations to predict large wildfires with high accuracy and prioritize resource allocation to employ protective safeguards for impacted socioeconomically disadvantaged communities.

DOI

https://doi.org/10.31223/X50X13

Subjects

Earth Sciences, Environmental Sciences

Keywords

wildfires, machine learning, environmental justice, remote sensing, Atmospheric, MODIS, ERA5, burned area, climate change, python

Dates

Published: 2023-10-25 19:14

Last Updated: 2023-11-29 18:53

Older Versions
License

CC-By Attribution-NonCommercial-NoDerivatives 4.0 International

Additional Metadata

Conflict of interest statement:
None