Skip to main content
Territorially-Specialized Machine Learning Models for Wildfire Risk Prediction Across Argentina Using Satellite Data and H3 Hexagonal Grids

Territorially-Specialized Machine Learning Models for Wildfire Risk Prediction Across Argentina Using Satellite Data and H3 Hexagonal Grids

This is a Preprint and has not been peer reviewed. This is version 1 of this Preprint.

Add a Comment

You must log in to post a comment.


Comments

There are no comments or no comments have been made public for this article.

Downloads

Download Preprint

Authors

federico nicolas sinato, camila rivas

Abstract

Wildfire risk prediction in large, ecologically diverse countries requires models that account
for regional variation in fire drivers. We present GeoAlertAR-ML, a wildfire risk prediction
system for Argentina that uses an ensemble of regionally specialized Random Forest
classifiers operating over a national hexagonal grid of 13,231 H3 cells. Unlike global fire
danger indices or single-model approaches, our system trains independent models for each
of five ecological regions (Centro, Cuyo, NEA, NOA, Patagonia), routed by a K-Nearest
Neighbors classifier with 99.96% accuracy. The system ingests multisource satellite data —
including GFS meteorological fields, GSMaP precipitation, MODIS vegetation indices,
SRTM topography, Dynamic World land cover, and GHSL population density — and
produces daily hexagon-level risk predictions in four categories (Low, Moderate, High,
Extreme). Cross-validated F1-score averaged 93.2% across regions using GroupKFold to
prevent spatial data leakage. Operational validation against NASA FIRMS hotspots
confirmed 100% detection of active fire clusters in zones predicted as High or Extreme risk,
including the February 2026 Comarca Andina crisis (280+ FIRMS hotspots, 68–72%
predicted risk). A complementary 7-day forecasting model (Model B) based on regional
XGBoost classifiers trained on GFS hindcast data achieved 77.8% F1-score with minimal
degradation across forecast horizons (0.3–3.4% from day+1 to day+7). Additionally, the
system identifies potential anthropogenic ignition anomalies by flagging discrepancies
between low predicted meteorological risk and observed fire activity. GeoAlertAR-ML
demonstrates that regionally specialized models, trained on territory-specific fire histories,
outperform generic global approaches for national-scale wildfire risk assessment. The
system has been operational since late 2025 and won the NASA Space Apps Challenge
2025 Best Mission Concept award.
Keywords: wildfire prediction, machine learning, Random Forest, XGBoost, satellite remote
sensing, H3 hexagonal grid, Argentina, regional models, FIRMS validation

DOI

https://doi.org/10.31223/X5MV0M

Subjects

Earth Sciences, Environmental Sciences

Keywords

wildfire prediction, machine learning, Random Forest, XGBoost, satellite remote sensing, Argentina, regional models, FIRMS validation

Dates

Published: 2026-03-14 06:23

Last Updated: 2026-03-14 06:23

License

CC-By Attribution-NonCommercial-NoDerivatives 4.0 International

Additional Metadata

Conflict of interest statement:
None

Data Availability:
All satellite data used in this study is publicly available through Google Earth Engine. Code availability requests may be directed to the corresponding author

Metrics

Views: 12

Downloads: 0