Skip to main content
Enhancing the Generalization of Flood Susceptibility Models: A Leakage-Aware Ensemble Framework for Deltaic Landscapes

Enhancing the Generalization of Flood Susceptibility Models: A Leakage-Aware Ensemble Framework for Deltaic Landscapes

This is a Preprint and has not been peer reviewed. This is version 1 of this Preprint.

Add a Comment

You must log in to post a comment.


Comments

There are no comments or no comments have been made public for this article.

Downloads

Download Preprint

Authors

Shafiq Mahmud, Golam Murad , Md. Aftabur Rahman

Abstract

Flood susceptibility mapping (FSM) is a cornerstone of disaster risk reduction in low-lying deltaic regions; however, conventional machine learning (ML) applications frequently suffer from spatial data leakage, resulting in inflated performance metrics and unreliable hazard predictions. To address this critical methodological shortcoming, this study develops a robust, leakage-aware ML framework for FSM in the highly vulnerable Greater Noakhali region of Bangladesh. Utilizing a flood inventory derived from Sentinel-1 SAR imagery of the catastrophic August 2024 event, we integrated twelve spatially explicit conditioning factors representing topographic, hydrological, and anthropogenic drivers. To ensure true model generalization and mitigate the effects of spatial autocorrelation, eight ensemble algorithms were evaluated using a rigorous 10-fold blocked spatial cross-validation strategy. The spatially constrained evaluation provided realistic performance estimates, in contrast to the overly optimistic ROC-AUC values typical of conventional random splits, with Random Forest demonstrating the highest generalization capability (ROC-AUC = 0.627) and XGBoost achieving the optimal balance between precision and recall (F1 = 0.451). Furthermore, SHAP (SHapley Additive exPlanations) analysis of the ensemble models identified land use, vegetation density (NDVI), and drainage density as the primary physical drivers of flood susceptibility in this coastal tract. By demonstrating that spatially robust validation is essential for credible geospatial AI, the resulting maps identify critical high-risk zones along the Meghna estuary and provide a physically plausible, operationally relevant tool for regional disaster management.

DOI

https://doi.org/10.31223/X5JJ5S

Subjects

Civil and Environmental Engineering, Engineering, Environmental Engineering, Hydraulic Engineering

Keywords

Flood susceptibility mapping, Spatial data leakage, Spatial cross-validation, Machine learning, Random Forest, XGBoost, SHAP analysis, Remote sensing, Bangladesh, Noakhali region.

Dates

Published: 2026-05-08 08:34

Last Updated: 2026-05-09 03:32

License

CC-By Attribution-NonCommercial-NoDerivatives 4.0 International

Additional Metadata

Conflict of interest statement:
The authors have no conflict of interest.

Data Availability:
The data used in this study are included within the article and can be made available from the corresponding author upon reasonable request.

Metrics

Views: 34

Downloads: 11