High resolution landslide susceptibility mapping using ensemble machine learning and geospatial big data

This is a Preprint and has not been peer reviewed. The published version of this Preprint is available: https://doi.org/10.1016/j.catena.2023.107653. This is version 2 of this Preprint.

Add a Comment

You must log in to post a comment.


Comments

There are no comments or no comments have been made public for this article.

Downloads

Download Preprint

Authors

nirdesh sharma , Manabendra Saharia, GV Ramana

Abstract

Landslide susceptibility represents the potential of slope failure for given geo-environmental conditions. The existing landslide susceptibility maps suffer from several limitations, such as being based on limited data, heuristic methodologies, low spatial resolution, and small areas of interest. In this study, we overcome all these limitations by developing a probabilistic framework that combines imbalance handling and ensemble machine learning for landslide susceptibility mapping. We employ a combination of One -Sided Selection and Support Vector Machine Synthetic Minority Oversampling Technique (SVMSMOTE) to eliminate class imbalance and develop smaller representative data from big data for model training. A blending ensemble approach using hyperparameter tuned Artificial Neural Networks, Random Forests, and Support Vector Machine, is employed to reduce the uncertainty associated with a single model. The methodology provides the landslide susceptibility probability and a landslide susceptibility class. A thorough evaluation of the framework is performed using receiver operating characteristic curves, confusion matrices, and the derivatives of confusion matrices. This framework is used to develop India's first national-scale machine learning based landslide susceptibility map. The landslide database is carefully curated from global and local inventories, and the landslide conditioning factors are selected from a multitude of geophysical and climatological variables. The Indian Landslide Susceptibility Map (ILSM) is developed at a resolution of 0.001o (~100 m) and is classified into five classes: very low, low, medium, high, and very high. We report an accuracy of 95.73%, sensitivity of 97.08%, and matthews correlation coefficient (MCC) of 0.915 on test data, demonstrating the accuracy, robustness, and generalizability of the framework for landslide identification. The model classified 4.75% area in India as very highly susceptible to landslides and detected new landslide susceptible zones in the Eastern Ghats, hitherto unreported in the government landslide records. The ILSM is expected to aid policymaking in disaster risk reduction and developing landslide prediction models.

DOI

https://doi.org/10.31223/X5BT3X

Subjects

Civil and Environmental Engineering

Keywords

Ensemble Learning, big data, One-Sided Selection (OSS), Support Vector Machine Synthetic Minority Oversampling Technique (SVMSMOTE), High resolution landslide susceptibility

Dates

Published: 2023-11-21 16:29

Older Versions
License

CC-BY Attribution-NonCommercial 4.0 International

Additional Metadata

Conflict of interest statement:
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Data Availability (Reason not available):
1) Zenodo: https://zenodo.org/doi/10.5281/zenodo.10085271 2) Google Earth engine: • var ILSM_class = ee.Image(“projects/ee-nirdeshsharmanith1/assets/ ILSM”) • var ILSM_probability = ee.Image(“projects/ee-nirdeshsharmanith1/ assets/ILSM_probability”)