Skip to main content
CryoSentinel: A Multimodal Foundation-Model Segmenter for Glacial Lakes in High Mountain Asia from Sentinel-1 SAR, Sentinel-2 Optical, and Copernicus DEM Imagery

CryoSentinel: A Multimodal Foundation-Model Segmenter for Glacial Lakes in High Mountain Asia from Sentinel-1 SAR, Sentinel-2 Optical, and Copernicus DEM Imagery

This is a Preprint and has not been peer reviewed. This is version 1 of this Preprint.

Add a Comment

You must log in to post a comment.


Comments

There are no comments or no comments have been made public for this article.

Downloads

Download Preprint

Authors

Abzal Abdrash 

Abstract

Glacial-lake outburst floods (GLOFs) are the dominant climate-driven hazard in High Mountain Asia, and reliable lake-extent segmentation is the prerequisite for every downstream early-warning workflow. We present CryoSentinel, a multimodal foundation-model semantic segmenter built on the IBM/ESA TerraMind 1.0 Large encoder (1.1 B parameters) with a UperNet decoder, fine-tuned on 5,614 co-registered chips spanning the Tien Shan, Zhetysu Alatau, and Ile Alatau ranges of Central Asia. The model jointly ingests Sentinel-1 GRD SAR (VV, VH), Sentinel-2 L2A optical (12 bands), and Copernicus DEM 30 m elevation at 10 m / pixel and is trained under a 17-km-buffered spatial-block-split protocol that rules out lake-level leakage. On the validation split it attains an IoU of 0.9557 at threshold 0.5 with flip-only test-time augmentation, a +4.27 percentage-point improvement over the most directly comparable prior result (Adhikari and Regmi, 2025; arXiv:2512.24117; Sentinel-1-only, IoU 0.9130). On a held-out spatial test split it reports IoU 0.8918; an independent MNDWI cross-check reveals that 7 of the 665 test chips are Kumar-polygon mislabels rather than model failures, and dropping these raises the label-corrected test IoU to 0.9082. We document the eleven configuration corrections that lifted our finetune from 0.825 to 0.9557 IoU, the spatial-block-split protocol, and the label-noise audit procedure in full so the result is reproducible end-to-end. Code (Apache 2.0), pretrained weights, and the 42,237-chip dataset (ODC-By 1.0) are publicly released and assigned permanent DOIs.

DOI

https://doi.org/10.31223/X5N47B

Subjects

Artificial Intelligence and Robotics, Environmental Monitoring, Geomorphology, Glaciology, Hydrology

Keywords

glacial lakes, GLOF, glacial lake outburst flood, semantic segmentation, foundation model, TerraMind, multimodal Earth observation, Sentinel-1, Sentinel-2, Copernicus DEM, High Mountain Asia, Tien Shan, Zhetysu Alatau, Ile Alatau, Central Asia, hazard monitoring, Kazakhstan, climate risk, deep learning, remote sensing

Dates

Published: 2026-05-16 13:28

Last Updated: 2026-05-16 13:28

License

CC BY Attribution 4.0 International

Additional Metadata

Conflict of interest statement:
None

Data Availability:
All code is publicly available at https://github.com/abzalabdrash/cryosentinel under the Apache License 2.0, with a versioned snapshot of the v1.0.0 release archived on Zenodo (DOI: 10.5281/zenodo.20239229). Pretrained model weights are publicly available at https://huggingface.co/abzal-glw/cryosentinel-terramind-v3 under the Apache License 2.0. The training and evaluation dataset (42,237 multimodal chips) is publicly available at https://huggingface.co/datasets/abzal-glw/cryosentinel-glof-v3 under the Open Data Commons Attribution License (ODC-By 1.0), with a permanent DataCite DOI: 10.57967/hf/8823. Upstream data sources (Sentinel-1, Sentinel-2, Copernicus DEM 30 m, Kumar and Vijay 2026 PANGAEA glacial-lake inventory) are openly accessible under ESA Copernicus and PANGAEA CC-BY 4.0 terms respectively.

Metrics

Views: 18

Downloads: 1