A Reproducible and Reusable Pipeline for Segmentation of Geoscientific Imagery

This is a Preprint and has not been peer reviewed. The published version of this Preprint is available: https://doi.org/10.1029/2022EA002332. This is version 1 of this Preprint.

Add a Comment

You must log in to post a comment.


Comments

There are no comments or no comments have been made public for this article.

Downloads

Download Preprint

Authors

Daniel David Buscombe , Evan B Goldstein 

Abstract

Segmentation of Earth science imagery is an increasingly common task. Among modern techniques that use Deep Learning, the UNet architecture has been shown to be a reliable for segmenting a range of imagery. We developed software - Segmentation Gym - to implement a data-model pipeline for segmentation of scientific imagery using a family of UNet models. With an existing set of imagery and labels, the software uses a single configuration file that handles dataset creation, as well as model setup and model training. Key benefits of this software are a) the focus on reproducible dataset creation and modeling, and b) the ability for quick model experimentation through changes to a configuration file. Quick experimentation permits researchers to prototype different model architectures, sizes, and adjust common hyperparameters to find a suitable model. We demonstrate the use of the software using a dataset of 419 labeled Landsat-8 scenes of coastal environments and compare results across two model architectures, five model sizes, and three loss functions. This demonstration highlights that our software enables rapid, reproducible experimentation to determine optimal hyperparameters for specific datasets and research questions.

DOI

https://doi.org/10.31223/X5HS81

Subjects

Education, Engineering, Life Sciences, Physical Sciences and Mathematics

Keywords

Deep learning, machine learning, Image Segmentation, opinionated software, Landsat, remote sensing, Coastal processes

Dates

Published: 2022-09-07 07:55

Last Updated: 2022-09-07 11:55

License

CC BY Attribution 4.0 International

Additional Metadata

Conflict of interest statement:
None

Data Availability (Reason not available):
https://doi.org/10.5066/P91NP87I