Human-in-the-Loop Segmentation of Earth Surface Imagery

Daniel David Buscombe; Evan B Goldstein; Chris Sherwood; Cameron Scott Bodine; Jenna Brown; Jaycee Favela; Sharon Nicole Fitzpatrick; Christine Kranenburg; Jin-Si Over; Andy Ritchie; Jonathan Warrick; Phillipe Wernette

This is a Preprint and has not been peer reviewed. The published version of this Preprint is available: https://doi.org/10.1029/2021EA002085. This is version 1 of this Preprint.

Abstract

Segmentation, or the classification of pixels (grid cells) in imagery, is ubiquitously applied in the natural sciences. Manual methods are often prohibitively time-consuming, especially for images containing small objects and/or significant spatial heterogeneity of colors or textures. Labeling complicated regions of transition, which in Earth surface imagery are represented by collections of mixed pixels, textures, and spectral signatures, can be especially error-prone because such regions are difficult to unmix, identify, and delineate reliably and consistently. However, the success of supervised machine learning (ML) approaches is entirely dependent on good label data. We describe a fast, semi-automated method for interactive segmentation of N-dimensional (x,y,N) images into two-dimensional (x,y) label images. It uses human-in-the-loop ML to achieve consensus between the labeler and a model in an iterative workflow. The technique is reproducible: the sequence of decisions made by the human labeler and the ML algorithms can be encoded to file, so the entire process can be played back and new outputs generated with alternative decisions and/or algorithms. We illustrate the scientific potential of segmentation of imagery of diverse settings and image types using six case studies from river, estuarine, and open coast environments. The imagery, both photographic and non-photographic, consists of 1- and 3-band images on regular and irregular grids ranging from centimeters to tens of meters. We demonstrate high levels of agreement among label images generated by several labelers on the same imagery, and make suggestions for achieving consensus and measuring uncertainty, making the method well suited to widespread application in training supervised ML for image segmentation.
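
As a rough illustration of the workflow the abstract describes, the following minimal sketch turns sparse human annotations on an (x,y,N) image into a dense (x,y) label image, replays a recorded sequence of labeling rounds, and scores pixel-wise agreement between two label images. It is not the authors' implementation: the abstract does not name the model, so a nearest-centroid classifier is swapped in as a stand-in, and every function name (`fit_centroids`, `predict_labels`, `segment`, `save_session`, `load_session`, `agreement`) and the tiny example image are hypothetical.

```python
import json

def fit_centroids(image, annotations):
    """Mean band vector per class, from the sparsely annotated pixels only."""
    sums, counts = {}, {}
    for (r, c), cls in annotations.items():
        pixel = image[r][c]
        acc = sums.setdefault(cls, [0.0] * len(pixel))
        for b, value in enumerate(pixel):
            acc[b] += value
        counts[cls] = counts.get(cls, 0) + 1
    return {cls: [s / counts[cls] for s in acc] for cls, acc in sums.items()}

def predict_labels(image, centroids):
    """Assign every pixel to the class with the nearest centroid (stand-in model)."""
    def nearest(pixel):
        return min(
            centroids,
            key=lambda cls: sum((p - q) ** 2 for p, q in zip(pixel, centroids[cls])),
        )
    return [[nearest(pixel) for pixel in row] for row in image]

def segment(image, rounds):
    """Replay labeling rounds: each round adds annotations, then the model refits."""
    annotations, labels = {}, None
    for new_annotations in rounds:
        annotations.update(new_annotations)
        labels = predict_labels(image, fit_centroids(image, annotations))
    return labels

def save_session(rounds):
    """Encode the sequence of labeler decisions as JSON so it can be played back."""
    return json.dumps([[[r, c, cls] for (r, c), cls in rnd.items()] for rnd in rounds])

def load_session(text):
    """Decode a session written by save_session."""
    return [{(r, c): cls for r, c, cls in rnd} for rnd in json.loads(text)]

def agreement(labels_a, labels_b):
    """Fraction of pixels on which two label images agree."""
    total = sum(len(row) for row in labels_a)
    same = sum(a == b for ra, rb in zip(labels_a, labels_b) for a, b in zip(ra, rb))
    return same / total

# Hypothetical 2 x 3 image with N = 3 bands: dark blue "water" (class 0), bright "sand" (class 1).
image = [
    [(10, 20, 200), (12, 25, 190), (220, 210, 160)],
    [(15, 30, 180), (200, 190, 150), (230, 220, 170)],
]
rounds_a = [{(0, 0): 0, (1, 2): 1}]   # labeler A marks one pixel per class
rounds_b = [{(0, 1): 0}, {(1, 1): 1}] # labeler B needs two rounds

labels_a = segment(image, rounds_a)
labels_b = segment(image, load_session(save_session(rounds_b)))
print(labels_a)
print(agreement(labels_a, labels_b))
```

Storing only the labeler's decisions, rather than the output, is what makes the session replayable: the same log can be passed to `segment` again after swapping `predict_labels` for a different model.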

DOI

https://doi.org/10.31223/X59K83

Subjects

Engineering, Physical Sciences and Mathematics

Keywords

Data Labeling, Interlabeler agreement, Gridded data, Earth surface processes, Geospatial analysis and map creation, map creation

Dates

Published: 2021-10-16 00:31

Last Updated: 2021-10-16 07:31

License

CC BY Attribution 4.0 International