Advancing river corridor science beyond disciplinary boundaries with an inductive approach to catalyse hypothesis generation

This is a Preprint and has not been peer reviewed. The published version of this Preprint is available: https://doi.org/10.1002/hyp.14540. This is version 5 of this Preprint.

Add a Comment

You must log in to post a comment.


Comments

There are no comments or no comments have been made public for this article.

Downloads

Download Preprint

Authors

Adam Scott Ward , Jennifer Drummond, Angang Li, Anna Lupon, Marie Kurz, Jay Zarnetske, James Stegen, Eugenia Marti, Valerie Ouellet, Nicolai Brekenfeld, Feng Mao, Emily Graham, Susana Bernal, Stefan Krause, David Hannah, Aaron Packman, Todd Royer, Megan Klaar

Abstract

A unified conceptual framework for river corridors requires synthesis of diverse site-, method- and discipline-specific findings. The river research community has developed a substantial body of observations and process-specific interpretations, but we are still lacking a comprehensive model to distill this knowledge into fundamental transferable concepts. We confront the challenge of how a discipline classically organized around the deductive model of systematically collecting of site-, scale-, and mechanism-specific observations begins the process of synthesis. Machine learning is particularly well-suited to inductive generation of hypotheses. In this study, we prototype an inductive approach to holistic synthesis of river corridor observations, using support vector machine regression to identify potential couplings or feedbacks that would not necessarily arise from classical approaches. This approach generated 672 relationships linking a suite of 157 variables each measured at 62 locations in a fifth order river network. Eighty four percent of these relationships have not been previously investigated, and representing potential (hypothetical) process connections. We document relationships consistent with current understanding including hydrologic exchange processes, microbial ecology, and the River Continuum Concept, supporting that the approach can identify meaningful relationships in the data. Moreover, we highlight examples of two novel research questions that stem from interpretation of inductively-generated relationships. This study demonstrates the implementation of machine learning to sieve complex data sets and identify a small set of candidate relationships that warrant further study, including data types not commonly measured together. This structured approach complements traditional modes of inquiry, which are often limited by disciplinary perspectives and favour the careful pursuit of parsimony. Finally, we emphasize that this approach should be viewed as a complement to, rather than in place of, more traditional, deductive approaches to scientific discovery.

DOI

https://doi.org/10.31223/X54W44

Subjects

Hydrology

Keywords

machine learning, stream corridor, inductive

Dates

Published: 2021-05-08 12:07

Last Updated: 2022-04-02 10:17

Older Versions
License

CC BY Attribution 4.0 International

Additional Metadata

Conflict of interest statement:
None

Data Availability (Reason not available):
http://www.hydroshare.org/resource/de6d92d314354ea6819157818669fc59