Skip to main content
Lowering barriers to probing high-frequency variations in river chemistry through a frugal machine learning-based framework

Lowering barriers to probing high-frequency variations in river chemistry through a frugal machine learning-based framework

This is a Preprint and has not been peer reviewed. This is version 1 of this Preprint.

Add a Comment

You must log in to post a comment.


Comments

There are no comments or no comments have been made public for this article.

Downloads

Download Preprint

Authors

Amita Prajna Mallik , Antoine Lucas, Eric Gayer, Jérôme Gaillardet

Abstract

High-frequency river chemistry monitoring is crucial for capturing transient hydro-geochemical variations and ensuring water security, yet its implementation is limited by logistical and budgetary constraints. Here we present a machine learning-based framework that integrates continuous, low-cost physico-chemical proxies with sparse ‘anchoring’ solute measurements to reconstruct hourly-scale variations in major dissolved metals and nutrients. Validated across three contrasting catchments, we demonstrate that daily to semi-weekly sampling suffice to achieve accurate reconstructions (NSE > 0.75), though sampling demand varies considerably across hydrological regimes. Prediction of nutrients (e.g. NO₃⁻ and K⁺) may require more frequent anchoring observations under stormflow conditions. Reconstruction accuracy is sustained even under multi-week voids in anchoring observations, demonstrating resilience to commonplace logistical disruptions. We complement this framework with a novel gap imputation method based on Singular Spectrum Analysis to address missing data in proxy time series, which outperforms traditional gap-filling approaches. Our results support a scalable, low-maintenance strategy that enables a >95% reduction in operational costs and carbon emissions associated with high-frequency monitoring; and provide a transferable, hydrological regime-specific roadmap for optimizing field sampling that minimizes logistical burden while maintaining reconstruction accuracy.

DOI

https://doi.org/10.31223/X5RV17

Subjects

Biogeochemistry, Environmental Monitoring, Hydrology, Water Resource Management

Keywords

critical zone, geochemistry, machine learning, water quality, high-frequency monitoring

Dates

Published: 2026-04-11 06:23

Last Updated: 2026-04-11 06:23

License

CC-By Attribution-NonCommercial-NoDerivatives 4.0 International

Additional Metadata

Conflict of interest statement:
None

Metrics

Views: 62

Downloads: 5