Skip to main content
Hydrological modeling in a highly urbanized watershed using explainable machine learning and sub-hourly data: A case study in the city of Sao Paulo, Brazil

Hydrological modeling in a highly urbanized watershed using explainable machine learning and sub-hourly data: A case study in the city of Sao Paulo, Brazil

This is a Preprint and has not been peer reviewed. This is version 2 of this Preprint.

Add a Comment

You must log in to post a comment.


Comments

There are no comments or no comments have been made public for this article.

Downloads

Download Preprint

Authors

Fernando Saraiva-Filho, Elton Escobar-Silva, Marcos G. Quiles, Diego H. Stalder, Leonardo B. L. Santos

Abstract

Hydrological modeling of urbanized watersheds is a highly challenging task due to the complexity and non-linearity of the rainfall-runoff relationship in these areas. Many data-driven models have been proposed in the literature to address this problem. However, in this field, there is a need not only for performance but also for explainability and comprehension of the impacts of hydrometeorological factors. This study proposes a detailed comparative analysis between ensemble machine learning models using an explainable framework. We explore feature engineering and feature selection techniques to determine the best set of predictors in a situation of non-continuous data, a common problem in real-world scenarios. Among the models analysed, CatBoost stood out as the best-performing algorithm for most cases, and, in general, all the ensemble algorithms achieved good performance for a forecasting horizon up to 3 hours. A study with SHAP values revealed insightful aspects of the spatial and temporal dynamics of the rainfall-runoff relationship.

DOI

https://doi.org/10.31223/X5GF54

Subjects

Engineering

Keywords

Dates

Published: 2026-03-04 15:56

Last Updated: 2026-03-04 15:56

Older Versions

License

No Creative Commons license

Metrics

Views: 7

Downloads: 0