Skip to main content
A scalable framework for soil property mapping tested across a highly diverse tropical data-scarce region

A scalable framework for soil property mapping tested across a highly diverse tropical data-scarce region

This is a Preprint and has not been peer reviewed. The published version of this Preprint is available: https://doi.org/10.1016/j.soilad.2025.100064. This is version 4 of this Preprint.

Add a Comment

You must log in to post a comment.


Comments

There are no comments or no comments have been made public for this article.

Downloads

Download Preprint

Authors

Rodrigo Miranda, Rodolfo L. B. Nobrega , Estevão Silva, Jadson Silva, José Araújo Filho, Magna Moura, Alexandre Barros, Alzira Souza, Anne Verhoef, Wanhong Yang, Hui Shao, Raghavan Srinivasan, Feras Ziadat, Suzana Montenegro, Maria Araújo, Josiclêda Galvíncio

Abstract

Reliable soil property maps are essential for environmental modeling, yet conventional mapping methods remain costly and time-consuming. We developed a machine learning framework that integrates the Soil-Landscape Estimation and Evaluation Program (SLEEP) with gradient boosting to predict soil properties at regional scales and multiple depths. Our approach addresses multicollinearity through a recursive feature selection algorithm. We applied this framework to a tropical region characterized by a ~700-km longitudinal gradient of contrasting topography, climate, and vegetation (~98,000 km²; NE Brazil), where scarce soil physicochemical data limit environmental modeling. We used six topographical, ten climate, and two vegetation covariates, along with data from 223 soil profiles (~1 profile per 440 km²). Training and testing of our framework demonstrated strong spatial performance (r² = 0.79–0.98 and percent bias = -1.39 to 1.14%). Topographic and climatic factors held greater weight than other variables in predicting soil layers, texture, and sum of bases. Moreover, we used our soil parameters combined with multiple pedotransfer functions (PTFs) to derive soil hydraulic properties. Our PTFs-derived estimates of hydraulic conductivity were considerably lower than high-resolution global predictions available for our study area due to differences in clay fraction and mineralogy. Therefore, we recommend the use of region-specific PTFs for hydraulic properties based on multi-covariate soil property maps. This cost-effective framework accurately integrates diverse environmental covariates, adapts to varying soil data availability, and scales across spatial resolutions, making it highly transferable to other data-scarce regions.

DOI

https://doi.org/10.31223/X57P9W

Subjects

Environmental Monitoring, Soil Science, Statistical Models

Keywords

Gradient Boosting Model, Decision trees, Sleep, Soil properties, tropics, Pernambuco.

Dates

Published: 2022-07-22 15:15

Last Updated: 2025-07-07 22:19

Older Versions

License

CC BY Attribution 4.0 International

Additional Metadata

Conflict of interest statement:
None.

Data Availability (Reason not available):
https://zenodo.org/record/5918544