Skip to main content
Integrating Bayesian Inference and Supervised Learning for Predictive Modeling of Coffee Rust Incidence Among Kenyan Smallholder Farmers

Integrating Bayesian Inference and Supervised Learning for Predictive Modeling of Coffee Rust Incidence Among Kenyan Smallholder Farmers

This is a Preprint and has not been peer reviewed. This is version 1 of this Preprint.

Add a Comment

You must log in to post a comment.


Comments

There are no comments or no comments have been made public for this article.

Downloads

Download Preprint

Supplementary Files

Authors

Maurice Wanyonyi , Jacqueline Gogo Akelo , Veronicah Nyokabi Njenga, Frankline Obwoge Keraro , Titus Mutua Kioko

Abstract

Hemileia vastatrix a pathogenic fungus which causes coffee leaf rust, has been a significant challenge to the sustainability of Arabica coffee production in Kenya, where smallholder farmers experience frequent yield losses and lack access to effective control techniques. To manage it effectively, there is a need for predictive frameworks that quantify the risk and uncertainty of disease occurrence under real-world farm conditions. This research paper introduces a hybrid application model that combines both Bayesian hierarchical modelling and supervised machine learning to produce probabilistic predictions of coffee rust occurrence. The models were based on longitudinal data from 9,850 plots in six primary coffee-producing counties, in which microclimatic moisture — particularly leaf wetness duration and relative humidity were the key predictors of infection. Partial dependence and SHAP analyses provided evidence of nonlinear threshold effects: high humidity and extended leaf wetness significantly increased the likelihood of infection, and proximity to already infected farms amplified disease transmission. The most accurate predictive algorithm in the evaluated set was Logistic Regression, with the highest predictive accuracy (AUC-ROC = 0.867) and, at the same time, was interpretable and computationally efficient. The Bayesian hierarchical model also accounted for county-level heterogeneity and provided a robust measure of uncertainty. The findings, taken together, show that it is possible to develop a probabilistic modelling framework that can be replicated and interpreted to predict sustainable coffee disease. The proposed system provides a scalable, data-driven decision-support instrument to guide precision management plans and make smallholder coffee systems in Kenya and other tropical settings more resilient.

DOI

https://doi.org/10.31223/X5MF27

Subjects

Bioresource and Agricultural Engineering

Keywords

Bayesian hierarchical modeling, Supervised machine learning, Coffee leaf rust, Climate-informed disease prediction

Dates

Published: 2025-10-29 22:17

Last Updated: 2025-10-29 22:17

License

CC BY Attribution 4.0 International

Additional Metadata

Data Availability (Reason not available):
The data for this research will be made available upon request

Conflict of interest statement:
Author's declare no competing interests exist