Predicting Harmful Algal Blooms Using Explainable Deep Learning Models: A Comparative Study

This is a Preprint and has not been peer reviewed. This is version 1 of this Preprint.

Add a Comment

You must log in to post a comment.


Comments

There are no comments or no comments have been made public for this article.

Downloads

Download Preprint

Authors

Bekir Zahit Demiray, Omer Mermer, Özlem Baydaroğlu , Ibrahim Demir

Abstract

Harmful algal blooms (HABs) have emerged as a significant environmental challenge, impacting aquatic ecosystems, drinking water supply systems, and human health due to the combined effects of human activities and climate change. This study investigates the performance of deep learning models, particularly the Transformer model, as there are limited studies exploring its effectiveness in HAB prediction, considering multiple influencing parameters including physical, chemical, and biological water quality monitoring data from multiple stations located west of Lake Erie, and uses Shapley Additive Explanations (SHAP) values as an explainable artificial intelligence (AI) tool to identify key input features affecting HABs. Our findings highlight the superiority of deep learning models, especially the Transformer, in capturing the complex dynamics of water quality parameters and providing actionable insights for ecological management. The SHAP analysis identifies particulate organic carbon, particulate organic nitrogen, and total phosphorus as critical factors influencing HAB predictions. This study contributes to the development of advanced predictive models for HABs, aiding in early detection and proactive management strategies.

DOI

https://doi.org/10.31223/X58D9F

Subjects

Environmental Engineering

Keywords

Deep learning, Harmful Algal Bloom (HAB), transformer, Explainable AI, SHAP value, water quality, chlorophyll-a

Dates

Published: 2024-09-29 12:01

Last Updated: 2024-09-29 19:01

License

CC BY Attribution 4.0 International

Additional Metadata

Conflict of interest statement:
None

Data Availability (Reason not available):
All the data used in this study will be made available upon request.