Development of a machine learning approach for local-scale ozone and PM2.5 forecasting: Application to multiple AQS sites in the Pacific Northwest

This is a Preprint and has not been peer reviewed. This is version 2 of this Preprint.

Add a Comment

You must log in to post a comment.


Comments

There are no comments or no comments have been made public for this article.

Downloads

Download Preprint

Authors

Kai Fan, Ranil Dhammapala, Kyle Harrington, Brian K. Lamb, Yunha Lee 

Abstract

A machine learning (ML) based modeling framework has been successfully used to provide operational forecasts of O3 at Kennewick, WA. This paper shows its performance when applied to other observation locations to predict O3 and PM2.5 concentrations. The 10-time, 10-fold cross-validation method was used to evaluate the model performance in the Pacific Northwest (PNW). Similar to Kennewick, ML1 captures more high-O3 events, but also generates more false alarms, and the accuracy of ML2 is better (R2 = 0.79), especially for low-O3 events. Compared to AIRPACT, the combined modeling framework reduces the normalized mean bias (NMB) from 7.6% to 2.6%. In terms of Air Quality Index (AQI) forecasts, improvements occur for each AQI level which reflects more accurate O3 predictions and better capture of more high-O3 events. For PM2.5, ML1 and ML2 demonstrate similar capabilities to predict high-PM2.5 events and ML2 keeps its accuracy for low-PM2.5 predictions, so there is no need to combine the two methods. During the evaluation period, AIRPACT under-predicts the wildfire season PM2.5 concentrations in the PNW (NMB = -27%) and over-predicts at some sites in the cold season up to 200%, while ML2 has a lower NMB in both seasons (NMB = 7.9% in the wildfire season and 2.2% in the cold season) and correctly captures more high-PM2.5 events. The ML modeling framework is now operational for forecasts of O3 and PM2.5 at over 100 observation sites in the PNW.

DOI

https://doi.org/10.31223/X5WW6Q

Subjects

Atmospheric Sciences

Keywords

machine learning, Air quality forecasts, Ozone, PM2.5, random forestmultiple linear regression, Random Forest, multiple linear regression

Dates

Published: 2022-05-24 23:22

Older Versions
License

CC BY Attribution 4.0 International

Additional Metadata

Data Availability (Reason not available):
The AQS observation data are available from US EPA. The AIRPACT simulation data are available from Laboratory for Atmospheric Research, Washington State University. We acknowledge the WRF database from University of Washington.