Developing and implementing the use of predictive models for estimating water quality at Great Lakes beaches

Donna S. Francy; Amie M. G. Brady; Rebecca B. Carvin; Steven R. Corsi; Lori M. Fuller; John H. Harrison; Brett A. Hayhurst; Jeremiah Lant; Meredith B. Nevers; Paul J. Terrio; Tammy M. Zimmerman

doi:10.3133/sir20135166

Developing and implementing the use of predictive models for estimating water quality at Great Lakes beaches

Scientific Investigations Report 2013-5166

Coastal and Marine Geology Program Prepared in cooperation with the U.S. Environmental Protection Agency, Great Lakes Restoration Initiative

By: Donna S. Francy, Amie M. G. Brady, Rebecca B. Carvin, Steven R. Corsi, Lori M. Fuller, John H. Harrison, Brett A. Hayhurst, Jeremiah Lant, Meredith B. Nevers, Paul J. Terrio, and Tammy M. Zimmerman

https://doi.org/10.3133/sir20135166

Links

More information: USGS Index Page
Document: Report
Appendixes:
Companion File: Companion Files
Download citation as: RIS | Dublin Core

Abstract

Predictive models have been used at beaches to improve the timeliness and accuracy of recreational water-quality assessments over the most common current approach to water-quality monitoring, which relies on culturing fecal-indicator bacteria such as Escherichia coli (E. coli.). Beach-specific predictive models use environmental and water-quality variables that are easily and quickly measured as surrogates to estimate concentrations of fecal-indicator bacteria or to provide the probability that a State recreational water-quality standard will be exceeded. When predictive models are used for beach closure or advisory decisions, they are referred to as “nowcasts.” During the recreational seasons of 2010-12, the U.S. Geological Survey (USGS), in cooperation with 23 local and State agencies, worked to improve existing nowcasts at 4 beaches, validate predictive models at another 38 beaches, and collect data for predictive-model development at 7 beaches throughout the Great Lakes. This report summarizes efforts to collect data and develop predictive models by multiple agencies and to compile existing information on the beaches and beach-monitoring programs into one comprehensive report.

Local agencies measured E. coli concentrations and variables expected to affect E. coli concentrations such as wave height, turbidity, water temperature, and numbers of birds at the time of sampling. In addition to these field measurements, equipment was installed by the USGS or local agencies at or near several beaches to collect water-quality and metrological measurements in near real time, including nearshore buoys, weather stations, and tributary staff gages and monitors. The USGS worked with local agencies to retrieve data from existing sources either manually or by use of tools designed specifically to compile and process data for predictive-model development.

Predictive models were developed by use of linear regression and (or) partial least squares techniques for 42 beaches that had at least 2 years of data (2010-11 and sometimes earlier) and for 1 beach that had 1 year of data. For most models, software designed for model development by the U.S. Environmental Protection Agency (Virtual Beach) was used. The selected model for each beach was based on a combination of explanatory variables including, most commonly, turbidity, day of the year, change in lake level over 24 hours, wave height, wind direction and speed, and antecedent rainfall for various time periods. Forty-two predictive models were validated against data collected during an independent year (2012) and compared to the current method for assessing recreational water quality-using the previous day’s E. coli concentration (persistence model). Goals for good predictive-model performance were responses that were at least 5 percent greater than the persistence model and overall correct responses greater than or equal to 80 percent, sensitivities (percentage of exceedances of the bathing-water standard that were correctly predicted by the model) greater than or equal to 50 percent, and specificities (percentage of nonexceedances correctly predicted by the model) greater than or equal to 85 percent. Out of 42 predictive models, 24 models yielded over-all correct responses that were at least 5 percent greater than the use of the persistence model. Predictive-model responses met the performance goals more often than the persistence-model responses in terms of overall correctness (28 versus 17 models, respectively), sensitivity (17 versus 4 models), and specificity (34 versus 25 models). Gaining knowledge of each beach and the factors that affect E. coli concentrations is important for developing good predictive models. Collection of additional years of data with a wide range of environmental conditions may also help to improve future model performance. The USGS will continue to work with local agencies in 2013 and beyond to develop and validate predictive models at beaches and improve existing nowcasts, restructuring monitoring activities to accommodate future uncertainties in funding and resources.

Study Area

Additional publication details
Publication type	Report
Publication Subtype	USGS Numbered Series
Title	Developing and implementing the use of predictive models for estimating water quality at Great Lakes beaches
Series title	Scientific Investigations Report
Series number	2013-5166
DOI	10.3133/sir20135166
Year Published	2013
Language	English
Publisher	U.S. Geological Survey
Publisher location	Reston, VA
Contributing office(s)	Ohio Water Science Center
Description	Report: vii, 68 p.; 3 Appendices, Downloads Directory
Country	Canada;United States
Other Geospatial	Great Lakes
Google Analytic Metrics	Metrics page