Award Date
12-1-2023
Degree Type
Dissertation
Degree Name
Doctor of Philosophy (PhD)
Department
Civil and Environmental Engineering and Construction
First Committee Member
Sajjad Ahmad
Second Committee Member
Haroon Stephen
Third Committee Member
David James
Fourth Committee Member
Jacimaria Batista
Fifth Committee Member
Pushkin Kachroo
Number of Pages
679
Abstract
This research was aimed at exploring innovative and cost-effective tools in understanding the spatiotemporal variability of water quality parameters in the Colorado River Basin (CRB), which includes the Colorado River and major reservoirs and lakes in the USA including Lake Mead. The river which arises in the state of Colorado and empties into the Republic of Mexico at the Gulf of California, is a source of water to seven US states and the Republic of Mexico and provides water to about 40 million people and million acres of farmlands in seven states in the western US and the Republic of Mexico. Monitoring of water quality parameters (WQPs) has been traditionally carried out using field and laboratory methods. Although these methods offer standardized and accurate means of measuring these WQPs, they face some limitations including limited coverages, high labor and laboratory costs, and limited spatiotemporal sampling frequencies.
This study applied and integrated statistical techniques including descriptive and inferential statistics, remote sensing (RS) images, and machine learning (ML) models categorized as standalone and ensemble models as cost-effective tools to estimate the varying spatiotemporal concentrations of WQPs in Lake Mead and the Colorado River system. RS spectral signatures from Sentinel-2A/B Multi-Spectral Instrument (MSI) and Landsat 8 Operational Land Imager (OLI) images were retrieved from the Google Earth Engine (GEE) repository for the study.
This research used field data collected on Lake Mead by the cities of Las Vegas, Henderson, North Las Vegas, and the Clark County Water Reclamation District to ensure compliance with the Nevada Division of Environmental Protection (NDEP) National Pollutant Discharge Elimination System (NPDES) permits. Water quality data on the Colorado River was also obtained from the United States Geological Survey (USGS) data repository.
The approach utilized in this study involved conducting a comprehensive review of WQPs monitoring using RS applications as well as an overview of strengths and limitations of the conventional and RS measurement of WQPs of concern namely total dissolved solids (TDS), total suspended solids (TSS) and the implications of these WQPs on the ecosystem.
Statistical analyses were performed to assess the spatiotemporal distributions of TDS and TSS along the Colorado River to understand the variabilities and trends of these parameters. Key WQPs including TDS, TSS, dissolved oxygen (DO), and temperature were also studied in Lake Mead to understand the impact of hydrological variables and lake stratification in the lake on these parameters.
Additionally, the study employed machine learning (ML) models as an effective tool to estimate TDS in Lake Mead using electrical conductivity (EC) and temperature which are simpler, inexpensive, and takes less time. Results produced from the analysis showed that the eXtreme Gradient Boosting (XGBoost) algorithm was the best-performing ensemble model on the external validation datasets with R2 of 0.81 and RMSE of 34.19 mg/L. Another objective includes the employment of Landsat 8 OLI and Sentinel-2 MSI images for the estimation of the concentration of optically inactive TDS in Lake Mead. The Gradient Boosting Machine (GBM) and XGBoost algorithms were found to be generally showing extremely efficient capabilities in estimating TDS across all three datasets (training, testing, and external validation), respectively for the images from Sentinel-2 and Landsat 8 OLI sensors with evaluations on the external validation derived as 0.99, 9.07 mg/L, and 7.06 mg/L, respectively for R2, RMSE, and MAE for TDS estimations with Sentinel-2 images using GBM and with that of Landsat 8 OLI also yielding 0.95, 21.15 mg/L, and 15.89 mg/L, respectively for R2, RMSE, and MAE using the XGBoost algorithm. The use of GBM with the respective optimal features and hyperparameters described in this study was therefore proposed for the future estimation of TDS in Lake Mead using the Sentinel-2 images. This research also evaluated the potential of using remote sensing and machine learning to retrieve the concentrations of optically active TSS and optically inactive TDS in the Colorado River. The analysis found that ensemble algorithms such as the XGBoost were the optimal estimators for TDS using images from both Sentinel-2 MSI and Landsat 8 OLI with performance on the external validation derived as 0.99, 26.52 mg/L, and 19.19 mg/L, respectively for R2, RMSE, and MAE for Sentinel-2 images. The XGBoost algorithm yielded R2, RMSE, and MAE of 0.97, 35.82 mg/L, and 27.90 mg/L, respectively. The AdaBoost algorithm was found to be the best model for TSS estimations on the external validation 0.92, 29.48 mg/L, and 24.64 mg/L, respectively, for R2, RMSE, and MAE for the Sentinel-2 image. The RF model was found to be the optimal model for TSS estimations with the Landsat 8 OLI with reported R2, RMSE, and MAE of 0.90, 32.80 mg/L, and 22.91 mg/L, respectively.
The study therefore proposed the utilization of XGBoost with its hyperparameters and optimal or best features for the estimation of TDS for images from both sensors. The AdaBoost and RF models, together with their hyperparameters and the optimal features, respectively, for the Sentinel-2 MSI and Landsat 8 OLI.
The findings from this study provide key stakeholders, water resources managers, and researchers with perspectives into the spatiotemporal dynamics of WQPs in waterbodies in Lake Mead as well as a cost-effective tool for retrieving water quality data in the basin to improve monitoring efforts of the fast-depleting water resources.
Keywords
Google Earth engine; Machine learning; Remote sensing; Total dissolved solids; Total suspended solids; Water quality parameters
Disciplines
Civil Engineering | Environmental Engineering | Remote Sensing
File Format
File Size
12590 KB
Degree Grantor
University of Nevada, Las Vegas
Language
English
Repository Citation
Adjovu, Godson Ebenezer, "Unraveling Water Quality Issues in the Colorado River Basin: Utilizing Remote Sensing Satellite Images, Statistical, and Machine Learning for Improved Monitoring" (2023). UNLV Theses, Dissertations, Professional Papers, and Capstones. 4865.
http://dx.doi.org/10.34917/37200491
Rights
IN COPYRIGHT. For more information about this rights statement, please visit http://rightsstatements.org/vocab/InC/1.0/
COinS