Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/1458
PIRA download icon_1.1View/Download Full Text
Title: Predicting monthly streamflow using data-driven models coupled with data-preprocessing techniques
Authors: Wu, CL
Chau, KW 
Li, YS 
Issue Date: 25-Aug-2009
Source: Water Resources Research, Aug. 2009, v. 45, W08432
Abstract: In this paper, the accuracy performance of monthly streamflow forecasts is discussed when using data-driven modeling techniques on the streamflow series. A crisp distributed support vectors regression (CDSVR) model was proposed for monthly streamflow prediction in comparison with four other models: autoregressive moving average (ARMA), K-nearest neighbors (KNN), artificial neural networks (ANNs), and crisp distributed artificial neural networks (CDANN). With respect to distributed models of CDSVR and CDANN, the fuzzy C-means (FCM) clustering technique first split the flow data into three subsets (low, medium, and high levels) according to the magnitudes of the data, and then three single SVRs (or ANNs) were fitted to three subsets. This paper gives a detailed analysis on reconstruction of dynamics that was used to identify the configuration of all models except for ARMA. To improve the model performance, the data-preprocessing techniques of singular spectrum analysis (SSA) and/or moving average (MA) were coupled with all five models. Some discussions were presented (1) on the number of neighbors in KNN; (2) on the configuration of ANN; and (3) on the investigation of effects of MA and SSA. Two streamflow series from different locations in China (Xiangjiaba and Danjiangkou) were applied for the analysis of forecasting. Forecasts were conducted at four different horizons (1-, 3-, 6-, and 12-month-ahead forecasts). The results showed that models fed by preprocessed data performed better than models fed by original data, and CDSVR outperformed other models except for at a 6-month-ahead horizon for Danjiangkou. For the perspective of streamflow series, the SSA exhibited better effects on Danjingkou data because its raw discharge series was more complex than the discharge of Xiangjiaba. The MA considerably improved the performance of ANN, CDANN, and CDSVR by adjusting the correlation relationship between input components and output of models. It was also found that the performance of CDSVR deteriorated with the increase of the forecast horizon.
Keywords: Monthly streamflow forecast
Distributed support vector regression
Reconstruction of dynamics
Singular spectrum analysis
False nearest neighbors
Moving average
Artificial neural networks
Hydrology
Hydrological models
Publisher: American Geophysical Union
Journal: Water Resources Research 
ISSN: 0043-1397
DOI: 10.1029/2007WR006737
Rights: Copyright 2009 American Geophysical Union.
Reproduced/modified by permission of American Geophysical Union.
Appears in Collections:Journal/Magazine Article

Files in This Item:
File Description SizeFormat 
WRR.pdfPre-published version1.21 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page views

201
Last Week
3
Last month
Citations as of Apr 21, 2024

Downloads

590
Citations as of Apr 21, 2024

SCOPUSTM   
Citations

375
Last Week
0
Last month
5
Citations as of Apr 26, 2024

WEB OF SCIENCETM
Citations

350
Last Week
0
Last month
4
Citations as of Apr 25, 2024

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.