Title: Predicting monthly streamflow using data-driven models coupled with data-preprocessing techniques
Authors: Wu, CL
Chau, KW 
Li, YS 
Keywords: Monthly streamflow forecast
Distributed support vector regression
Reconstruction of dynamics
Singular spectrum analysis
False nearest neighbors
Moving average
Artificial neural networks
Hydrological models
Issue Date: 25-Aug-2009
Publisher: American Geophysical Union
Source: Water Resources Research, Aug. 2009, v. 45, W08432
Journal: Water Resources Research 
Abstract: This paper examines the accuracy of monthly streamflow forecasts produced by data-driven modeling techniques applied to streamflow series. A crisp distributed support vector regression (CDSVR) model was proposed for monthly streamflow prediction and compared with four other models: autoregressive moving average (ARMA), K-nearest neighbors (KNN), artificial neural networks (ANNs), and crisp distributed artificial neural networks (CDANN). In the distributed models, CDSVR and CDANN, the fuzzy C-means (FCM) clustering technique first split the flow data into three subsets (low, medium, and high levels) according to the magnitudes of the data, and then three single SVRs (or ANNs) were fitted to the three subsets. The paper gives a detailed analysis of the reconstruction of dynamics, which was used to identify the configuration of all models except ARMA. To improve model performance, the data-preprocessing techniques of singular spectrum analysis (SSA) and/or moving average (MA) were coupled with all five models. The paper also discusses (1) the number of neighbors in KNN; (2) the configuration of ANN; and (3) the effects of MA and SSA. Two streamflow series from different locations in China (Xiangjiaba and Danjiangkou) were used for the forecasting analysis. Forecasts were made at four horizons (1-, 3-, 6-, and 12-month-ahead). The results showed that models fed with preprocessed data performed better than models fed with the original data, and that CDSVR outperformed the other models except at the 6-month-ahead horizon for Danjiangkou. With respect to the streamflow series, SSA had a greater effect on the Danjiangkou data because its raw discharge series was more complex than that of Xiangjiaba. MA considerably improved the performance of ANN, CDANN, and CDSVR by adjusting the correlation between the input components and the model output. It was also found that the performance of CDSVR deteriorated as the forecast horizon increased.
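Illustration (not part of the original record): the abstract mentions moving-average preprocessing, reconstruction of dynamics, and KNN forecasting. The sketch below shows one common way these ideas can be combined. It is a minimal example under stated assumptions, not the authors' implementation: the function names, the trailing window, the unit delay, and the plain averaging of neighbor targets are all choices made here for illustration.

```python
import numpy as np


def moving_average(series, window):
    """Trailing moving average; returns len(series) - window + 1 values."""
    series = np.asarray(series, dtype=float)
    kernel = np.ones(window) / window
    return np.convolve(series, kernel, mode="valid")


def delay_embed(series, dim, tau=1):
    """Time-delay embedding: row i is [x(i), x(i+tau), ..., x(i+(dim-1)*tau)]."""
    series = np.asarray(series, dtype=float)
    n_rows = len(series) - (dim - 1) * tau
    if n_rows <= 0:
        raise ValueError("series too short for this dim and tau")
    return np.column_stack([series[i * tau:i * tau + n_rows] for i in range(dim)])


def knn_forecast(series, dim, k, horizon=1, tau=1):
    """Forecast the value `horizon` steps past the end of the series.

    Assumption for this sketch: the forecast is the unweighted mean of the
    values that followed the k nearest historical delay vectors.
    """
    series = np.asarray(series, dtype=float)
    vectors = delay_embed(series, dim, tau)
    query = vectors[-1]                      # most recent state
    n_candidates = len(vectors) - horizon    # states with a known future value
    if n_candidates < k:
        raise ValueError("not enough history for k neighbors")
    candidates = vectors[:n_candidates]
    first_target = (dim - 1) * tau + horizon
    targets = series[first_target:first_target + n_candidates]
    distances = np.linalg.norm(candidates - query, axis=1)
    nearest = np.argsort(distances)[:k]
    return float(targets[nearest].mean())


if __name__ == "__main__":
    # Synthetic series with a 12-step cycle standing in for monthly flow.
    flow = np.tile(np.arange(12.0), 10)
    smoothed = moving_average(flow, window=3)
    print(knn_forecast(flow, dim=3, k=3, horizon=1))
    print(knn_forecast(smoothed, dim=3, k=3, horizon=1))
```

In this sketch the embedding dimension `dim` and neighbor count `k` are passed in by hand; the abstract indicates that the paper selects the model configuration through its analysis of the reconstruction of dynamics.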
ISSN: 0043-1397
DOI: 10.1029/2007WR006737
Rights: Copyright 2009 American Geophysical Union.
Reproduced/modified by permission of American Geophysical Union.
Appears in Collections: Journal/Magazine Article

Files in This Item:
File: WRR.pdf
Description: Pre-published version
Size: 1.21 MB
Format: Adobe PDF

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.