Comparison of Multi Linear Regression, Nonparametric Regression, and Times Series Models for Estimation and Prediction of Evaporation Values

Document Type : Technical Note (5 pages)

Authors

1 Young Researchers and Elites Club, Science and Research Branch, Islamic Azad University, Tehran, Iran

2 Assistant Professor, Department of Water Engineering, Imam Khomeini International University, Qazvin, Iran.

3 Assistant Professor (Respectively), Department of Irrigation and Reclamation, Faculty of Agricultural Engineering and Technology, University of Tehran, Karaj, Iran

4 Professor (Respectively), Department of Irrigation and Reclamation, Faculty of Agricultural Engineering and Technology, University of Tehran, Karaj, Iran.

Abstract

In order to simulate time series, various methods are presented such as times series models (AR, ARMA and ARMAX), multi-linear regression (MLR), and nonparametric regression (K-NN). In this research, performance of these models for estimation of missing values and prediction of future values of evaporation series (from open water) were assessed. ARMAX model with standardized input time series of Tmin, Tmax, Tav, Wind, RH, and sunshine hours, outperformed the other models and the K-NN and MLR were in the next ranks, respectively. Also after the principal component analysis, ARMAX model showed noticeable deviation for estimating missing values and MLR and K-NN in calibration and MLR in validation stage performed the best. For short-term predictions, ARMAX model has the best performance, but MLR performed better in long-term predictions, Time series models were not robust for long term predictions.

Keywords


Araghinejad, S., Burn, D. H. and Karamouz, M. (2006). “Long-lead probabilistic forecasting of streamflow using ocean-atmospheric and hydrological predictors”, Water Resour. Res., 42, W03431, doi:10.1029/2004WR003853.
Beersma, J. J. and Buishand, T. A. (2003). “Multi-site simulation of daily precipitation and temperature conditional on the atmospheric circulation”, Clim Res., 25, pp. 121-133.
Chi, M. and Bruzzone, L. (2005). “An ensemble-driven K-NN approach to ill-posed classification problems”, Pattern Recognition Letters, 27, pp. 301-307.
Davis, G.A. and Nihan, N.L. (1991). ”Nonparametric regression and short-term free way traffic forecasting”, Journal of Transportation Engineering, ASCE, 117(2), pp. 178-188.
Harrold, T. I., Sharma, A. and Sheather, S. J. (2003). “A nonparametric model for stochastic generation of daily rainfall occurrence”, Water Resour Res., 39, pp. 1-11.
Lall, U. and Sharma, A. (1996). “A nearest neighbor bootstrap for time series resampling”, Water Resour Res., 32, pp. 679-693.
Nemes, A., Rawls, W.J. and Pachepsky, Y.A. (2006). “Use of the nonparametric nearest neighbor approach to estimate soil hydraulic properties”, Soil Science Society of America Journal, 70, pp. 327-336.
Rajagopalan, B. and Lall, U. (1999). “A k-nearest-neighbor simulator for daily precipitation and other variables”, Water Resources Research, 35(10), pp. 3089-3101.
Sharma, A. and Lall, U. (1999). “A nonparametric approach for daily rainfall simulation”, Math Comput Simulat, 48, pp. 361-371.
Todini, E. (2000). “Real-time flood forecasting: operational experience and recent advances”, In: Marsalek, J., et al. (Eds.), Flood Issues in Contemporary Water Management, Kluwer Academic Publisher, The Netherlands,.
Wilson, D. R. and Martinez, T. R. (2000). “Reduction techniques for exemplar-based learning algorithms”, Machine Learning, 38(3), pp. 257-286.
Wojcik, R. and Buishand, T. A. (2003). “Simulation of 6-hourly rainfall and temperature by two resampling schemes”, Journal of Hydrology, 273, pp. 69-80.
Wu, W., Xing, E.P., Myers, C., Mian, I.S. and Bissell, M.J. (2005). “Evaluation of normalization methods for CDNA microarray data by K-NN classification”, Bioinformatics, 6, pp. 191-200.
Young, K. C. (1994). “A multivariate chain model for simulating climatic parameters with daily data”, J. Appl. Meteorol., 33(6), pp. 661–671.