Publications

Finding Diversity for Building One-day Ahead Hydrological Ensemble Prediction System based on Artificial Neural Network Stacks

Darwin Brochero, François Anctil, Christian Gagné and Karol Lina Lopez
En savoir plus...
Abstract - In this study, we addressed the application of Artificial Neural Networks (ANN) in the context of Hydrological Ensemble Prediction Systems (HEPS). Such systems have become popular in the past years as a tool to include the forecast uncertainty in the decision making process. HEPS considers fundamentally the uncertainty cascade model [4] for uncertainty representation. Analogously, the machine learning community has proposed models of multiple classifier systems that take into account the variability in datasets, input space, model structures, and parametric configuration [3]. This approach is based primarily on the well-known 'no free lunch theorem' [1].
Consequently, we propose a framework based on two separate but complementary topics: data stratification and input variable selection (IVS). Thus, we promote an ANN prediction stack in which each predictor is trained based on input spaces defined by the IVS application on different stratified sub-samples. All this, added to the inherent variability of classical ANN optimization, leads us to our ultimate goal: diversity in the prediction, defined as the complementarity of the individual predictors.
The stratification application on the 12 basins used in this study, which originate from the second and third workshop of the MOPEX project [2], shows that the informativeness of the data is far more important than the quantity used for ANN training. Additionally, the input space variability leads to ANN stacks that outperform an ANN stack model trained with 100% of the available information but with a random selection of dataset used in the early stopping method (scenario R100P).
The results show that from a deterministic view, the main advantage focuses on the efficient selection of the training information, which is an equally important concept for the calibration of conceptual hydrological models. On the other hand, the diversity achieved is reflected in a substantial improvement in the scores that define the probabilistic quality of the HEPS. Except one basin that shows an atypical behaviour, and two other basins that represent the difficulty of prediction in semiarid areas, the average gain obtained with the new scheme relative to the R100P scenario is around 8%, 134%, 72%, and 69% for the mean CRPS, the mean ignorance score, the MSE evaluated on the reliability diagram, and the delta ratio respectively. Note that in all cases, the CRPS is less than the MAE, which indicates that the ensemble of neural networks performs better when taken as a whole than when aggregated in a single averaged predictor.
Finally, we consider appropriate to complement the proposed methodology in two fronts: one deterministic, in which prediction could come from a Bayesian combination, and the second probabilistic, in which scores optimization could be based on an 'overproduce and select' process. Also, in the case of the basins in semiarid areas, the results found by Vos [5] with echo state networks using the same database analysed in this study, leads us to consider the need to include various structures in the ANN stack.

Bibtex:

@inproceedings{Brochero978,
author    = { Darwin Brochero and François Anctil and Christian Gagné and Karol Lina Lopez },
title     = { Finding Diversity for Building One-day Ahead Hydrological Ensemble Prediction System based on Artificial Neural Network Stacks },
booktitle = { European Geosciences Union (EGU), Geophysical Research Abstract },
volume    = { 15 },
year      = { 2013 },
month     = { April 7-12 },
web       = { http://meetingorganizer.copernicus.org/EGU2013/EGU2013-12999.pdf }
}

Dernière modification: 2013/05/22 par cgagne

		Accueil \| A propos \| Personnes \| Recherche \| Publications \| Événements \| Profil
		©2002-. Laboratoire de Vision et Systèmes Numériques. Tous droits réservés