Ensemble learning has been widely used to improve the performance and robustness of machine learning algorithms on time series data. However, in real operational processes where the observed data is limited, it hinders the capability of ensemble learning algorithms. To address the challenge of limited observed data, this paper proposes a novel three-layer ensemble learning framework by use of data augmentation. Firstly, multiple classical time series augmentation methods are applied to increase the size of the data set. Subsequently, after pre-processing, these augmented data is trained by multiple basic learners with K-fold cross-validation as the first layer of the developed ensemble learning framework. The outputs of the first layer are integrated via LASSO to further improve the prediction performance, which serves as the second layer of the developed framework. Finally, the third-layer output is generated by averaging the prediction of the second layer and the output from an improved Long-Short Term Memory model that provides prediction based on the augmented data. A case study on a real wastewater treatment plant is used to illustrate the effectiveness of the proposed method.
|Title of host publication||2022 13th International Conference on Reliability, Maintainability, and Safety (ICRMS)|
|Place of Publication||Piscataway, NJ|
|Number of pages||5|
|Publication status||Published - 15 Nov 2022|
|Name||13th International Conference on Reliability, Maintainability, and Safety: Reliability and Safety of Intelligent Systems, ICRMS 2022|
- machine learning algorithms
- time series analysis
- predictive models
- data models
FingerprintDive into the research topics of 'Data augmentation to improve the performance of ensemble learning for system failure prediction with limited observations'. Together they form a unique fingerprint.
Best Paper Award
Shi, Guo (Recipient), Liu, Bin (Recipient) & Walls, Lesley (Recipient), 24 Aug 2022
Prize: Prize (including medals and awards)