Abstract
This article illustrates the importance of exploratory data analysis as a key step in data validation and in subsequent model development. Descriptive statistics are computed by applying the exploratory data analysis approach to a data set derived from the Simulink library of a benchmark industrial actuator process provided by the Development and Application of Methods for Actuator Diagnosis in Industrial Control Systems (DAMADICS) research group. In this work, the data set is generated synthetically by simulating an actuator model under different fault conditions. Exploratory data analysis is performed as an initial investigation of the data set to reveal anomalies and patterns, so that suitable assumptions can be made and hypotheses tested in order to develop more accurate models. The raw data are visualised using several visualisation techniques to find patterns and behaviours. The data distribution, class distribution, feature correlation and presence of outliers are revealed through data visualisation. Data processing is then performed to transform the data and to treat outliers and missing values in the data set. The treated data set is checked visually using appropriate visualisation techniques. The inferences drawn from the visualisations are validated quantitatively with statistical results to support the exploratory data analysis. Data profiling is carried out by collecting metadata on the DAMADICS data set, which helps to enhance the quality and content of the data set so that accurate predictive models of actuator performance can be obtained.
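The quantitative checks named in the abstract (class distribution, missing-value treatment, outlier detection and feature correlation) can be sketched in a few lines. The following is a minimal illustration using only the Python standard library, not the authors' implementation. The signal names `flow` and `valve`, the fault labels and all values are hypothetical toy data, and the median imputation and the 1.5 × IQR outlier rule are assumed choices that the abstract does not specify.

```python
import math
from collections import Counter
from statistics import mean, median, quantiles


def impute_median(values):
    """Replace missing entries (None) with the median of the observed values."""
    observed = [v for v in values if v is not None]
    fill = median(observed)
    return [fill if v is None else v for v in values]


def iqr_outliers(values, k=1.5):
    """Return indices of values outside [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, _, q3 = quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [i for i, v in enumerate(values) if v < low or v > high]


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


# Hypothetical toy records standing in for simulated actuator signals.
flow = [1.0, 1.1, None, 0.9, 1.2, 9.0, 1.0, 1.1]
valve = [0.50, 0.55, 0.52, 0.45, 0.60, 0.58, 0.50, 0.54]
label = ["normal", "normal", "fault_a", "normal",
         "fault_a", "fault_b", "normal", "normal"]

class_counts = Counter(label)            # class distribution
flow_filled = impute_median(flow)        # missing-value treatment
outlier_idx = iqr_outliers(flow_filled)  # outlier detection
r = pearson(flow_filled, valve)          # feature correlation

print(class_counts)
print(outlier_idx)
print(round(r, 3))
```

In practice each numerical result would be read alongside the corresponding plot (histogram, bar chart of classes, box plot, correlation heat map), so that the visual inference and the statistic confirm one another.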
