Abstract
In Industry 4.0, factory assets and machines are equipped with sensors that collect data for effective condition monitoring. This is a difficult task since it requires the integration and processing of heterogeneous data from different sources, with different temporal resolutions and underlying meanings. Ontologies have emerged as a pertinent method to deal with data integration and to represent manufacturing knowledge in a machine-interpretable way through the construction of semantic models. Ontologies are used to structure knowledge in knowledge bases, which also contain instances and information about these data. Thus, a knowledge base provides a sort of virtual representation of the different elements involved in a manufacturing process. Moreover, the monitoring of industrial processes depends on the dynamic context of their execution. Under these circumstances, the semantic model must provide a way to represent this evolution in order to represent in which situation(s) a resource is in during the execution of its tasks to support decision making.
This paper proposes a semantic framework to address the evolution of knowledge bases for condition monitoring in Industry 4.0. To this end, firstly we propose a semantic model (the COInd4 ontology) for the manufacturing domain that represents the resources and processes that are part of a factory, with special emphasis on the context of these resources and processes. Relevant situations that combine sensor observations with domain knowledge are also represented in the model. Secondly, an approach that uses stream reasoning to detect these situations that lead to potential failures is introduced. This approach enriches data collected from sensors with contextual information using the proposed semantic model. The use of stream reasoning facilitates the integration of data from different data sources, different temporal resolutions as well as the processing of these data in real time. This allows to derive high-level situations from lower-level context and sensor information. Detecting situations can trigger actions to adapt the process behavior, and in turn, this change in behavior can lead to the generation of new contexts leading to new situations. These situations can have different levels of severity, and can be nested in different ways. Dealing with the rich relations among situations requires an efficient approach to organize them. Therefore, we propose a method to build a lattice, ordering those situations depending on the constraints they rely on. This lattice represents a road-map of all the situations that can be reached from a given one, normal or abnormal. This helps in decision support, by allowing the identification of the actions that can be taken to correct the abnormality avoiding in this way the interruption of the manufacturing processes. Finally, an industrial application scenario for the proposed approach is described.
Keywords
Introduction
Industry 4.0 aims to improve the production and associated services through the digitalization and automation of manufacturing processes [52,69]. To ensure high productivity, availability, and efficiency of manufacturing processes, the detection of abnormal conditions of production lines is a crucial issue for manufacturers [30]. In order to tackle this issue, factories rely on condition monitoring. This is the task of monitoring all the equipment involved in a manufacturing process for early detection of abnormal behaviors or situations that could lead to anomalies, which affect its performance, energy-use or reliability. Through the early detection of these situations, proactive decisions can be made to avoid production downtime. These decisions can imply, for example, changing the process parameters to adapt the processes behavior.
An important characteristic for industrial production in Industry 4.0 is that physical items such as sensors, actuators, machines and company assets are connected to each other and to the Internet. More particularly, most modern machines have a large number of sensors installed by the Original Equipment Manufacturer (OEM), as well as additional sensors added by the manufacturing train operator. In this environment, devices and sensors generate an increasing amount of data. Such data is available to operators and engineers through SCADA (Supervisory Control and Data Acquisition), OSI-Pi [1] and other interfaces, and can be used for effective condition monitoring of equipment. In order to detect abnormal situations, the data collected by the sensors must be interpreted. This is a challenging task as it requires the integration and processing of heterogeneous data coming from different sources, with different temporal resolutions and different underlying meanings. Moreover, a key point to consider is that the monitoring of industrial processes should depend not only on their internal state and on user interactions but also on the context of their execution. Making industrial processes context-aware allows to provide added-value information to improve their performance. Context is any information that can be used to characterize the situation of an entity [21]. It is usually considered as a mixture of geospatial data, environmental sensor inputs, service descriptions, among others. In manufacturing, a dynamic context model should not only take into account the context of tools, machines, parts and products, but also the planning of the manufacturing processes, the specification of resources and the control system configuration. Context data is subject to constant change and can be highly heterogeneous.
Semantic Web technologies have proved their efficiency to deal with this data integration issue. The Semantic Web is an extension of the World Wide Web that combines knowledge engineering and AI methods to represent, integrate, and reason upon data and knowledge through ontologies and rules. In computer science, an ontology is considered as
Current solutions for reasoning on ontologies were traditionally developed for static or slow changing data. The highly dynamic nature of data in the industrial domain introduces new issues. To tackle these, a number of recent works propose to unify reasoning and stream processing, giving rise to the research field of stream reasoning. Stream reasoning supports decision systems based on the continuous processing of data streams together with rich background knowledge [64]. It can find application in several domains, for example, in [51] the authors introduce a stream reasoning system for monitoring vessel activity in geographical areas in order to detect different types of dangerous, suspicious and potentially illegal vessel activity. Moreover, it can also be found in applications from smart cities to e-health, from the Internet of Things to social network analytics [12,20].
Under these circumstances, the knowledge base itself must evolve in order to represent in which situation(s) the resource(s) is during the execution of its tasks to support decision-making. The knowledge base can evolve due to different reasons. The first possibility is related to a change in the structure of the model itself,
Situations of interest in the industrial domain depend on sensor data and domain knowledge. Besides the static information about industrial entities, dynamic information, such as the way the industrial processes are executed, has to be described. Therefore, the data collected from the sensors deployed in a factory needs to be explicitly represented to describe the process state. This is the first requirement for the semantic model to be build.
Another requirement is the representation of temporal relations among processes and of spatial relations between entities and locations in the factory. These relations are used to identify different situations of interest. This implies handling information that evolves in time, such as the changes of situation(s) a machine can go through or the changes in the values of a machine parameter according to the different decisions made. Consequently, the semantic model must provide enough expressiveness to capture high level knowledge, such as situations, from time annotated data coming from sensors. More specifically, the requirements that the semantic model must meet are:
Main contributions of this article
These requirements motivate the proposal of a novel semantic framework to address the evolution of knowledge base in Industry 4.0. The proposed framework uses semantic technologies to represent the manufacturing domain with special emphasis on modeling the context of the resources involved in a factory. The proposed framework aims at detecting situations that can lead manufacturing processes in Industry 4.0 to failures and their possible causes. Through the detection of these situations and their causes, appropriate decisions can be made to avoid the interruption of manufacturing processes.
The first contribution is related to the development of the knowledge base; and the other ones are related to the management of the evolution of the knowledge base:
An ontological knowledge model for the manufacturing domain, COInd4, is proposed to provide a declarative, abstract representation of the resources, processes, sensors and the relationships among them. Furthermore, the model is strongly oriented towards modelling the context of the resources involved in a manufacturing process. Relevant situations that combine sensor observations with domain knowledge are also represented in it.
An approach that uses stream reasoning to detect relevant situations that lead to potential failures is introduced. This approach enriches data collected from sensors with contextual information using the proposed semantic model. The use of stream reasoning facilitates the integration of data from different data sources, different temporal resolutions as well as the processing of these data in real time. This allows to identify situations from low-level context and sensor data streams combined with background knowledge. Furthermore, our proposal uses classical reasoning approaches for determining the possible causes that generate the situation when stream reasoning results are not sufficient.
Detected situations can trigger actions to adapt the process behavior, and in turn, this change in behavior can lead to the generation of new contexts leading to new situations. To organize the situations based on the constraints they rely on, the building of a lattice is proposed. This lattice represents a road-map of all the situations that can be reached from a given one, normal or abnormal. This helps in decision support, by allowing the identification of the actions that can be taken to correct the process behavior.
The rest of this paper is structured as follows. Section 2 provides a review of existing ontology-based models and systems developed for condition monitoring. Section 3 presents a framework to deal with the evolution of knowledge bases for condition monitoring in Industry 4.0. The approach includes the use of an ontological model for the representation of the entities involved in a manufacturing process, emphasizing the modeling of the context. Each of the components of the proposed framework are detailed as well as the interactions among them. In Section 4 an industrial application scenario for the proposed approach is studied. By means of it, we verify that (i) the proposed semantic model is generic and extensible to accommodate a wide spectrum of manufacturing processes, and that its modular architecture allows the description of manufacturing resources with different levels of modularity; (ii) the use of stream reasoning together with classical reasoning approaches allows the detection of abnormal situations as well as of their possible causes in a suitable way; and (iii) the exploitation of the lattice helps in decision making, by allowing the identification of the actions that can be taken to correct the abnormal behavior of the process. Finally, Section 5 gives some concluding remarks as well as some perspectives for future work.
Related work
Monitoring is a crucial activity in industry. Any unexpected machine failure can degrade or even interrupt the manufacturing processes of a company. Therefore, it is fundamental to develop a well-implemented and efficient monitoring strategy to prevent unplanned stoppages of production, improve reliability and reduce operating costs. The use of the technologies associated to Industry 4.0 such as Artificial Intelligence (AI), Robotics, the Internet of Things (IoT) and Cyber Physical Systems (CPS), makes condition monitoring more efficient and flexible [13,68]. It employs advanced and online analysis of collected data for the early detection of the occurrence of possible machine failures, and supports technicians during the maintenance interventions by providing decision support. However, data from sensors attached to the machines alone are often not sufficient for condition monitoring; metadata about the asset, its operating environment and external variables that influence the asset’s degradation would also be needed [34]. In other words, changes in the asset’s environmental variables must be taken into account in order to do condition-monitoring effectively. Over the last decades, a considerable amount of research efforts has been undertaken to address the development of different models for industrial condition monitoring, using the key technologies associated to Industry 4.0.
In this section, we first review approaches related for fault detection and diagnosis in condition monitoring systems. These approaches can be categorized into data-driven and knowledge-based approaches. For the latter, the most relevant ontologies for modeling the manufacturing domain are also reviewed. We analyse them and their rule-based extensions that are relevant to condition monitoring highlighting their advantages as well as their drawbacks according to the requirements presented in the introduction of this paper: data integration, temporal representation, efficiency and high-level decision support.
Data-driven approaches to condition monitoring
With the development of big data related techniques and the ever-increasing availability of data, data-driven predictive maintenance is becoming more and more attractive. To extract useful knowledge and help to make appropriate decisions from huge amount of data, machine learning and deep learning techniques have been regarded as powerful solutions. The models obtained by applying those techniques perform like a black box that learn the behavior of physical assets directly from their operation data [32]. They have the ability for feature learning, fault classification and fault prediction. Within a data-driven approach, knowledge about machines are extracted internally from machine operation data, instead of externally from domain experts. In the following, we present works that are representative of the domain.
In [50], the authors present an Artificial Neural Network (ANN)-based fault diagnosis for rolling element bearings using time-domain features. Five vibration signals from sensors are used to extract five time-domain features as the input of the designed ANN. The experimental results show that the accuracy can reach 62.5%-100%. Benkercha et al. [9] propose an approach based on the building of a Decision Tree to detect and diagnose the faults in grid connected photovoltaic system (GCPVS). The features used include temperature, irradiation and power ratio, and the class labels contains free fault, string fault, short circuit fault or line-line fault. Experimental results indicate that the diagnosis accuracy can reach 99.80%. In [60], Soualhi et al. apply the Hilbert-Huang transform (HHT) [28] to extract health indicator from vibration signals and utilized SVM to achieve fault classification of bearings. A method based on the evidential k-NN (EKNN) rule in the framework of evidence theory to achieve a condition monitoring and early warning in power plants is developed and presented in [15].
Several surveys on different data-driven approaches for fault detection and diagnosis exist. Zhao et al. [73] provide an overview of Deep Learning based machine health monitoring systems. All the systems presented are aimed at fault identification and classification. Zhang et al. [72] present more recent results on Intelligent fault diagnosis with small & imbalanced data, which refers to build intelligent diagnosis models using limited machine faulty samples to achieve accurate fault identification. In addition, a series of survey papers focus on the fault diagnosis for a specific type of components, e.g., bearing, rotating machinery, wind turbines [7]. In [71], Zhang et al. systematically summarize the existing literature employing machine learning (ML) and data mining techniques for bearing fault diagnosis. Liu et al. [36] provide a comprehensive review of AI algorithms in rotating machinery fault diagnosis from the perspectives of theories and industrial applications. In the same direction, a more recent review of the role of AI in rotor fault diagnosis is presented in [38]. In particular, it is based on fault-wise analysis, providing a proper categorization of rotor failures, rather than on component-wise analysis.
The aforementioned works have given interesting contributions related to the field of fault detection and diagnosis, but most of them are equipment specific. They monitor the behavior of different properties of a machine. They are suitable and efficient when consuming data streams to detect abnormal patterns in the values of a machine’s property (e.g. temperature, vibration, etc.). However, they have two drawbacks: (i) the need, in advance, for a huge amount of annotated data for model training [17,65], most of the research is conducted in laboratories with equipment in test benches or using one of the few public datasets; and (ii) the lack of explicit model to explain decisions. It is therefore difficult to interpret the data and it also complicates the interoperability and re-usability of the models.
Knowledge-based approaches to condition monitoring
Many traditional condition monitoring systems and fault diagnosis systems make use of a priori expert knowledge and deductive reasoning processes [45]. Expert knowledge is exploited to build an ontology that formally describes concepts and relationships that exist in the industrial domain. Different ontologies have been built for monitoring tasks such as predictive maintenance and health assessment systems [41]. Ontology-based modeling allows: (1) knowledge sharing between computational entities by having a common set of concepts, (2) logic inference by exploiting various existing logic reasoning mechanisms to deduce high-level concepts from raw data, and (3) knowledge reuse by reusing well-defined ontologies of different domains. Ontologies can be employed to represent different machinery systems and devices, and can be combined with various existing rule-based reasoning algorithms to achieve monitoring and fault diagnosis. This is done based upon the evaluation of on-line monitored data according to a set of rules which is pre-determined by expert knowledge. Usually, rules are expressed in the form:
Recently, Gul et al. [27] propose a condition monitoring based approach for risk assessment in a rail transportation system by using a fuzzy rule-based expert system. In [33], Kharlamov et al. design a rule-based language for equipment diagnosis in ontology mediated data integration scenarios such as industrial IoT. Its advantages include the easy formulation of diagnosis programs with hundreds of pieces of complex equipment. Berredjem et al. [10] propose a fuzzy expert system to localize bearing faults diagnosis as well as distributed faults. The fuzzy rules are automatically induced from numerical data using an improved range overlaps method.
A more recent approach in this direction is the one proposed in [14], the authors introduce a novel hybrid approach that combines data mining and semantics to facilitate predictive maintenance tasks in manufacturing processes. This approach uses chronicle mining to predict the future failures of the monitored industrial machinery, and a Manufacturing Predictive Maintenance Ontology (MPMO) with its rule-based extension is used to predict temporal constraints of failures and to represent the predictive results formally. However, this approach does not consider the possibility to represent the evolution of a manufacturing process and the rule base. This makes it difficult for the predictive maintenance system to be able to adapt to dynamic situations over time,
Another rule-based model named Adaptive Neuro-Fuzzy Interference Systems (ANFIS) is applied for monitoring wind turbine SCADA (Supervisory Control and Data Acquisition) signals in [53,54]. In order to obtain turbine condition statements, the authors implement rules given by an expert who is familiar with the behavior of the turbine, typical faults and their root causes. There are two types of rules: generic rules used to highlight anomalies, and specific rules providing specific condition or potential root cause. Schmidt et al. [55] develop a semantic framework, using an ontology-based approach for data aggregation, to support cloud-enabled maintenance of manufacturing assets. Xu et al. [70] propose an ontology-based fault diagnosis model to integrate, share and reuse fault diagnosis knowledge for loaders. A loader is a self-propelled heavy machinery having for main function to push and lift (load) ground pieces.
Ontologies provide a way to integrate, share and reuse of domain knowledge, but other reasoning methods have to be integrated with ontologies to achieve predictive maintenance. Rule-based reasoning can be employed in order to achieve monitoring and diagnosis. These rules are built from expert knowledge, or are extracted from the analysis of large data sets, as mentioned above. However, knowledge-based approaches have difficulties in dealing with new (unknown by experts) faults and acquires complete knowledge to build a reliable set of rules.
A typical manufacturing system can be characterized according to three main notions: Product, Process and Resource [37]. In [43], authors have developed a product ontology, named ONTO-PDM. It provides a semantic layer to business, design and manufacturing product-related information. ONTO-PDM harmonizes the product-related knowledge and standards, and this harmonization has shown positive results in solving the interoperability problems among different enterprises and applications. Other works in this direction are: OntoSTEP (ONTOlogy of Standard for the Exchange of Product model data) [8], which allows the description of product information mainly related to geometry; and MCCO (Manufacturing Core Concepts Ontology) [67] that focuses on interoperability across the production and design domains of product life-cycle. It provides some core classes in categories such as
Like ONTO-PDM, the Process Specification Language (PSL) ontology [26] is another notable development work in the area of product information modelling. The PSL ontology covers several domains such as manufacturing, engineering and business processes. In this section we only focus on the manufacturing domain. Even though it mainly focuses on fundamental concepts for representing manufacturing processes, the ontology also provides a basis for the formal description of elements and entities that constitute a Process. The foundational elements of the core of the PSL ontology are four primitive classes (
Other ontologies have been developed to enhance the performance of product information modeling in the manufacturing domain. As the PSL ontology, these ontologies include the notions of resource and process. Among them, the MASON [35] and the ADACOR [11] ontological models are considered as pertinent. The MASON ontology presents a detailed conceptualization mainly focused on the products because it was used for cost estimation of the production of mechanical parts. MASON is an upper ontology for representing what the authors consider the core concepts of the manufacturing domain: products, processes and resources. As a result, the main classes of MASON are
Besides the ontologies presented above, there exist other ontologies that are more focused on modeling manufacturing processes and resources. The P-PSO (Politecnico di Milano Production Systems) ontology [22] considers three aspects in the manufacturing domain: the physical aspect (the material definition of the system), the technological aspect (the operational view of the system) and the control aspect (the management activities), for information exchange, design, control, simulation and other applications. Thus, its main classes are
The presented ontologies define terms and relations for modeling the manufacturing domain, such as machine, process and the possible relations between them. The ontologies previously described do not allow representing the sensors attached to the company’s resources and the properties they measure, although the sensors are also fundamental for anomaly detection on production lines and reflect the correctness of mechanical system conditions. In addition, these ontologies do not allow the representation of the context of the machines and processes. This is important to determine whether the observations collected by sensors represent abnormal behavior or not.
As mentioned in the introduction, IoT is a key element of Industry 4.0. Nowadays, it is possible to use sensor networks to detect and identify a wide variety of observations, from simple phenomena to complex events and situations. However, the lack of integration between these sensor networks often isolates important or relevant data. To deal with this issue, the Semantic Sensor Web (SSW) approach provides tools that allow the integration of data from multiple data sources [58]. It introduces semantic annotations for describing: (1) the data produced by the sensors, introducing spatial, temporal, or situation/context semantics; and (2) the sensors and the sensor networks that provide such data. Furthermore, there are also works on defining suitable ontologies for data and sensors to enable both the integration of data from multiple sensor networks and external sources, and reasoning on such data. As an example, the W3C Semantic Sensor Network Incubator Group [29] developed an ontology to describe sensors and sensor networks, the Semantic Sensor Network Ontology (SSN). Another work in this direction is SAREF4INMA [19], that pursues interoperability with industry standards. The SSN ontology is further detailed in Section 3.1.
Comparison of manufacturing ontologies
Comparison of manufacturing ontologies
Data-driven approaches, based on annotated data, and knowledge-based approaches, based on expert knowledge, for condition monitoring are discussed in this section. Furthermore, we have reviewed existing ontologies and their rule-based extensions for knowledge-based condition monitoring systems. Table 1 summarizes the domain coverage of the discussed ontologies. We evaluate the domain coverage and scopes of these knowledge models by examining whether the key concepts required for condition monitoring in Industry 4.0 exist and are formally described. We categorized these key concepts into five sub-domains: Manufacturing, Context, Change over time, IoT, and Monitoring and Diagnosis. For the Manufacturing sub-domain, the key concepts are Product, Process and Resource. For the Context sub-domain, the key concepts are Identity, Activity, Time, and Location. In this case, the term Activity, used in the definition of context proposed by Dey et. al. [21], refers to industry-related processes. Therefore, we consider Activity and Process as two related concepts. For the IoT sub-domain, Sensor and Observation are the key concepts. For the Monitoring and Diagnosis sub-domain, the key concepts are Situation and Cause. These concepts are the columns of Table 1, and the ontological models are the rows. If a check mark is placed in the table then the concept exists in the corresponding ontology. Otherwise, a cross mark is assigned.
After reviewing the ontological models mentioned previously, we recognize that none of them provides a satisfactory knowledge representation of the five required sub-domains. Some of these knowledge models focus on a narrow field, such as manufacturing resource planning and work orders, and they do not formalize context-related concepts,
In addition, the ontological models of the smart systems reviewed in this section are rather static. As described in the introduction, machines perform manufacturing processes in different contexts and these contexts change over time. According to the context in which a manufacturing process is executed, the rules that manage the process can change. In order to represent the fact that a machine is performing a process in these contexts, the knowledge base needs to evolve in time to represent this changing knowledge.
A novel knowledge-based framework for condition monitoring
The goal of the proposed semantic framework is to address the evolution of knowledge bases (as explained in the introduction) in Industry 4.0 for performing condition monitoring. The main components of our proposal are shown in Fig. 1: (1) the Monitoring component, which is in charge of detecting abnormal behaviors of the production system; (2) the Diagnosis component, which identifies the possible cause(s) that generate an abnormal behavior; and (3) the Decision making component, which determines the adaptation strategies that are needed to correct abnormal behaviors of the production system.
All the components rely on a formal model to semantically enrich data representation and processing. Each of the components operates as an expert does. They can be seen as smart systems to solve complex problems by reasoning. Each one uses the knowledge base to perform its functions and also makes the knowledge base evolve by adding their own results in it. In this way, the tasks of each component are performed considering the updated knowledge base which is a virtual representation of what happens in the real factory.

The main components of the proposed framework.

The context ontology for Industry 4.0 (COInd4).
The development of a smart system requires that the domain knowledge is represented in a formal way. To achieve this objective, ontologies have been widely used to formalize knowledge about the industrial domain [44,70]. However, as indicated in the related work, most of these existing ontological models focus on a very specific field and sometimes lack a formal representation of context.
Our COInd4 ontology is a foundation of our approach. It represents the elements of the real factory, such as machines, processes and sensors, with special emphasis on modeling the context of operation of these elements. Relevant situations representing abnormal behaviors are also represented in the model. The goal of this semantic model is to represent the concepts and relations in the industrial domain to enable context representation and reasoning.
The ontological model is developed according to the methodology proposed in [66]. An overview of the classes and properties from the proposed ontological model is shown in Fig. 2. This ontological model is composed of six modules. They are described in more detail below together with the definition of their main concepts using Description Logics (DL) [5].
The
The Process module
The
The Sensor module
The Available at:
The
The Time module
The
The SWRL Temporal Ontology2
The
It should be noted, that a situation is actually a general description (concept) about a specific scenario and there may be several instances (individuals) of these situations occurring at different points in time. For example, an abnormal behavior of a particular machine can be represented by a situation. This situation can happen more than once on the same machine or even on another machine of the same type;
The
Integration of all the modules
The ontological modules described above reuse others ontologies that have been developed to address specific needs in different domains and following different approaches. Therefore, it is necessary to semantically integrate them in order to have an ontological model that allows context modelling in the Industry 4.0 domain. The integration among the modules is done mainly through the use of relationships (object properties, equivalence (≡) and subsumption (⊑)) that link concepts from one module with concepts from other modules. In Table 2 we present and defined the main relations linking concepts from different modules.
Relations (object properties) that link the modules of the COInd4 ontology
Relations (object properties) that link the modules of the COInd4 ontology
It is worth highlighting that new knowledge can be inferred or made explicit from these relations, as well as many of these relations can be inferred from prior knowledge. For example, the location of where a sensor is located can be inferred from the location of the resource that hosts that sensor. Another interesting example is that from the observations we can obtain the necessary information to determine the resources involved in a situation as well as the processes concerned by the situation, its location and its time.
Each of the six ontological modules that constitute the COInd4 ontology and its integration, enable to represent situations of interest that take into account the context of the resources. Furthermore, our ontological model provides a way to capture the dynamic changes and the evolution of knowledge in time, such as the different situations a machine can go through. This is a key point to deal with the dynamics of manufacturing processes as mentioned in the introduction of this paper. The proposed model is written in OWL DL, a standard language with formal semantics based on logic [6] and is available.3
Certain situations that may lead to machine failures can be detected by interpreting observations in their contexts, for example, if an observation has an abnormal value it may be because another parameter is having abnormal values too. Let us consider the case where the temperature of a machine and the temperature of one of its components are being monitored. It is known that an increase in the temperature of the machine may be due to an increase in the temperature of its component, or vice versa. This allows the exploitation of expert knowledge about relationships between the values of certain parameters from the machines, from the processes and from their context for interpreting observations. Through the early detection of situations, the maintenance schedule can be adapted or further measures to prevent unexpected downtime can be taken.
The
In the following we explain the
The
The
The
Once the data from the distributed and heterogeneous data sources are available in a homogeneous, contextualized and temporally ordered representation, the streams are explored by the
In Section 4, the case study enables to explain in more detail how each of the modules of the Monitoring component works for data enrichment using context information as well as how the detection of situations defined by expert knowledge is performed.
The Diagnosis component
The
As the monitoring component, the Diagnosis component both uses the knowledge base and modifies it. This component mainly uses the model to determine the cause(s) of a detected abnormal situations. The association between the causes and the abnormal situations are obtained from expert knowledge. Once the possible cause(s) associated to the detected abnormal situation are identified, these cause(s) are linked to the detected situation in the knowledge base. This allows the model to represent which are the causes of an abnormal situation detected from the real manufacturing process.
The Diagnosis component has only one module called
Stream reasoning is more suitable to highly dynamic data than classical reasoning approaches. Thus, in the case where causes need to be determined in real-time, the
Therefore, in order to determine the possible causes of a situation classical reasoning approaches can be used. They provide other inferences that can help in determining the causes of a situation. More complex queries can be performed to the ontological model in order to extract information useful for cause determination. For example, it can be necessary to consider the Open World Assumption, which states that the absence of a statement alone cannot be used to infer that the statement is false. In this case, the
In both cases, the
The Decision Making component
Manufacturing processes are not always executed under optimal conditions. Nevertheless, they can sometimes continue their execution in degraded conditions without being completely stopped. Expert knowledge enables to describe these
In order to support the Decision Making component of our framework, it is relevant to consider which other situations can be reached from the current situation, to choose the most appropriate action. Therefore, we propose an approach to establish an order among the situations. The order among the situations is a hierarchy that depends on how situations’ constraints are correlated. This order represents a road-map of all the situations, normal or abnormal, that can be reached from a given one. In this way, it is possible to identify the actions that can be taken to correct the abnormality, considering that certain actions can modify the value of a property and thus change the state of the system, either by satisfying another constraint or, on the contrary, by not satisfying constraints anymore. The idea driving this proposal is to formally represent a hierarchy among situations leading to failures.
The proposed approach uses the lattice theory [18,24,39] to provide an expressive formalization to order the situations in a taxonomic way. In this way, the hierarchy of situations is formally extracted from the situations definitions. In the following, we firstly introduce the definitions that are necessary for the construction of the hierarchy of situations, and secondly, we describe the steps to automatically build this hierarchy from the situations definitions and its associated lattice.
Definitions
In order to formally represent a hierarchy of situations, let us consider the following structure
It is worth mentioning two aspects about situations that were defined in Section 3.1. The first one is that a situation defines an state of affairs that represents a particular scenario of interest and involves observations linked through spatio-temporal relationships, resources and processes. The second one is that situations are abstract, meaning that there may be different instances of a given situation. Instances of the same situation can happen during different periods of time and involve several resources, but they all satisfy the same constraints.
Set
The binary relation
The sets

Situations with constraints (
The example below is used to illustrate the definition of the structure and the operators. Let us consider the following sets of constraints and of situations, with six constraints and six situations, respectively:
The corresponding
Example of situations and associated constraints (in bold face the constraints that are implied by other constraints)
Some constraints can be more general than others,
Consider the implications among the constraints from the set of all constraints
In order to extend the formalization between a situation and a constraint to a set of situations and a set of constraints, two operators are defined below, based on the use of both
The first operator
For a situation set
Considering situations
The second operator
For a constraint set
This operator retrieves all the situations involving at least all the constraints in the set
Using the elements of the structure
In our approach, we group situations and their constraints as ordered pairs

Calculate all the pairs
In order to find all the pairs and thus the nodes of the lattice, given a set of situations
Considering the example in Fig. 3 and Table 3, if the set of situations

Lattice representing the hierarchy of situations (the situations defined exactly by all the constraints of the second component of the pair in the node are shown in bold face).
Once all the pairs are found, the next step is to order them in a lattice to build the hierarchy of situations. A
Let
The hierarchy of situations has been proved to be a lattice [23]. The fact that the hierarchy is a lattice allows us to know the situations that can be reached from the current situation, knowing also the intermediate situations (this is inherited from the fact that a lattice is a partial order). In addition, let us suppose that two situations are happening simultaneously and these situations are represented in two different nodes of the lattice. Each set of situations from those nodes have a larger common set of situations with the constraints that are common to all situations in both sets of the two nodes. Dually, each set of situations have a smaller common subset of situations, which comprises all the constraints that all the situations in both sets of the nodes have. This allows to know from two situations that are happening, which situations can be reached by the production system: situations that satisfy the smaller number of constraints between the two situations that are happening (
The interpretation and exploitation of the lattice to support the decision making is detailed in the following section.
In order to apply our approach, we have developed a software prototype. The software uses deductive approaches, domain ontologies and ontology reasoning, and stream reasoning to analyze industrial data and to detect abnormal situations that can lead to failures. The system is open source.4
For the development of each component of the proposed framework, several software and tools are used. As already mentioned, the proposed framework is a smart system in which each component can also be seen as a smart system. The proposed framework is implemented in Java 1.8. The main technologies used are C-SPARQL,5
An illustrative case study is described in this section to highlight how the proposed framework can be used and the advantages it offers to support decisions that need to be made when an abnormal situation is detected in an industrial framework.
The case study is based on a manufacturing production line (Fig. 5), named

Production line.
Constraints definition

Representation of the scenario using our semantic model.
Several abnormal situations that lead to failures in this scenario are defined by experts. Each of them is expressed as a set of constraints. We focus on situations covering the following types of failures: machines malfunctions and global malfunctions of the production line including hydraulic oil leakage and cooling system filter obstructions. The abnormal situations are describe in Table 5. The situations indicate different levels of severity. For example, situations
Situations and their concerned constraints (the constraints that are implied by other constraints are in bold face)
The situations defined are detected through C-SPARQL queries. In this section, we describe a particular query and see the effect it has on the ontological model when the situation is detected.
We must emphasize that for this case study all the data streams of the properties (
The C-SPARQL query presented in Listing 1 has the purpose of detecting the situation

C-SPARQL query to detect the S6 situation

Background knowledge and streaming data for S6 situation detection.
Once the situation is detected, it is added as an instance to the ontology as well as the relationships representing the resources concerned by this situation. Figure 8 shows a part (relevant to this case) of the ontological model before and after the detection of the
The next step is to determine the possible cause(s) of the
It is worth mentioning that in case the cause is not known beforehand, it can be added to the ontological model. Thus, the next time the situation is detected the cause is automatically determined.
Once the situation and its possible cause(s) have been detected, the next step is to make a decision to avoid the failure associated with the situation. To determine what action to make, we use the approach proposed in Section 3.4 to build the hierarchy of situations, based on the constraints on which the situations rely. The Fig. 9 shows the hierarchy of situations defined in the case study.

Part of the ontological model: (a) before detecting the S6 situation; and (b) after detecting it.

Hierarchy of the situations defined in the illustrative case study.
Having the situation detected and its possible cause, the Decision making component can use both to build a strategy to correct the abnormal behavior. For example, one action could be to replace the polluted filters which would lead to the constraints associated with the
Our approach provides the Decision making component with high level information such as situations and their possible causes from raw data. In addition, the hierarchy of situations allows visualizing the possible intermediate situations that the production system can reach. In this way, it is sought that the Decision making component takes appropriate actions to maintain the reliability and availability of the production line.
First, we evaluate the COInd4 ontology and then we discuss the limitations of the proposed framework for performing condition monitoring in Industry 4.0. We then discuss some of the limitations of the monitoring and diagnostic components of the proposed framework.
COInd4 ontology evaluation
We evaluate the ontology to check that it does not contain pitfalls and that it covers all the requirements identified in the introduction. In order to detect common mistakes done when developing ontologies we have used OOPS! [48]. In OOPS!, ontology pitfalls are classified into three categories: structural, functional, and usability-profiling. Under each category, fine-grained classification criteria are provided to cope with specific types of mistakes. The ontology is evaluated according to the following three categories:
The evaluation of COInd4 with OOPS! has yield some minor pitfalls, that do not affect the consistency, reasoning or applicability of the ontology.
A first issue mainly regards “missing domain or range” errors inherited from the SSN ontology. The documentation of SSN states that not adding the domain or range to certain properties is a design decision not to be restrictive with them. However, we have decided to add the missing range and domains. Another minor issue is the case of the “recursive definitions”. In our case, it was needed to define several recursive relations, such as “A production equipment can contain another production equipment”. Therefore, we have not considered this pitfall as a mistake. The final evaluation of the COInd4 ontology by OOPS! appears in Fig. 10.

Screenshot of the proposed ontology evaluation results by OOPS!
The proposed ontology is generic and extensible to cover a wide spectrum of manufacturing services. Its modular architecture also allows the description of the capabilities of manufacturing resources at different levels of modularity namely,
The technologies and tools used for the development of the proposed framework are introduced as well as the implementation of the core functionalities of it.
An illustrative case study from the industrial domain is presented to demonstrate the application of the framework. The goal is to show the interaction between the different components and how they modify the knowledge base, making it evolve for the representation of what happens in the real factory. This proof of concept application of our framework with simulated data is encouraging. Unfortunately, it has not been possible to validate the whole approach on real industrial data, even offline.
The main limitation of the proposed approach is that not all situations associated with failures are known in advance. Therefore, if these failures happen they are not detected. It is thus required to consult domain experts for decisions about these undetected failures. The domain experts asses the current state of the production system and provide appropriate decisions that can be further capitalized. In this way, new queries capitalizing the experts’ experience need to be registered to the stream reasoner to update the initial set of queries. Thus, when in the future a similar situation needs to be detected, the updated set of queries will detect that situation. Ideally, it would also be interesting to test our proposal in a real production line. This would allow to verify if the decisions made based on the use of our proposal would improve its efficiency and reliability.
Conclusion
This paper proposes a novel semantic framework to address the evolution of knowledge bases in Industry 4.0. The proposed framework allows to automate and facilitate condition monitoring and diagnosis, and support decision-making in the manufacturing domain. To this end, firstly we propose an ontological model for the manufacturing domain that represents the resources and processes that are part of a factory, with special emphasis on modeling the context of these resources and processes. Relevant situations that combine sensor observations with domain knowledge are also represented in the model. In this way, background knowledge from one manufacturer may be shared with another who may benefit in condition monitoring and decision making for the same type of processes and resources. The proposed framework enriches data collected from sensors with contextual information using the ontological model and uses stream reasoning to allow abnormal situation detection in real-time. Furthermore, it also uses classical reasoning approaches to compensate for possible non-detection of the causes by the stream reasoning method. Through the detection of these situations and their causes, appropriate decisions can be made to avoid the interruption of the monitored process. In addition, to support decision-making the proposed framework provides a hierarchy of situations that represents a road-map of the situations that can be reached from a given one, normal or abnormal. This allows the identification of the actions to take to correct the abnormal behavior, by avoiding or minimizing in this way the interruption of the manufacturing processes.
The contributions presented in this paper enable to identify several research perspectives. Future works can be oriented to the enrichment of the proposed semantic model. New types of relations can be explored aside from the spatial and temporal ones addressed in this work, in order to allow the representation of richer contextual information. For example, representing the relationship that two production lines execute the same process could be used to divert the production from one production line to the other if the first one is behaving abnormally. Another possibility can be the representation of relations between production lines and work orders, so that depending on whether the production line is close to finishing a work order and whether the deadline is coming up, the decision to continue the production until the order is finished or to stop the production line can be made. Another work to be completed in order to enable ontology-based interoperability is the alignment of the semantic model with top-level ontologies since several of the reused ontologies are already aligned with top-level ontologies, such as SSN with DUL.8
Since the manufacturing domain is highly-dynamic, how to process real-time and heterogeneous data streams is a crucial concern. In order to test scalability and complexity issues to detect situations in real-time, a broader case study with more complex situations, involving more properties observations that are temporal and spatially related, has to be explored. Different implementations of stream reasoning engines should be evaluated in terms of efficiency and scalability, considering also the size of the knowledge base.
Another interesting direction to investigate is the identification of causality patterns that could enable the definition of generic queries for certain types of anomalies. These generic queries could be reused in different cases, and perform complex operations among the sensed values.
Regarding the refinement and identification of new situations, it can be interesting to apply machine learning methods to exploit the data stored in the knowledge base. As seen in this paper, the queries are executed during certain time windows every certain intervals of time, therefore the application of machine learning methods can help to determine the duration of the time windows in which a query needs to be executed or to dynamically adapt the duration of the time windows. Another possibility is the extraction of new situations from the application of machine learning methods and their integration in the semantic model. This could give rise to interesting problems such as situations more general or specific than others, contradictory situations and therefore it would be necessary to select which of the situations should remain as part of the model.
There are also perspectives associated to the construction of the hierarchy of situations from real industrial data. Firstly, this construction also raises scalability and complexity issues to build the lattice. Therefore, tests need to be performed with different variations of Algorithm 1 to evaluate their relative efficiency. In addition, another possible future work is to investigate how to add new situations or the refined ones without having to rebuild the hierarchy from scratch.
Another perspective to explore is to create an ontological representation of the hierarchy of situations taking advantage of the fact that the situation R is already in the ontology and the relation T should be added accordingly. In this way, this ontological representation of the hierarchy could be exploited and not only be a visual road-map of the different situations that can be reached by the system.
Finally, a more exhaustive and detailed study can be made on the constraints implications that represent dependencies among constraints for the identification of new situations or the refinement of existing ones. In this way, different hierarchies of situations can be obtained according to the chosen
Footnotes
Acknowledgements
These works are funded by the Normandy Region (France) in the framework of the STEaMINg (Semantic Time Evolving Models for Industry 4.0) project.
