Abstract
The communication among heterogeneous embedded devices could lead to correctness problems in M2M environment. Sometimes, it is not easy to classify the data because they may provide wrong or uncertain information. The data from these devices should be gathered in a safe, efficient, and right manner without the help of server or human intervention; even the low-level information from each device causes interoperability problems. This data gathering or data fusion process is very important because the data mapping result could be understood as totally different situation and hence cause different reaction, feedback, and controls. In this paper, we propose a hierarchical aggregation for uncertain sensor data using reinforcement learning to get correct and efficient data gathering result for reliable wireless sensor network. In our proposal, we add a new category for uncertain data and classify them through reinforcement learning using hierarchical subcategories. By adopting our proposed aggregation, false classification caused by uncertain data can be decreased and the correctness of data gathering can be enhanced.
1. Introduction
In wireless sensor network, a lot of sensor nodes are deployed in the field; they sense data and send them to the sink node. Sensor network technology has been utilized for military, environment monitoring, healthcare, agriculture, weather conditions, home automation, vehicular automation, and so on. This machine-to-machine (M2M) communication application area will be further extended in various industries. For decreasing communicational and computational overhead of dealing with the sensed data, we need aggregators or clusterheads which can gather the data from each sensor node and then aggregate them before sending the aggregated data to the server system. This process, which is called aggregation or data fusion, is the technique which combines data from multiple sources and gathers that information to make inferences. With this process, we can decrease the amount of redundant data and filter the wrong sensed results, which include false positive and false negative, and can increase the accuracy of the result through classification of the sensed data.
Several sensed data aggregation mechanisms have been proposed considering the efficiency and correctness of the result. However, they have disadvantages when some data are not easy to classify because they are located somewhere between the two groups and show uncertainty as in Figure 1. A linear function can be applied to data as in Figure 1(a). In some cases, we need a nonlinear function as in Figure 1(b) or a delimitation function to detect boundaries for uncertain data as in Figure 1(c).

Data classification problem with data uncertainty.
In this work, we propose an efficient data classification mechanism which can determine the uncertain data correctly based on risk levels of system purpose. Some sensed data stream and environment information can be easily classified in any of the groups while the others are hard to be categorized because of their subtle and slight differences. Our proposed method can assign the uncertain data to proper class through the reinforcement learning process and class hierarchy and relevance information. Automatic detection of uncertain area and organization method is well represented in our previous work [1]. In ubiquitous environment, we need several preprocessing modules, acquisition module and context management module, for service [2]. Based on the previous process, our proposal can be adopted as an aggregation module at the last phase of processing modules for mobile device. With our proposal, sensed data and environment information can be classified in a more accurate and efficient way so that the server system can analyze the status correctly and cope with the situation properly.
This paper is organized as follows. In Section 2, we present related work about general aggregation models and data classification mechanisms. In Section 3, we explain our proposed aggregation method based on hierarchical classification and reinforcement learning. Section 4 describes simulation results and analysis. We conclude the paper and describe our future work in Section 5.
2. Related Work
In ubiquitous environment, the types of information, their complexity, and their similarity have been increased. Even if the similar features and common values were extracted from each data, some of them are hard to classify because they are uncertain to be assigned to any of the groups, such as in Figure 1. Some researches dealing with this problem have been carried out.
Su et al. proposed a hierarchical aggregate classification mechanism in tree structure where each sensor node locally makes cluster analysis and forwards only its decision to the parent node [3]. The decisions are aggregated along the tree, and eventually the global agreement is achieved at the sink node. Kunal and Mohan presented a distributed reinforcement learning (DReL) middleware that provides adaptive WSN management by applying techniques from reinforcement learning and utility theory by exploiting a two-tier learning scheme [4]. Mal-Sarkar et al. proposed a soft computing approach to manage uncertainty by reasoning over inconsistent, incomplete, and fragmentary information using classical rough set and dominance-based rough set theories [5]. Yu et al. presented a comprehensive framework for managing continuously changing data objects with insights into the spatiotemporal uncertainty problem and presented an original parallel-processing solution for managing the uncertainty using the map-reduce platform of cloud computing [6]. Zhang et al. proposed a trust based framework, which is rooted in statistics and some other coupled techniques. The trustworthiness of each individual sensor node is evaluated by using an information theoretic concept, Kullback-Leibler (KL) distance, to identify the compromised nodes through an unsupervised learning algorithm [7]. Ou et al. presented a reinforcement learning approach to multisensor fusion problems with conflicting objectives when the mapping of multiple raw streams of sensory data to the appropriate actions is not easy, especially when multiple conflicting objectives are involved [8]. Savić and Limbourg proposed a reliable aggregation of sensor data for safety related systems applying the Dempster-Shafer theory to combine multiple unreliable and uncertain knowledge sources [9]. Fong et al. presented a stream-based classification to handle continuous data streams, which are unbound and unstructured and simulated on analyzing biological signal such as diagnostic tests in real time [10]. In [11], Hońko proposed a framework for generating classification rules from relational data. The framework was intended for mining relational data and was defined in granular computing theory. Pedrycz and Bargiela designed granular prototypes being reflective of the structure of data to a higher extent than the representation that is provided by their numeric counterparts [12]. The design was formulated as an optimization problem, which is guided by the coverage criterion, meaning that it maximizes the number of data for which their granular realization includes the original data.
3. Hierarchical Classification and Reinforcement Learning for Data Uncertainty
In this work, our goal is to increase reasonability while minimizing predictive error using class hierarchy reinforcement learning.
Figure 2 simply shows elements of reinforcement learning and explains the process of how action influences next states in transition model. Reward

Elements of reinforcement learning.
Most classification systems assume that the level of all classes is flat and each document is labeled by one class. Hierarchically structured classes were examined in some researches where the classes are reorganized into a hierarchical structure for increasing specificity Figure 3. Though a document assigned to a child class is automatically considered as belonging to a parent class, the document is not allowed to belong to more than two classes but on a generalization path in the hierarchy. Hierarchical classification can achieve much better performance than flat classification; however, it is also known as the complex related to class organization. Combining functions for class assignment is also very high. In general, top-down method is known as more superior than bottom-up. However, bottom-up method is more flexible than top-down because we need only to adjust combining functions without entire retraining process to update existing rules.

Comparison of classifications depending on hierarchy: solid line represents the need to apply a classification algorithm.
Figure 4 shows the overview of our proposed classification method based on hierarchical classification.

Aggregation method using hierarchical classification.
The training set construction part of each target category is based on our previous work, and it was proposed for automatic finding of uncertain boundary area
3.1. Category Design and Definition
In Figure 1, decision boundary is not a line but a region, and the data on this boundary region could be predicted as false positive. Thus we separate the training set into target and intermediate categories as in (c). Figure 4 shows the categorization scheme for hierarchical training and assignment rules of reinforcement approach. For hierarchical training, we add category
3.1.1. A Set of Target Categories
A set of target categories,
3.1.2. A Set of Boundary Categories
A set of boundary categories,
3.1.3. A Set of Subclasses
A set of subclasses,
3.2. Reinforcement Learning and Classification for Data Uncertainty
In aggregation module, sensed data are checked if they are authentic or not and then gathered according to environment information representing current status. After being gathered, they are aggregated and finally transmitted to server system. Uncertain information of the entire data can be assigned to the proper categories, and then feedback rules are found for a guideline on the previous stages in reinforcement manner.
Generally, traditional classifier makes a list of candidates in given data. And the list is simply assigned to the first candidate in the list, even though there may be subtle differences among the candidates. But, it is usually prone to make misclassification errors in an area of uncertain data in that case. Figure 4(b) shows how computation is adapted in the assigning process for uncertain data.
In stage 1, to classify the data in low layer category, we used well trained training dataset. They were trained by the reinforcement learning scheme. We can get the first candidate list made by bottom-up hierarchical classification based on learning scheme. These lists are sent to the next stage and analyzed on comparisons using the candidates' scores given by the classifier. In stage 2, we define a simple rule and parameter values as threshold to filter the uncertain data. When the data score satisfies a condition rule, including parameter values, the data are classified in the final target category. When there are two categories, max_support and min_confidence represent the threshold scores to be considered for target categorization.
Time period in learning policy is optimally decided by the experts. Constrained-decision is also set by the expert for training. However, it can be computed dynamically depending on the environment information collected by the clusters as rounds are repeated. In constrained-decision matrix,
Figure 5 shows the learning result of sensed value modification with weighted values applied. To compare the original data stream for learning and the learning result adopting our proposal, we accomplished some computations using basic data. We set the learning time as 5, 10, 15, 20, 25, 30, and 35 and showed the difference between learning results.
We perform reinforcement learning with Schedule S and decision D given by experts. (1) making learning data sets from sensor data streams segmented into different class and time between t and (2) analyzing segmented sensor data to be covered or be diverged by constrained-decision, weight applied w = weight parameter given by expert-constraint by (1) (3) aggregating values in segments (4) computing threshold parameters for setting boundary region by (2), parameter(0) means max_support, upper_limit value in signal level_0., parameter(1) means min_confidence, lower_limit value in signal level_2. these approximate factors, max_support and min_confidence are used in the next stage for hierarchical classification.

The effect of aggregation scheme by reinforcement learning (x-axis: time, y-axis: data values).
In constrained-decision matrix
In aggregation phase, we can determine values of certain data stream based on parameters and aggregate values of uncertain data in intermediate segment Algorithm 2. When the signal level is 2, we can get two parameters,
Hierarchical Aggregation Phase: Goal: send proper aggregated values for certain and uncertain data stream Input: data stream of each cluster i, interval of sensing time τ and parameters [p] Stage 2: Aggregation Process—aggregation of data stream represented as each signal level _i, we can determine values of certain data stream based on the parameters: for if (average ( else } } Stage 3: Transmission process—sending the set of aggregated data, A, to sink and the feedback to Stage 1, learning Process with classification result. The result uses new learning cases for uncertain data.
4. Simulation and Analysis
4.1. Simulation Environment
In this section, we describe our simulation and analyze the result. We used Matlab for our simulation. Figure 6 shows the experiment environment for earthquake sensing. We divided the area into six sections, and a lot of sensor nodes and a clusterhead in each cluster are located in each location. Sensor nodes sense the vibrations of the earth and send the data to each clusterhead, and then the clusterheads aggregate the data before sending the result to server system or base station. We assume that there is an earthquake in time slot 3.

Simulation environment for sensing earthquake.
In the experiment, we used 0.3 and 1.6 as weight values for reinforcement learning phase. Then we get a set of parameters for thresholds in each location. Upper_limit value,
4.2. Simulation Result and Analysis
We have compared our proposal and the other methods about sample data streams. As in Figure 7, our proposal shows a more informative and correct result by showing higher sensing value when there is a shake in location 1 in timeslot 3 and lower value in other timeslots or in different time slots.

Comparison of earthquake sensing results.
It is possible that these data streams can send unclear results and are prone to analytical error. In our proposal, when data in the field delivered to the clusterhead are uncertain, we categorize the data to classify them more correctly based on threshold parameters given by hierarchical categorizing and reinforcement learning. The result can be more accurate when more learning is carried out.
Figure 8 shows the aggregation result from six locations in 100 unit times. Solid line is from aggregated values and the dotted lines are from the values with our proposal adopted. With the values in dotted line, we can see that the aggregated values in solid line with slight differences are getting valid enough to be categorized in different classes.

Simulation result of our aggregation process (unit time = 100).
5. Conclusion and Future Work
In a sensor network composed of many sensor nodes and a server system, intermediate devices between two elements need to communicate with one another to send and aggregate the sensed data before the data are delivered to server system. As the complex applications in specific domain are increased, management for data uncertainty is required, and aggregation needs to be dealt with in the process.
In this work, we have proposed a hierarchical aggregation mechanism for classifying uncertain information from sensor data streams and environment information based on reinforcement learning to get correct and efficient data gathering result for reliable wireless sensor network. Different from traditional aggregation mechanisms, our proposal gives feedback to previous learning phase after sending aggregated information to server system. This feedback makes the data reorganized and retrained and it helps classification of uncertain data through the bottom-up hierarchical classification in reinforcement method. It provides additional advantages of reducing human efforts for identifying complex information.
Our proposal can be adopted in various application systems which may include characteristics of uncertainty, and it can increase the reliability and robustness of the system. Our future work is to simulate the mechanism in multiple vector spaces such as physical sensor stream and context information of time and locations. We will show that our work can improve the efficiency in data processing and management.
Footnotes
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
Acknowledgment
This research was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education, Science, and Technology (no. 2012R1A1A3019459).
