Abstract
The mobile crowdsourcing technology has been widely researched and applied with the wide popularity of smartphones in recent years. In the applications, the smartphone and its user act as a whole, which called as the composite node in this article. Since smartphone is usually under the operation of its user, the user’s participation cannot be excluded out the applications. But there are a few works noticed that humans and their smartphones depend on each other. In this article, we first present the relation between the smartphone and its user as the conditional decision and sensing. Under this relation, the composite node performs the sensing decision of the smartphone which based on its user’s decision. Then, this article studies the performance of the composite sensing process under the scenario which composes of an application server, some objects, and users. In the progress of the composite sensing, users report their sensing results to the server. Then, the server returns rewards to some users to maximize the overall reward. Under this scenario, this article maps the composite sensing process as the partially observable Markov decision process, and designs a composite sensing solution for the process to maximize the overall reward. The solution includes optimal and myopic policies. Besides, we provide necessary theoretical analysis, which ensures the optimality of the optimal algorithm. In the end, we conduct some experiments to evaluate the performance of our two policies in terms of the average quality, the sensing ratio, the success report ratio, and the approximate ratio. In addition, the delay and the progress proportion of optimal policy are analyzed. In all, the experiments show that both policies we provide are obviously superior to the random policy.
Introduction
With the proliferation of personal smart devices, such as smartphone, human is able to capture information/event from the physical world with smartphones more easily than before.1–4 Embedded with a rich set of sensors, the current smartphone can support increasing applications across a wide variety of domains, such as crowdsensing,1,5–7 environmental monitoring,
8
and social networks.
9
These applications can be classified into two major classes: participatory sensing (user is directly involved) and the opportunistic sensing (user is not involved).5,10,11 In the participatory sensing, user can act as the preliminary sensor and decision-maker before his or her smartphone implements a certain sensing task. For example, users make decisions whether to take part in an application, and then operate his or her smartphone to implement the application.12–14 Most of the previous works on crowdsensing take the smartphone into consideration, only a small part works suggest that crowdsensing should also include user as the sensor instead of just sensor carrier and operator.15–17 For example, Wang et al.
16
took human as sensor and studied their behavior’s affecting the sensing data quality. But there are a few articles noticed that humans and their smartphones depend on each other. There are two questions should be focused on the relationship with humans and their smartphones. The first is how to describe the relation between the smartphone and its user during smartphone sensing. The second is how two improve the performance of the smartphone sensing by exploiting the relation. As we all know, human has more powerful ability of recognition than the smart device and plays a key role before the process of smartphone sensing. In this article, we propose a framework to clarify the relation, and then study the performance improvement of the crowdsensing under a scenario, where users are willing to have good experience to take part in the crowdsensing. Since smartphones are under control of its user, its sensing decision is made after its user’s willingness. We design the framework as conditional sensing as shown in Figure 1, where each user takes the action “sleeping” if he or she is not willing to taking part in the smartphone sensing. The scenario studied in this article represents a class of common applications in the participatory sensing, where some users are asked to implement a certain task, such as to detect the interesting object/event around them. We further investigate the case where the users have limited cost to implement the task, and hope a certain success implementation probability, denoted by

Compound node.
Summary of key contributions
The key contributions of this article are listed as follows:
This article studies the relationship between human and smartphone during the smartphone sensing, and proposes the framework: composite sensing.
We study the scenario of the object detection, and formulate the composite sensing problem, that is, how to improve the user experience under the framework of composite sensing as the partially observable Markov decision process (POMDP). We also design a new scheme, called composite sensing policy, to solve the composite sensing problem and get the maximal overall sensing quality.
We provide the theoretical and experimental analysis for the composite sensing policy. The theoretical optimization of the policy is guaranteed while the experimental results certificate the performance of the optimal and myopic policies we proposed.
Road map
This article is organized as follows. The related works are reviewed in section “Related work.” Section “Preliminaries” presents the composite sensing and system models. We formulate the composite sensing problem and map it as the POMDP in section “Composite sensing problem.” The composite sensing policy for the problem is designed and the theoretical performance of the policy is presented in section “Composite sensing policy.” The performance of our solution is also evaluated by the experiment in section “Experiment results.” The work of the whole article is summarized in section “Conclusion.”
Related work
Today’s smartphone is embedded in a number of specialized sensors, including camera, global positioning system (GPS), digital compass, and so on. It can sense the environmental information, and share the information with the friend of the smartphone holder or report to a certain server. 13 It has become not only the core communication device in people’s daily life but also a smart sensing device for environmental monitoring, smart transportation systems, social networks, and so on. 10 Its applications are thus widely exploited and are extended to many more areas than before. According to the awareness and involvement of the user in the architecture as sensing device custodians, the smartphone applications can be classified into two major classes: participatory sensing (user is directly involved) and the opportunistic sensing (user is not involved). 10 The participatory sensing includes both the smartphone and its holder into the significant decision stages in the sensing application. One type of relation between the smartphone and its holder is the composite sensing proposed in this article.
Participatory sensing
A wide range of environmental information, such as road traffic, can be sensed and disseminated by ordinary citizens with smartphones. It brings a new way for the development of many application areas, such as environmental monitoring and social networks. The interesting examples include road traffic monitoring, 18 SmartPhoto, 17 and Ear-phone. 14 Rana et al. 14 designed an end-to-end participatory urban noise mapping system called Ear-phone. The key idea of Ear-phone is to crowdsource the collection of urban noise to people, who carry smartphones equipped with sensors and location-providing GPS receivers. In the end-to-end system, the urban noise is sent to a central server. A noise map is reconstructed and then is provided to the end user. In VTrack, some participatory drivers with smartphone send its location estimated by WI-FI or GPS to a central server in real time, and the server provides the real-time routes with the minimal travel time to users. 19 Mohan et al. 18 have presented TrafficSense to monitor road and traffic conditions in a setting where there are much more complex varied road conditions (e.g. potholed roads), chaotic traffic (e.g. a lot of braking and honking), and a heterogeneous mix of vehicles (two wheelers, three wheelers, cars, buses, etc.). Wang et al. 17 proposed a framework, called SmartPhoto, to quantify the quality (utility) of crowdsourced photos based on the accessible geographical and geometrical information (called metadata), including the smartphone orientation, position, and all related parameters of the built-in camera. The sensed photos are sent to a server by the participators and different rewards are feedback to them because the smartphone orientation and position cause the different sensing qualities. There are increasingly new applications appearing, such as CrowdAtlas, for generating a high quality map by crowdsourcing. 20 For more details on smartphone sensing, we refer interested readers to the survey articles.2,10 From the observation from the related works on smartphone applications, we can find the following features: (1) sensing result report: many smartphone applications require the participators to report their sensed information to central servers; and (2) human acts sensor: in the smartphone applications with the participatory sensing, human is a key part of the systems in these applications, and makes key stages of the decision to sense the environmental information. Not all users are willing to be participators and not all of their sensing results have equal value because the smartphone types and sensing conditions may be different.13,16
Human as sensor
Human’s decision is the necessary part of the smartphone applications with the participatory sensing, and has great affection on the sensing result. For example, SmartPhoto needs humans to observe the Event of Interesting (EoI) and then take pictures. 17 Most of the current smartphone sensing applications are based on voluntary participation.13,21 In these applications, 13 humans estimate the incentive reward at first, and then operate their smartphone to participate if satisfied or they observe the EoI at first, and then decide to collect and report the information about the EoI if it is observed and satisfies requirement.14,18 Zhao et al. have showed that mobile crowdsourced sensing (MCS) is a new paradigm that takes advantage of pervasive smartphones to efficiently collect data, enabling numerous novel applications. They proposed incentive mechanisms which are necessary to attract more user participation to achieve good service quality for an MCS application. 21 ND Lane et al. have surveyed some existing mobile phone sensing algorithms, applications, and systems. They also discussed the emerging sensing paradigms, and formulated an architectural framework for discussing a number of the open issues and challenges emerging in the new area of mobile phone sensing research. 2 The smartphones’ decisions base on their users’ observation and decision. It is an underlying phenomenon in the applications of smartphone sensing. Wang et al. 16 used humans as sensors, and studied their decisions affecting the sensing data quality. Although human makes a key decision in the smartphone applications with participatory sensing, most of the previous works make simply an assumption on human’s decision or ignore the humans’ decision. Furthermore, the participator’s decision and its relationship with its smartphone are fairly considered and researched.
Preliminaries
Object, observing, and sensing model
This article concerns a set

System model: (a) object model, (b) observing model, and (c) sensing model.
Each user’s observing range is modeled as a disk as shown in Figure 2(b) and the smartphone’s sensing range is modeled as a fan-shaped sensing area in Figure 2(c). They have the same radius since the user would not notice the object out of the observing range. The smartphone can fix a direction to sense one of the objects in its sensing range as shown in Figure 2(c). Let the object ID denotes the direction that the node chooses. The example in Figure 3 shows that the node has the directions as many as the number of the objects.

Each node has four directions to choose: (a)
Conditional sensing
In the crowdsourcing applications with the participatory sensing, the smartphone must be under the control of its user. Each user acts the preliminary sensor, and implements the composite operation with his or her smartphone as a whole. We call such a whole as a composite node (node in brief) as shown in Figure 1. In each node, the user can make observing decision

Composite detection.
In this following context, we present the composite sensing from the view of an arbitrary node. The objects refer to these in the sensing range of the node.
System model
This article studies the scenario where the nodes and objects are static and uniformly randomly deployed in the interested area. With an additional server, these nodes and objects compose the composite sensing system. In each time slot, each object
Definition 1
Object state
The object state indicates the appearance of an object
The design of the optimal observing and sensing decision uses the definition of the object state. When an object is in the state: disappear, that is,
Definition 2
System state
The system state is the collection of the states of the
Given a sequence of time slots
where
Symbol and meaning.
Composite sensing problem
This section presents the composite sensing process with the goal to maximize the overall sensing quality, and then maps it as a POMDP.
Compound sensing system
The structure of the composite sensing system, illustrated in Figure 5, implements the crowdsourcing task, which is implemented including four parts: task broadcast, composite sensing process, report, and reward.

The composite sensing system.
Task broadcast
The application server broadcasts some advertisements to the users and to attract them to participate in the task: to sense the objects in their sensing ranges. After the node accepts the task, it implements the composite sensing process to maximize the reward returned from the server.
Composite sensing process
Each node implements the composite sensing process, which is composed of conditional decisions made in a series of time slots. In each time slot, the observing decision
Observing decision
At the beginning of each time slot
where
Sensing decision
After the observing decision for the object
where the conditional probability
Report
After the sensing decision is made to achieve the sensing result
where the last equality is obtained by equations (3) and (4)
Reward
The reward, denoted by
Notice that the node chooses only one object to sense each time if its sensing decision
Convert to POMDP
The composite sensing process can be mapped as the POMDP. In the process, the node observes only a part of the objects around it, and the report result cannot be directly known after it reports the sensing result to the server. The system states thus cannot be fully observable. In the following, this article formulates the composite sensing process as the POMDP by a tuple
Belief vector
In the composite sensing process, the node makes the decision according to the historical information
This article adopts a reward-based updating function
When the user takes the observing decision
where the denominator is a normalizing constant and is given by the sum of the numerator overall values of
where
Objective
The composite sensing policy is a sequence of decision couples:
where
Composite sensing policy
Some previous works, such as the one-pass algorithm, 23 can carry out the sequence of the optimal decision. The computation complexity required to obtain the optimal decision increases exponentially with the size of the state space, and can be very high for the general POMDP. 25 One of the alternative methods for addressing this problem is to design the myopic policy. 25 Myopic policy focuses on the immediate reward and ignores the impact of current policy on future rewards. Generally, the myopic policy is suboptimal. In this section, we explore some specific properties of the composite sensing system: monotonicity and the independence between the action and object states. With these properties, the computation for the optimal policy given in this section can be simplified.
Value function
The key step of making the composite decision is to measure how good the previous decision is. Value function can express the objective in equation (11) explicitly as functions of the belief vector
where the first term in the right of the equation denotes the expected immediate reward
Optimal composite sensing policy
This section analyzes the properties of the composite sensing process, which includes: (1) monotonicity of value function and (2) monotonicity of success report probability. With these properties, we can obtain an explicit optimal design for the composite sensing process and a deterministic optimal sensing policy in Lemma 2, and observing policy in Lemma 3.
Lemma 1
Monotonicity of success report probability
Given the sensing decision
The proof of Lemma 1 is referred to Appendix 1.
Theorem 1
Monotonicity of value function
The value function
Recall that the object of the composite sensing process is to maximize the overall reward under the constraint of the successful sensing probability as given in equation (11). If there is no constraint, the node would always make the composite sensing to wake up in each time slot so as to maximize the overall outcome. With the constraint given in equation (12), the composite sensing must be decided carefully. Since the successful report possibility increases monotonically with the aspect
According to equation (6), the successful sensing probability is affected by both the observing and sensing decisions. By Lemma 1, the sensing decision
Lemma 2
Optimal sensing decision
Given the observing outcome
where the threshold aspect
The next is to design the optimal observing decision, which chooses the best object to observe in each time slot since there are
The above two probabilities can be calculated and updated from the transition probability of the system states given in equation (9) or (10)
Because one object’s states are independent of others’, the probability that the object
We have the following lemma to determine the optimal observing decision.
Lemma 3
Optimal observing decision
Suppose that there are
Proof
According to the definition of the group state
Next, we prove by induction that the value of the observing decision given in Lemma 3 is maximized. According to the system model in section “System model,” the object states have the equal transition probability among its states. The transition probability does not change with time. When the observing decision makes the node to sleep, that is,
Optimality of myopic policy
A myopic policy does not consider the impact of the current action on the future or long-term reward, and focuses solely on maximizing the expected immediate reward. It is usually suboptimal for the general POMDP. The myopic policy need not estimate the future reward so that the computation complexity can be reduced. In this article, the myopic policy only cares the impact on the next time slot so we modify the value function as the following equation
The description of the myopic policy is quite similar to the optimal one except that equation (13) in step 5 of Algorithm 1 is replaced by equation (19) .
Experiment results
In this section, we conduct numerical and simulation to verify the performance of our optimal and myopic policy by comparing it with a randomized algorithm, which is just to select some objects in each round randomly. We numerically analyze the impact of various parameters such as the average quality, sensing ratio, success report ratio, and the algorithm approximate ratio under proposed algorithms in terms of the number of iterations and different thresholds. Besides, we give the progress proportion and delay analysis of the optimal policy.
Evaluation setup
To better validate the performance of our proposed algorithms, we build a test bed and conduct field experiments. Our evaluation field is divided into three disks according to composite node

Convergence of the optimal, myopic, and random policies: (a) average quality, (b) sensing ratio, (c) success report ratio, and (d) approximate ratio.

Performance under different thresholds with fixed 1500 iterations: (a) average quality, (b) sensing ratio, (c) success report ratio, and (d) approximate ratio.
Performance comparison
Average quality
Figure 6(a) shows the average quality obtained by the optimal, myopic, and randomized policies, respectively, under the different number of iterations and fixed threshold value
As shown in Figure 7(a), we evaluate the average qualities obtained by the optimal and myopic policies compared with the random policy when we set various thresholds and keep the fixed
Sensing ratio
As mentioned in equation (12), when the success report probability is less than the threshold
As shown in Figure 7(b), we evaluate the sensing ratio obtained by the optimal and myopic policies compared to the random policy when we set various threshold values and keep the fixed number of iterations, that is,
Success report ratio
As mentioned in equation (2), the server only accepts the maximal sensing quality for a same object among the nodes’ reporting for it. Therefore, the success report ratio is also one of the criteria to evaluate how good a strategy is. Figure 6(c) counts the ratio of the number of success reports to that of observing actions with the number of iterations increasing from
As shown in Figure 7(c), we evaluate the success report ratio obtained by the optimal and myopic policies compared to the random policy when we set various threshold values and keep the fixed
Approximate ratio
The approximation ratio can measure the performance difference of our policies. It reflects the performance of the optimal, myopic, and randomized policies clearly. Again, we set the threshold value of
As shown in Figure 7(d), we evaluate the approximation ratio among the optimal, myopic, and randomized policies when we set various thresholds and keep the fixed number of iterations, that is,
Delay and progress proportion
To review the complex perceptual system in Figure 5, the server goes through five steps from the start of broadcasting to feedback rewards to the object. In this experiment, we use delay to represent the time from the beginning of the broadcast to the end of the feedback. As shown in Figure 8, we observe that the delay of the optimal policy increases significantly as the number of objects increases. In addition, after several hundred iterations, the delay of the optimal policy is basically stable. In this experiment, it is assumed that we need the optimal strategy to complete the calculation of 1500 iterations. The progress proportion represents the percentage of the number of completed iterations to the total 1500 iterations under a particular timestamp. As shown in Figure 9, we observe that with the increase in the number of objects, the time to complete the fixed 1500 iterations of the optimal policy is significantly extended.

Delay of the optimal policy with 1500 iterations.

Progress proportion of the optimal policy.
The main trends in the results are summarized as follows:
The average quality, sensing ratio, success report ratio, and other indicators obtained by the optimal policy and myopic policy tend to be stable.
Compared with the myopic policy, some indicators of the optimal policy reached the stability earlier.
The effect of threshold setting on myopic policy and random policy is much greater than that of optimal policy.
With the increase in objects’ number in the experimental scene, the delay increases significantly and the progress proportion slows down significantly.
Conclusion
This article observed the phenomenon of composite sensing with user as sensor in crowdsourcing. The phenomenon usually happens and has not been well studied. We thus proposed the framework: composite sensing, and then map it as a POMDP problem. The composite sensing policy is proposed and analyzed theoretically and experimentally. The theoretical optimization of the policy is guaranteed. In this article, we discuss the case where the smartphone can choose one direction to sense in each time slot. We take another case as a future work, where the smartphone may choose one or more directions to sense in each time slot. Compared with traditional methods, the use of this method in large-scale environmental data has yet to be verified and optimized.
Footnotes
Appendix 1
Handling Editor: Pascal Lorenz
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the General Programs of the National Natural Science Foundation of China (grant nos 61473109, 61572164, and 61603119), the General Research Foundation of the Education Department of Zhejiang Province (grant no. Y201840731), the Zhejiang Major Science and Technology Program (grant no. 2018C04012), and the Graduate Scientific Research Foundation of Hangzhou Dianzi University (grant no. CXJJ2018053).
