Abstract
Mobile crowdsensing is a special data collection manner which collects data by smart phones taken by people every day. It is essential to pick suitable workers for different outdoor tasks. Constrained by participants’ locations and their daily travel rules, they are likely to accomplish light outdoor tasks by their way without extra detours. Previous researchers found that people prefer to accomplish a certain number of tasks at a time; thus, we focus on assigning light outdoor tasks to workers by considering two optimization objectives, including (1) maximizing the ratio of light tasks for different workers and (2) maximizing the worker’s satisfaction on assigned tasks. This task allocation problem is a non-deterministic polynomial-time-hard due to two reasons, that is, tasks and workers are many-to-many relationships and workers move from different places to different places. Considering both optimization objectives, we design the global-optimizing task allocation algorithm, which greedily selects the most appropriate participant until either no participant can be chosen or no tasks can be assigned. For the purpose of emulating real scenarios, different scales of maps, tasks, and workers are simulated to evaluate algorithms. Experimental results verify that the proposed global-optimizing method outperforms baselines on both maximization objectives.
Introduction
The smart phones, wearable devices, and mobile social networking techniques have made mobile crowdsensing (MCS) and computing (MCSC) 1 a popular research area in recent years. MCSC leverages cross-space, heterogeneous crowdsourced data for large-scale sensing and computing. Participatory MCS is considered as an effective way to intensively collect professional data for data requester through publishing tasks and recruiting workers. A participatory task is a data collection campaign composed of active sensing actions, requiring a certain degree of human collaboration from workers and their direct involvement, such as going to a place in the city and taking a picture, answering a survey, and tagging a place.2,3 Location-constrained tasks usually require workers to be at the outdoor scene,4–8 but there are not always participants who are at the task’s place at the time of need and would like to accept tasks. Therefore, we have to ask people to go to places specified in tasks and accomplish different data-sampling tasks, which is usually referred to as location-constrained multi-task allocation.
The task provider specifies the 2W2H (including Where, When, How and How many) requirement of the MCS task4,9 on the MCS platform and expects plenty of high-quality data. 2 There are two basic methods for task allocation, that is, push-based task allocation and pull-based task allocation. The former method uses MCS platform to broadcast tasks to participants and some of them will accept to take the tasks. The latter method allows participants to query tasks and select tasks by their own on the MCS task platform. In this article, we marry the advantages of these two methods for the light task allocation. First, participants use the pull-based method to share their current locations and submit a task query (including the destination, her or his expected task number, and the rough duration of MCS working). Then, the MCS platform computes personalized suitable tasks for those participants according to their queries by using proposed task allocation methods in this article and uses the push-based method to publish suitable tasks to participants who are chosen to be workers. The task allocation component of the MCS platform is a black box for both workers and task providers, and reasonable task allocation results will attract more task providers and more participants, which is very important for the healthy MCS system.
There are two challenges to address the location-constrained multi-task allocation problem, including (1) people have different outdoor schedules and tasks must be attractive for them to become workers; (2) workers constantly change their locations during performing different tasks, hence the task allocation process should be based on a dynamic worker set. There are two types of tasks for workers, namely, light tasks and heavy tasks. A light task refers to a task that a worker can perform it without making a detour. Otherwise, it is a heavy task if a worker has to make detour to perform the task. A task can be one worker’s light task but another worker’s heavy task. To address the first challenge, we lower workers’ labor intensity by only allocating light tasks to workers and ensure them expected rewards by assigning multiple tasks expected by them,10–12 which also improve their satisfaction. Although workers’ locations continually change, tasks’ locations are stationary. As a result, to address the second challenge, we utilize the stationary locations of tasks to create the light task relationship matrix for every worker candidate and globally optimize workers’ tasks with this matrix.
In this article, the proposed framework addresses the trade-off between the task allocation ratio and the worker satisfaction, and our main contributions consist of the following:
We analyze the light task allocation problem of MCS and propose two optimization objectives, that is, (1) maximizing the mean satisfaction degree of workers and (2) maximizing the ratio of allocated tasks.
We design the personal light task relationship matrix which is used for two local-optimizing task allocation strategies, namely, (1) worker-first: searching maximum light tasks for each worker and (2) task-first: searching maximum suitable workers for each task.
We propose a global-optimizing task allocation algorithm, and the participant who can take the maximum suitable tasks will be prior chosen to be a worker. In this way, tasks are allocated as many as possible and workers will get their expected number of tasks leading to satisfied experience with task allocation.
We conduct experiments with different sizes of data sets and prove that both the ratio of allocated tasks and the mean satisfaction of workers can be increased by proposed optimal algorithms. Comparing to baseline and local-optimizing task allocation methods, the global optimizing algorithm shows higher evaluation results in both the task allocation ratio and the satisfaction degree of workers.
The remaining of this article is organized as follows. Section “Related work” introduces related work. Section “Problem formulation” formulates the light task allocation problem and defines two optimization objectives. Section “Algorithms” describes four task allocation algorithms, including three local-optimizing allocation algorithms and one global-optimizing task allocation algorithm. Section “Evaluation” presents experiments and discuss the results, followed by conclusions in section “Conclusion.”
Related work
There has been a significant amount of MCS applications in regard to utilizing user-contributed or crowdsourced data. Different domains require different human inputs and different sensor readings, thus hybrid multiple-task management platforms are developed, where different tasks may have different temporal and spatial requirements. A task may be taken by one or multiple workers, depending on the application domain and the task requirements. Similarly, a worker may take multiple tasks. Participants’ mobility is always considered by different location-constrained task allocation methods. In order to economically assign the right tasks to the right participant, some work pay attention to predicting and utilizing worker mobility,13–17 while some works focus on high-performance task allocation methods.15,18,19
Location-based MCS tasks
Location-constrained tasks exist in the physical world, and people are rewarded for their labors to go to a specific place and accomplish the tasks.3,4,15,20 Two kinds of rewards are commonly seen. First, people are morally encouraged after they accomplish tasks, for instance, people upload photos of sidewalk issues through SeeClickFix App (https://seeclickfix.com/, accessed 2 November 2017). Second, people are paid by data consumers. For instance, FlierMeet paid money to participants for their contributed photos of fliers. 3 No matter which incentive mechanism is used, saving the cost of data collection is always considered as a key problem of every MCS application.
A lot of applications need the location information of the data. Different location accuracy are used for data sampling by different MCS applications. According to the location accuracy requirement, a spatial area is usually divided into a lot of grids and the data will be valid if the participant gets them in the task-specified grid, named grid-based sampling.
Some applications require low-accuracy location information, and then, the grid size will be a little larger, for example, 0.01–1 km2. If the grid is very large, then any people in the same grid can be chosen to perform data-sampling task; therefore, these applications can recruit a large number of participants with the low labor cost because most participants can stay where they are and perform task, such as most MCS-based noisy monitoring systems and air quality monitoring systems. 14 The task allocation of these applications will not request high-precision location information of users, for example, global positioning system (GPS) points, but only use the low-precision and low-power-consumption location information, for example, cell tower localization. For a multiple-task allocation system, the participant recruitment is influenced by the grid size. First, if the grid are very large, then a lot of participants do not need to go to different grids and they can perform many different tasks in the same grid or some closed grids. Second, participants are unevenly distributed and a larger grid can easily contain participants. Third, packaged multi-tasks are more attractive for participants than single task. 10 In this article, we adjust the grid size and investigate the consequent effect on the task allocation.
Sometimes, for the purpose of protecting user’s privacy, the task allocation system will not require users’ high-precision locations, so the task allocation server uses the rough location of participants to allocate tasks. Some location-constrained applications must recruit more participants to collect adequate data. In order to lower the cost of recruiting participants, MCS-based applications normally leverage highly effective task allocation algorithms. Task allocation and completion rates are severely affected by limited resources, so Liu et al. 15 investigated two situations, that is, scarce participants and adequate participants, and proposed two optimal task allocation algorithms based on the Minimum Cost Maximum Flow theory. The optimization objective of Liu et al. 15 and Guo et al. 17 is to minimize the total distance that workers go to different tasks’ locations and maximize the number of accomplished tasks.
People are inclined to take tasks that can be easily performed on their ways to somewhere, so several studies leverage human mobility prediction to assign tasks.13–15,21 Wang et al. 13 considered the original task allocation problem as the form of matching discovered mobility patterns with MCS task sequences, with the goal of minimizing the sensing cost. Ji et al. 14 also considered human mobility and proposed a urban MCS framework that maximizes the coverage of collected data in a spatio-temporal space, based on human mobility of participants recruited by a given budget. These works leveraged previous mobility data of participants to predict the following routes of them, which protect participants’ privacy but need to recruit redundant workers to obtain adequate samples due to some failure task assignments. Zhao et al. 6 investigated the task assignment of spatial crowdsourcing under destination-aware task assignment and aimed at finding an optimal assignment of tasks to workers such that the total number of accomplished tasks is maximized. In this article, we also consider human mobility. We require participants’ exact planned routes with either low or high location accuracy and their expected task numbers in order to precisely allocate tasks to the most suitable participants who will get expected numbers of tasks without making a detour.
Location-constrained task allocation
The location constraints of tasks are set by most MCS systems, which guide recent studies to consider the spatial relationships between participants and tasks and maximize the spatial coverage of participants.11,19,22 Reddy et al. 23 proposed a greedy participant selection method to improve coverage of a limited geographic scope. CrowdRecruiter 22 selects the near-minimal set of participants, which meets coverage ratio requirement in each sensing cycle of the task. It predicts the call and coverage probability of each mobile user based on historical records and then computes the joint coverage probability of multiple users as a combined set. No matter what constraints are given, the common objective of some studies is to maximizing the spatial coverage of selected participants.19,22,20 Time-to-live of tasks were also focused in some work, such as minimizing the total time cost of performing tasks through utilizing the social network11,24 and minimizing the delay of performing tasks to ensure the data quality through utilizing the participant’s start position and the task’s valid duration. The task allocation is always related to the incentive strategy and the budget, so research has been undertaken to limit or minimize the budget of participant recruitment. 25 In this article, tasks’ locations are geographic points and their distribution has no regular patterns, so the spatial coverage is actually the ratio of allocated tasks. On one hand, considering lowering the incentive budget, we lower the cost through performing light tasks. On the other hand, considering satisfying participants, tasks are assigned as many as possible to the same participant. Therefore, different from recent work, our proposed methods will satisfy two optimization objectives for location-constrained MCS task allocation.
Problem formulation
This section presents the location-constrained light task allocation problem for MCS on the multiple workers and multiple-task platform.
The multi-task allocation problem
Location-constrained tasks usually require workers to be at the scene, but it is impossible to always recruit workers at the exact venue defined by the task. Therefore, we have to ask workers to go to some specified places to take pictures, and this process is called task assignment.
In order to introduce the problem, two kinds of participants are defined. If a participant shares her location and submits a task query, then the participant becomes a worker candidate (candidate for short). Consequently, if a worker candidate gets works, then the worker candidate becomes a worker.
A worker (or a candidate) is denoted by
Each worker’s expected number of tasks (i.e.
A task is denoted by a three-tuple
Given a candidate set
The general task assignment problem can be formulated to an optimization problem as shown in equation (1), where the extra movement (i.e. detour) is minimized
where
The problem in equation (1) is the multi-task assignment problem and is non-deterministic polynomial-time (NP)-hard. The frequently used notations are shown in Table 1.
Frequently used notations.
GPS: global positioning system.
Fast light task searching method
Searching the shortest route is a NP-hard problem, so the Manhattan distance is used to explain the fast light tasks finding method as follows.
Task-performing route: At the beginning, the worker
Task rectangles: Key points (denoted by
No-detour working area: The area will be called a worker’s no-detour working area if there is always a no-detour travel route shorter than the Manhattan distant from the starting point to the destination. According to the constraint of a light task, we can consider any task covered by the no-detour working area as a light task. According to the definition of the task rectangle, we can find that the no-detouring area consists of all task rectangles.
In this article, we only assign light tasks to worker, so the worker is supposed to be able to always find the shortest route in each task rectangle; therefore, task rectangles of a worker will not overlap to each other. As shown in Figure 1, the dark areas present no-detour working areas (area for short). At beginning, the area is composed of the task rectangle

The different task allocation results and the corresponding no-detour working areas: (a) |Bk| = {}, (b) |Bk| = {t1}, (c) |Bk| = {t2}, (d) |Bk| = {t1, t4}, (e) |Bk| = {t1, t3}, (f) |Bk| = {t2, t3}, (g) |Bk| = {t1, t3, t5}, (h) |Bk| = {t2, t3, t5}, (i) |Bk| = {t1, t4, t5}, (j) |Bk| = {t3}, (k) |Bk| = {t4}, and (l) |Bk| = {t4}.
Optimization objectives of task allocation
Location-constrained light task allocation (light task allocation for short here after) is a specific problem of the multi-task allocation problem. It only assigns each worker light tasks. There are two objectives of light task allocation. On one hand, since using light task allocation will not bring extra distance to workers, we do not need to consider minimizing the distance of trips but focus on how to allocate more tasks. On the other hand, in order to improve workers’ experiences, tasks should be intensively assigned to workers, so the number of assigned tasks should be as much close to a workers’ expected number as possible.
Musthag and Ganesan
26
revealed that a small fraction of agents (<10% of all agents), whom they referred to as super-agents, performed more than 80% of the tasks and earn more than 80% of the total earnings. Thus, super-agents’ experiences are very important. The satisfaction degree of jth worker is reflected by the degree of jth worker’s task number reaching his or her expected task number, denoted by
Given a candidate set
The objective is formulated as equation (4)
Notice that if
Location-constrained light task allocation problem can be considered as a multi-dimensional and dynamic knapsack problem. A worker’s original route can be regarded as the capacity of the bag and tasks are items. Since a worker must go to the task places, items’ weights change. If items in the bag are different, then weights of outside items are different. Therefore, the light task allocation problem is NP-hard. 27 In the next section, we describe three local-optimizing task allocation methods and one global-optimizing task allocation method based on the greedy algorithm to address this problem.
Algorithms
In the case of there are multiple tasks and multiple workers, then two priority strategies can be used, including the worker-first strategy and the task-first strategy. The worker-first strategy will assign tasks to each worker based on the workers’ requests and preferences, while the task-first strategy will select workers for each task based on a task’s requirements. No matter which strategy is used, if only considering one worker or one task during each selection round, the result could be locally optimal. To address multiple workers and multiple tasks scenarios, we propose a global-optimizing method in this section.
Local-optimizing task allocation methods
Worker-first local-optimizing algorithm
Using an enumerative method, we can find the optimal solution. Before introducing the enumerative method, we define the directed no-detouring relationship, dubbed as no-detouring subsequence, between task pairs. Given a worker
Each task can have zero or multiple no-detouring subsequence task. Given the available task set
An instance of the enumerative forest generation is shown in Figure 2. As shown in Figure 2(a), for the kth worker, the initial task rectangle is

The instance of no-touring task allocation: (a) light tasks covered by the initial task rectangle, (b) directed graph, and (c) forest.
If there is a branch whose length is larger than or equal to
Cheng et al. 20 proposed a greedy algorithm, which iteratively assigns workers to spatial tasks that can always achieve high ranks, which is a worker-first algorithm. In this article, given a candidate set, the worker-first local-optimizing (WF-LO) algorithm generates the enumerative forest of one candidate at one time and assign maximum tasks to this candidate who then becomes a worker. Candidates are considered and assigned tasks one by one. Here, the optimal task set is only optimal for one worker. As each task requires limited number of workers, tasks assigned to one worker will influence another’s task assignment.
Native task-first algorithm (TF-N)
The location of a task is fixed, any worker whose no-detour working area covers this task can be chosen for this task, called a valuable candidate. The TF-N algorithm selects suitable workers for one task at a time.
Given a task
TF-LO
Given a task
Because the no-detour working area changes, only one task can select this worker and then refresh the no-detour working area for the next round of worker selection. Therefore, although the task location is stable to simplify the task allocation, either using worker-first strategy or using task-first strategy for multi-task allocation must consider changes of tasks’ worker numbers, workers’ task numbers, and worker’s locations during the task allocation process, which is a very complex problem. In the following, we propose a greedy-based task allocation algorithm with global optimization.
Global-optimizing task allocation methods
Light task packaging
For one worker, we can find the optimal task package with the enumerative method, however, for multiple workers, it is hard to enumerate the optimal task allocation solution. Tasks that can be assigned to the same worker are considered as a task package. As shown in Figure 2(c), according to the enumerative forest and the task number parameter (i.e.
Given
For instance, assume that
There are
The number of theoretical task allocation solutions is
Worker-first globally optimized algorithm
For the purpose of both greedily assigning more tasks and satisfying most workers, we propose worker-first globally optimized (WF-GO) algorithm, which searches the most suitable worker rather than assigning the sequential candidate proper tasks like WF-LO.
WF-GO uses the backtracking to satisfy both MAX-I and MAX-II at the same time. First, all task packages of each worker is created and then sorted in descending order of package sizes (which refers to the task number in the package). Second, every candidate is assessed to determine their selection. In the ith assessment round, the task number limitation of a candidate
Evaluation
Experiment setup
Datasets and simulations
We use the dataset of individual Divvy bike sharing trips to imitate workers’ trips, including the origin, destination, and timestamps for each trip (https://data.cityofchicago.org/browse?q=divvy, accessed12 October 2018); 386,809 trips of 3 months are used to evaluate our methods. The map is divided into
For the same task set and the same worker set, if an area is divided into different numbers of grids, the density of workers and also the density of tasks will be different. The sensing coverage, denoted by
Baselines and metrics
Greedy algorithms are usually used by researches to efficiently allocate tasks.20,27,28 We provide the following baseline methods for comparative studies.
Naive Greedy Allocation—TF-N algorithm. 28 Much work without optimization.
Local-optimizing Greedy Allocation—TF-LO algorithm and WF-LO algorithm.20,27
We use two evaluation metrics, namely, task allocation ratio and satisfaction degree. Given the task set
where
Experimental results
Task allocation ratio
Figures 3 and 4 show the experimental results of the task allocation ratio, from which we can draw the following finding: (1) when the candidate number increases, the task allocation ratio also increases, which is in line with the common knowledge; (2) comparing to WF-LO, the task allocation ratio of using WF-GO increases and the increment shown in Figure 4 even reaches 10% when the worker number is small; (3) when

The experimental result of the task allocation ratio when

The experimental result of the task allocation ratio when
According to experimental results and the above findings, we can conclude that if the candidate number is relatively small, using WF-GO can obtain higher task allocation ratio, and if there are adequate candidates, that is,

The experimental result of the task allocation ratio when g = 50,
We also evaluate the proposed methods for maximizing the satisfaction degrees of workers, which is presented in the following.
Satisfaction degree
Constrained by no-detouring, candidates cannot be chosen to be workers if their routes do not pass any task’s location. Given the same candidate set and the same task set, the smaller worker number means that the task is centrally allocated as well as the task number is closer to the worker’s expectation.
We vary both the worker number and the task number, respectively, for the purpose of simulating situations of either adequate candidates or scarce candidates. Figures 6 and 7 show the experimental results of the satisfaction degree evaluation. As can be seen in Figure 6, when the participant number increases, the satisfaction degree of WF-GO increases. When more candidates are available, through using WF-GO, we globally select the most suitable candidate one by one, thus both the satisfaction degree (see Figure 6) and the task allocation ratio (see Figure 4) are raised. In addition, TF-N, TF-LO, and WF-LO do not consider the whole candidate set and some candidates might nearly never be chosen.

The experimental results of satisfaction degrees when

The experimental results of satisfaction degrees when
When the task number increases and candidates will be in shortage, as shown in Figure 7, every worker has more options of choosing tasks and the satisfaction degree of using WF-GO is still the highest. When the task number increases, candidates easily obtain plenty of light tasks, so the satisfaction degrees of all methods increase. In the case of a lack of candidates, TF-N and TF-LO have a similar result of the satisfaction degree.
As the satisfaction degree is related to the expected task number, we adjust this number and show the experimental result in Figure 8. On one hand, with enough candidates, the allocation ratios of all methods are nearly 100%. On the other hand, if all candidates expect more tasks, the mean satisfaction degree will decrease. As such, the advantage of using WF-GO becomes greater than the others. To conclude, we recruit less workers through using WF-GO which will save the labor cost. In this way, workers will get their expected number of tasks leading to satisfied experience with task allocation.

The satisfaction degree comparison under adjusting the worker’s expected task number when
Conclusion
Task allocation is an important step for the MCS applications with a limited labor budget. Constrained by the spatial and temporal situation of both candidates and tasks, a task allocation method must consider the spatial and temporal relationship between candidates and tasks to maximize the number of allocated tasks as well as minimizing the gap between the number of assigned tasks and the number of required tasks for every worker (i.e. satisfying workers). We define two optimization objectives for this task allocation problem and develop two greedy-based task allocation algorithms to both worker-first and task-first methods. Experiments are conducted and results show that global-optimizing algorithms increase both the task allocation ratio and the worker satisfaction degree. How to minimize workers’ detour distances and lower the total payment will be our future research effort.
Footnotes
Handling Editor: Salvatore Distefano
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was partially supported by the National Natural Science Foundation of China (No. 61972092, No. 61602230, No. 61772428, and No. 61725205).
