Abstract
Model complex production business contains the attributes of completion time, accuracy, and so on; these factors influence each other; and pursuing the balance of these factors is the core idea of the article. Due to the problems of low accuracy or more completion time which may be caused by the traditional algorithm, the research group proposed the serial reduction algorithm which can achieve higher accuracy under deadline for the business. The algorithm sets the active portion for each task and uses the inverse iterative way to obtain a path that can balance both the completion time and the accuracy. By the case analysis, compared with the traditional unidirectional objective algorithm, serial reduction algorithm can optimize the accuracy under the deadline. Finally, the article analyses the parameters that can affect the algorithm performance; the experiment shows that serial reduction algorithm applied to the workflow is feasible and effective, which has the simple advantage.
Introduction
Modern business workflow technology is based on network flow which uses service as the basic elements to framework and uses collaborative services to complete the business. Business process is abstracted by workflow system which divides the process into many steps; combined with the service attributes, finally, the system determines the best completion path. In the workflow system, selecting the service in each task is extremely important because it determines the efficiency and completion quality of the entire business. Selecting a reasonable service in the task plays an important role on the effective implementation of the entire business process.1,2
To balance completion time and accuracy is the goal of modern workflow optimization. 3 The project process as a whole is divided into several steps, usually since the problems exist in one or several steps, such as pursuing the efficiency but neglecting the quality of service or pursuing the quality of service but delaying the completion time, which make the completion time and the completion quality of the entire project imbalanced.4,5 Therefore, pursuing efficiency unilaterally will result in low accuracy rate; on the contrary, pursuing service accuracy unilaterally will affect the completion time of complex products. To solve this problem, this article provides a high-quality algorithm that can be applied to ensure the workflow of complex products in time as well as the highest quality, which implements serial reduction optimization of completion path quality. Comparing with other algorithms that aim to minimize the completion time or maximize the completion accuracy, serial reduction algorithm makes a balance between both of them. Serial reduction algorithm maps the whole process task and its relevant services to a directed acyclic graph (DAG). Passing the hierarchical strategy, it divides activity section to each task, gets the most accuracy rate of each task iteratively when the task starts at different times, and gets the optimization path by the previous calculation process finally.
Relevant works
The workflow model contains tasks and services; the services carry the attributes of time, cost, and quality in the workflow model; a reasonable link among the services plays a key role to optimize the operation of the task. With the continuous expansion of the workflow structure, optimizing the workflow no longer regards completion time as fundamental, but has a higher demand for its cost or quality. Buyya et al.6,7 proposed three kinds of heuristic algorithm to reduce the cost of constraint time; V Khajehvand et al. 8 proposed a scalable cost-time trade-off (SCTT) model for scheduling workflow tasks; P Czarnul 9 selected a capable service for each task so that a global criterion is optimized such as a product of workflow execution time and cost, a linear combination of those, or minimization of the time with a cost constraint; Rodenburg et al. 10 minimized the cost of workflow execution under deadline constraints in the a mathematical programming language (AMPL) model. Kim et al. 11 used a resource consolidation in a parallel manner to decrease the cost with the deadline assurance and maximized the service quality.
Based on the problem of realizing goals under constraint time, Chinese researchers also carried out corresponding research. Liu et al. 12 adopted the priority factor technology to optimize time and cost in the workflow and analyzed the differences in several algorithms. Wu et al. 13 proposed the method of the management for the multi-tenant uses of SaaS and created the cost optimization strategy with Markov chain; Liang et al. 14 proposed a new dynamic scheduling method; Zhang and Qi 15 proposed a bi-directional scheduling algorithm; and Cao et al. 16 proposed optimal scheduling method based on particle swarm optimization.
Visibly, when making the scheduling for the business workflow, multiple factors need to be considered; how to optimize the completion quality in constraint time is also a hot research area.
Problem descriptions
Definition of workflow model
The workflow model is a reasonable arrangement for business process; it can effectively arrange the service to the corresponding set and make the complex product process to a DAG. Simply put, workflow model is a process of using all information in the business to compose the graph.
Definition 1: Task pool
It refers to a set of all tasks in a given business process. If indicated by P, then P = (p1, p2, …, pi, …, pn).
Definition 2: Service pool
It refers to the set of all services corresponding to task pi in set P. If represented by Si, then Si = (s1, s2, …, sj, …, sm).
Definition 3: Rank of service pool
Refer to the number of services in set Sp for any p in set P, represented by rp.
Definition 4: Workflow model
If the model is represented by W, then W can be formalized as a binary group W = (P, Sp), p∈P, P is the task pool, and Sp is the service pool of task p. In a specific business process, ordering relation exists between tasks pi and pj (i ≠ j), and each task p corresponds to a service pool Sp, and through the services it coordinates with each other reasonably between tasks to implement the entire workflow. 17
A typical complex product workflow model is shown in Figure 1. In the model, task set P represents all the tasks of the business, service set Sp (p∈P) represents the corresponding database resources, and spk indicates that task p is completed by the kth service in set Sp. Start indicates the start of the process, ps represents the virtual task, and Ss represents the set of services; then ps and Ss point to Start. End indicates the end of the process, pe represents the virtual task, and Se represents the virtual set of services; then pe and Se point to End.

Workflow model.
Definition 5: Workflow graph
The graph that uses the DAG for combining with task pool R and service pool S is used to describe the business process clearly. It is shown as G = {spk, E}, p∈P, 0 < k ≤ rp; spk indicates that task p is completed by the kth service in set Sp; E represents a directed edge and indicates the order between tasks; the next task can begin only when the previous task is completed. In the workflow graph (WFG), the node which does not have a precursor is the start node, represented as Start; ps represents a virtual task, Ss represents a virtual set of services, then ps and Ss point to Start; the node which does not have a successor is the end node, represented as End; pe represents a virtual task, Se represents a virtual set of services, then pe and Se point to Start. 18
Definition 6: Service attribute
It refers to the attribute parameter that service spk owns in WFG G, represented as ptpk = (tpk, apk), p∈P, 0 < k ≤ rp; tpk represents the time of using service spk to complete task p and apk represents the accuracy of using service spk to complete task p.
Generation algorithm of WFG
The WFG can effectively simulate the business process with its tasks and services; it plays a vital role for safety personnel to own the entire working path in detail. On the basis of previous studies,19,20 combining with workflow attributes, it gives the generation algorithm of WFG as follows:
For the business process, divide the task set P, arrange all the tasks in order, and plan out service set Sp which corresponds to task p.
Start at the beginning node, add the services of the first task into the same layer, and connect Start with the services in order.
Discover the next task, add all the corresponding services into the same layer, and connect with the services of the previous layer in order.
Repeat step (3) until no task and make the services of the last layer connect to the final state End in order.
Add attributes ptpk to all services spk.
Complete the WFG.
According to the above policy, it can design the pseudo-code of generation algorithm of WFG as follows:
Input: parameters: spk; ptpk; Start; End; len; rp; //len is the number of the task set P
Output: WFG G
Example of generation algorithm of WFG
Assume task set P in the workflow model is {α, β, γ, δ}, and each task has a corresponding service pool Sα, Sβ, Sγ, Sδ; then using generation algorithm of the WFG and inputting parameters, the WFG, for instance, and the corresponding parameters are shown in Figure 2:

Example of the workflow graph.
In Figure 2, for each task p, it has a corresponding service pool S. For each service spk in set Sp, it has corresponding attribute ptpk = (tpk, apk), by the tasks to generate the workflow.
Constraint analysis of workflow
Definition 7: Completion time
It means using services spk (p∈P, 0 < k ≤ rp) of different tasks p to build a working path; the time the path spends is represented as T.
Definition 8: Completion accuracy
It means using services spk (p∈P, 0 < k ≤ rp) of different tasks p to build a working path; the accuracy the path reaches is represented as A.
Definition 9: Completion deadline
It refers to the latest time to complete complex product and is represented as ψ.
In the WFG G, it needs specified constraints’ policy to complete the business. The formal formulas for the process are as follows
where lpk uses the service spk to complete the task p. Formula (1) means the accuracy of a path that can complete the complex product
Formula (2) means the path can only select one service when completing one task
Formula (3) means the time complex products spend must be less than or equal to the deadline
Formula (4) means lpk is a variable.
Under deadline ψ, the article uses formula (1) as the heuristic function and uses formulas (2)–(4) as the constraint conditions to work out the accuracy of a path that it can complete the complex product, and then uses different algorithm strategies to obtain the accuracy corresponding different paths.
Traditional unidirectional objective algorithms
In a complex production process, it carries the attributes of completion time and completion accuracy; the traditional unidirectional objective algorithm means only ensuring the minimum completion time but neglecting the completion time or only ensuring the maximum completion accuracy but neglecting the completion time.
The unidirectional objective algorithm based on minimizing the completion time
Minimizing the completion time in a business process can save more time; the algorithm selects the services which spend minimum time completing each task to build a workflow path, according to formula (5) to calculate the accuracy of the path
In formula (5), Tmin is the constraint condition, and the completion time is minimized; A is the accuracy of the path.
The unidirectional objective algorithm based on maximizing the completion accuracy
Maximizing the completion accuracy in a business process can make the path with optimal quality, so each task in the path needs the service which has maximum accuracy to complete, according to formula (6) to calculate the completion time of the path
In formula (6), Amax is the restrictive condition, and the completion accuracy is maximized; T is the time of the path.
From the analysis of these two unidirectional objective algorithms, the disadvantages of these two policies are as follows:
Minimizing the completion time results in the minimum completion accuracy of the path.
Maximizing the completion accuracy causes a problem that the completion time may exceed deadline ψ; it cannot meet the business requirements that complete the process on time.
Based on serial reduction optimization strategy in the time–accuracy
The core idea of serial reduction technology is fully considering the completion time and accuracy of the business process, within deadline ψ, assigning each task with an activity section, translating complex production process into the implementation of the local node, through an iterative manner, and to find a path that can balance both completion time and accuracy.
Definition
Definition 10: Task freedom WFp
It refers to the active portion of task p in workflow model; the portion is represented as [STp, ENp], p∈P, STp means the earliest start time for the task, and ENp means the latest start time. STp and ENp of WFp are obtained by formula (7)
Theorem 1
In WFG G, any task p has only one task freedom WFp.
Authentication
Existing the tasks p, p′, and they are in order, p′ is the precursor of p.
Prove that the earliest start time STp is only one. Suppose the running time of the kth service for task p′ is shortest, namely,
Prove that the latest start time ENp is only one: suppose the latest start time of task p′ is
In summary, the earliest start time of task p is only one and the latest start time of task p is also only one, so the task freedom WFp also has one and only one.
Q.E.D.
Arithmetic statement
Serial reduction hierarchy algorithm is an algorithm that can choose the appropriate services in the activity section and balance the completion time and the accuracy. The traditional algorithm may produce some time segments because of the service’s inadequate use of time in the service of the activity section. Serial reduction algorithm can make full use of time and gather the time fragment to achieve a local optimal solution. Based on the freedom of the task, there is a WFp = [STp, ENp] for any task p in the set of task P; this article solves the maximum accuracy from back to front that each task can obtain when it starts at different times and determine the optimal path by the comparison of final completion accuracy.
Suppose the business process contains n tasks, f(p,t) represents the maximum accuracy that task p can achieve when it starts at time t, t∈[STp, ENp], that is, the activity freedom of task p. Within task freedom WFp, task p in the last layer must be able to get a maximum accuracy when it starts at different times; the process can be obtained by formula (8)
Existing task p′ which is the precursor of task p. After the reduction of all tasks, based on the results of task p which is in the last layer, the algorithm obtains the maximum accuracy of each task when it starts at different time iteratively, the process can be obtained by formula (9)
With the iteration, when traversing to the initial task, the combination of services which has the maximum accuracy of business processes is the optimal path.
From the above, the steps of the algorithm are as follows:
Call the algorithm WFG to generate WFG.
Use the business deadline and formula (7) to obtain the freedom WFp for each task.
Use formula (8) to obtain the maximum accuracy of the task in the last layer when the task starts at different times.
Use formula (9) to obtain the maximum accuracy of each task when it starts at different times iteratively.
Compare the final accuracy to achieve the optimal path which has the maximum accuracy.
According to the above strategies, the pseudo-code of the algorithm SRO (based on serial reduction optimization strategy in the time–accuracy) is as follows:
Input: parameters: P; spk; ptpk; len//len is the number of the task set R
Output: the optimal path
Experimental results and analysis
Case design
By the research, the business process of the boiler plant in a power plant can be divided into application, examine and approve, execution, record, debug, and completion of the six areas; after the analysis, the processes of application and completion can be set to virtual task. These task links form task set P, which can be represented in the abstract as follows: examine and approve is p1, execution is p2, record is p3, debug is p4, the virtual task of application is ps, and the virtual task of completion is pe; each task p corresponds to a service pool Sp, the service set of examine and approve p1 is S1 = {s11, s12, s13}; the service set of execution p2 is S2 = {s21, s22}; the service set of record p3 is S3 = {s31, s32, s33}; the service set of debug p4 set is S4 = {s41, s42}; the virtual service set of application is Ss; and the virtual service set of completion is Se. Any elements in each service set have attributes ptpk; combined with the service, it can be expressed as spk (tpk, apk); s11(3,0.96) means the time of using service s11 to complete task p1 is 3, and the accuracy is 0.96. Other services in each task and their working time and accuracy are shown in Table 1.
Task and service attributes.
Through the above tasks and service pool Sp of each task p, combined with the WFG (generation algorithm of WFG), the workflow can be established as shown in Figure 3.

Workflow graph.
Figure 3 shows four tasks of the boiler plant and their service pools, and the corresponding attributes of the services for each task are marked in the figure. By this, we call different algorithms to calculate different test results.
Algorithm analysis
Based on the WFG in Figure 3, specifying the completion deadline of this business process ψ = 19, the article uses the traditional unidirectional objective algorithm with the SRO (based on serial reduction optimization strategy in the time–accuracy) to compare, reflecting the optimized effect of SRO.
Traditional unidirectional objective algorithm
1. The unidirectional objective algorithm based on minimizing the completion time
Based on minimizing the completion time, then calling formula (5) to obtain the corresponding path which is s11 → s21 → s31 → s41, at this time, the constraint condition is Tmin = t11 + t21 + t31 + t41 = 3 + 2 + 5 + 6 = 16; the Tmin meets the condition that it is less than 19, then the corresponding workflow accuracy is A = a11 × a21 × a31 × a41 = 0.96 × 0.97 × 0.96 × 0.96 = 0.858.
2. The unidirectional objective algorithm based on maximizing the completion accuracy
Based on the strategy, the paper uses the formula (6) to obtain the corresponding path which is s13 → s22 → s33 → s42, at this time, the constraint condition is Amax = a13 × a22 × a33 × a42 = 0.99 × 0.99 × 0.98 × 0.97 = 0.931, the corresponding completion time is T = t13 + t22 + t33 + t41 = 6 + 4 + 9 + 8 = 27, because the completion time is greater than the deadline 19, so that the path does not meet the condition.
Based on serial reduction optimization strategy in the time–accuracy
By the task pool and the service pool in Table 1, the paper uses algorithm 2 to obtain the earliest start time of p1, p2, p3, p4 are 0, 3, 5, 10 respectively, the latest start time are 3, 6, 8, 13, respectively. Then the freedom of task p1 is [0, 3], the freedom of task p2 is [3, 6], the freedom of task p3 is [5, 8], and the freedom of task p4 is [10, 13]; within the freedom of each task, the maximum accuracy of each task that starts at different times can be obtained:
p4(debug):
f(4,10) = max{0.96,0.97} = 0.97; f(4,11) = max{0.96,097} = 0.97;
f(4,12) = max{0.96} = 0.96; f(4,13) = max{0.96} = 0.96.
p3(record):
f(3,5) = max{f(4,10) × 0.96; f(4,12) × 0.97} = 0.931;
f(3,6) = max{f(4,11) × 0.96; f(4,13) × 0.97} = 0.931;
f(3,7) = max{f(4,12) × 0.96} = 0.921;
f(3,8) = max{f(4,13) × 0.96} = 0.921.
p2(execution):
f(2,3) = max{f(3,5) × 0.97,f(3,7) × 0.99} = 0.912;
f(2,4) = max{f(3,6) × 0.97,f(3,8) × 0.99} = 0.912;
f(2,5) = max{f(3,7) × 0.97} = 0.894;
f(2,6) = max{f(3,8) × 0.97} = 0.894.
p1(examine and approve):
f(1,0) = max{f(2,3) × 0.96,f(2,4) × 0.98; f(2,6) × 0.99} = 0.894;
f(1,1) = max{f(2,4) × 0.96,f(2,5) × 0.98} = 0.876;
f(1,2) = max{f(2,5) × 0.96,f(2,6) × 0.98} = 0.876;
f(1,3) = max{f(2,6) × 0.96} = 0.858.
The f(1,0) = 0.894 is maximum, so the optimal path for the serial algorithm is as follows: p1 starts at time 0, and f(2,4) × 0.98 = 0.894, so p1 is completed by service s12 and the time that s12 uses is 4; p2 starts at time 4, and f(3,8) × 0.99 = 0.912, so p2 is completed by service s22 and the time that s22 uses is 4; p3 starts at time 8, and f(4,13) × 0.96 = 0.921, so p3 is completed by service s31 and the time that s31 uses is 5; p4 starts at time 13, and f(4,13) = 0.96, so p4 is completed by service s41 and the time that s41 uses is 6. The completion time was 19, it is equal to the deadline, and the completion accuracy obtained by the path is 0.894.
Algorithms’ comparison
Aiming at traditional unidirectional objective algorithm and serial reduction algorithm, we combined with the data in section “Algorithm analysis” and made a comparison between algorithms, using the MATLAB to describe the differences and the path of different algorithms, as shown in Figure 4. Figure 4(a) shows the accuracy the path reaches and the time the path spends when the process reaches different layers of tasks in different algorithms, and Figure 4(b) shows that different algorithms have different completion paths.

(a) Accuracy and time of different tasks and (b) the path of different algorithms.
From Figure 4(a), when the process takes on to task p4, its accuracy is the completion accuracy A; it can be seen that the accuracy of the unidirectional objective algorithm based on maximizing the completion accuracy is maximum, but the completion time exceeds the deadline, so the method is abandoned. The completion time of the other two strategies is less than or equal to the deadline, so the accuracy of these two strategies can be obtained; the accuracy of the unidirectional objective algorithm based on minimizing the completion time is A1 = 0.858; the accuracy of the SRO (based on serial reduction optimization strategy in the time–accuracy) is A2 = 0.894, using the above data to calculate the increase rate K = (A2 − A1)/A1 × 100% = 4.20%. Visibly, serial reduction optimization strategy compared to the traditional unidirectional objective algorithm has played a promoting effect, and the optimal path can be obtained by Figure 4(b): s12 → s22 →s31 → s41.
The influence of other parameters on the algorithm performance
Size of the deadline ψ
Deadline ψ is the latest workflow completion time; as a general rule, increasing the deadline will improve the completion accuracy of the optimal path. In this article, serial reduction algorithm is based on confining the time; through changing the deadline, the freedom of some tasks also makes some changes. If the freedom increases, the task can select the higher accuracy service; on the contrary, if the freedom decreases, the task can only select the smaller accuracy service. Taking {5, 10} as the number of tasks to form two workflow structures, for each task pi in the structure corresponding to a service pool Si, its number is a value of the interval [2, 4], and the service has attribute ptpk. Based on time Tmin which the strategy based on minimum time costs, increasing 10%, 20%, 30%, 40% as the deadline respectively, and calculate the time Tmax and the accuracy that the strategy based on maximum accuracy cost. Through the below constraint times, Table 2 shows the influence of changing deadline on the algorithm performance.
Influence of different deadlines.
From Table 2, when the increase rate is 0, the strategy is based on the minimum time. In these different workflow structures, with the increase in the deadline, the increasing rate becomes higher; it means the optimization performance is getting better and better. Therefore, in the actual business process, increasing the deadline appropriately will increase the completion accuracy and improve the optimization performance of the serial reduction algorithm.
Number of tasks
The completion accuracy of a path is obtained by the product of the accuracy rates which belong to the services in the path. With the increasing number of tasks, it will influence the completion accuracy and the optimization performance. Taking {5, 10, 15, 20} as the number of tasks to form four workflow structures, for each task pi in the structure corresponding to a service pool Si, its number is a value of the interval [2, 4], and the service has attribute ptpk. Based on time Tmin, that is, the strategy based on minimum completion time, increasing 20% as the deadline, through the comparison of the strategy based on minimum time and the serial reduction algorithm, Table 2 shows the influence of the tasks’ number on the algorithm performance.
From Table 3, obviously, with the increasing number of tasks, each link has a deviation, so the completion accuracy will be reduced, but the optimization performance of serial reduction algorithm will be better.
Influence of task’s number.
Conclusion and outlook
Aiming to optimize the accuracy under the deadline, an algorithm which combines the task, service with the directed acyclic graph (DAG) is used in this paper firstly, namely WFG algorithm which can obtain the workflow structure. Based upon that, the research group first proposed a unidirectional objective algorithm based on the minimum time and the maximum accuracy of the workflow; aiming at the potential disadvantages of lower completion accuracy or exceeding the deadline, the researchers proposed a serial reduction optimization algorithm under the constraint deadline, adopted a hierarchical strategy in this algorithm, by taking an iterative approach, and confined the time for every task to obtain the maximum accuracy of the task when it starts at different times, and determined the optimal path by the maximum completion accuracy when all tasks are completed. The experiment shows that the performance of the serial reduction optimization algorithm was enhanced 4.2% than the traditional unidirectional objective algorithm. But there still remain some problems like not putting the business processes’ operational cost into this algorithm which require further discussion and study with people concerned. Future research direction is to take the relative properties of the service in the business process into consideration and truly find an optimal path which is efficient, accurate, and cost-saving; the group will further improve and refine it.
Footnotes
Academic Editor: Ramoshweu Lebelo
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the National Natural Science Foundation of China (grant no. 61403109).
