Multi-centric management and optimized allocation of manufacturing resource and capability in cloud manufacturing system

Abstract

Cloud manufacturing offers the potential to make mass manufacturing resources and capabilities more widely integrated and accessible to users through network. Most related research assumes that there exists only one management center for all manufacturing resources and capabilities in a manufacturing cloud. However, this could cause the efficiency problem (e.g. scheduling time) and harm the quality of service (e.g. response time). Actually, a large-scale manufacturing cloud should have multiple management centers to deal with massive, widely distributed manufacturing resources and capabilities and users; meanwhile, the constraint of finite manufacturing resources and capabilities and the cost of remote collaboration should be taken into consideration. Thus, this article first presents the architecture for the multi-centric management with two-level scheduling strategy combining the advantages of the centralized and decentralized decision-making. Then, after quantifying the availability and the collaborative cost of the manufacturing resources and capabilities, we propose a global optimization model for the manufacturing resources and capability allocation under the multi-centric architecture. Finally, a case study adopting our new method shows that the utilization of the manufacturing resources and capabilities would be more balanced, while the cost of the total collaboration would be reduced, compared with the typical decentralized solution. The research results can support cloud manufacturing to effectively deal with the challenge of management and allocation for increasingly large-scale and distributed manufacturing resources and capabilities.

Keywords

Cloud manufacturing resource management resource allocation distributed scheduling multi-centric manufacturing capability

Introduction

Cloud manufacturing (CMfg) as a new service-oriented manufacturing paradigm, adopts and extends the concept of cloud computing,^1,2 in which the computing and storage resources are concentrated and managed for sharing. Integrating the existing manufacturing technology with the newly-emerged information technology (such as cloud computing, internet of things, modeling and simulation and big-data), CMfg virtualizes various manufacturing resources and capabilities (MR/Cs) and builds an MR/C pool (cloud) to deliver on-demand manufacturing services for the whole lifecycle activities of manufacturing, through network anytime and anywhere.^3–5 The cloud need concentrate massive and widely distributed MR/Cs. For example, a manufacturing cloud of a Chinese aerospace conglomerate integrates and shares lots of MR/Cs from its subsidiaries across China, including (a) high-performance clusters that run 100 teraflop operations per second and 320 TB storage; (b) more than 300 sets of tool software for the design and analysis in mechanical, electronic, control and other system; (c) not less than 10,000 parameterized models and knowledge files; (d) hundreds of high-end digital production lines and cellular manufacturing systems from several plants and (e) thousands of manufacturing capabilities⁶ such as basic technology, parts supply, and production process in the whole lifecycle of manufacturing.

However, most research about management and allocation of MR/Cs in CMfg^7,8 and other traditional paradigms such as grid manufacturing^9,10 assumes that there exists only one management center, and that is what we need to re-examine. For one thing, in a manufacturing cloud with the single-centric architecture, the quality of service (QoS) such as the response time cannot be guaranteed, because collecting the state of massive, widely distributed MR/Cs could be affected by the network speed and instability, especially when the scheduling cycle of some jobs should be limited to only tens of seconds. For another, it is actually not a single-centric but multi-centric architecture. There are many existing MR/Cs such as the digital production lines and the cellular manufacturing systems which cannot be centralized like the data centers in cloud computing due to the high migration cost. A feasible way is that the MR/Cs in original workshops or factories are connected by reusing and reforming their scheduling systems into a nearby management center, and then all of the MR/Cs can be combined as a whole by integrating the management centers scattered in different areas shown in Figure 1. With this multi-centric architecture, we focus on what kind of strategy (e.g. centralized or decentralized decision-making) is better for the efficient management and allocation.

Figure 1.

Layout of the MR/Cs in multi-centric architecture.

In fact, multi-centric architecture is very common in cloud computing (for example, federated clouds^11,12), and its methods of management and allocation are worth learning, but CMfg and manufacturing tasks have special characteristics. First of all, the MR/Cs are usually expensive and limited and used in the queuing-up way, unlike cloud computing services (such as electronic payment services) which can support massive concurrent access. So, the availability of the MR/Cs needs to be measured. Second, virtualization in CMfg has realized the fine-grained management and allocation of MR/Cs. Virtualization technology can be used to divide physical MR/Cs into a number of logical units and combine logical units into virtual instances elastically according to user demands. So, the measurement of the availability is different compared with the traditional ways. Third, a manufacturing task usually consists of multiple sub-tasks which would be achieved by the collaboration of the MR/Cs such as logistics transportation between different processes, business process synchronization among sub-tasks. If the collaborative MR/Cs scatter in different management centers far away, the cost of remote collaboration must be considered. The availability and the collaborative cost of the MR/Cs are two important indicators of management and allocation in multi-centric architecture. We also pay attention to how to quantify them.

In addition, compared with the traditional network manufacturing, CMfg pays more attention on operation model,³ so the management and allocation in multi-centric architecture have to satisfy the interests both of the MR/C service providers and consumers. For the MR/C service providers, no matter which management center their services register in, they call for a mechanism to guarantee that the services with the same category and level can be acquired equably to avoid unbalanced load of MR/Cs globally. For the MR/C service consumers, to ideally fulfill their manufacturing tasks within the delivery time at the lowest cost and highest efficiency, they want the cost of remote collaboration between the MR/Cs to be as low as possible. For example, in the process of collaborative production, the lower cost of the remote collaboration means the logistics transportation is cheaper and quicker; in the process of collaborative research and development (R&D), it means the business process synchronization is executed with lower latency and wider bandwidth. This is a multi-objective problem, and we are concerned about whether there is an elegance and simple mathematical model for the multi-centric management and optimized allocation of MR/Cs.

The main contributions of this article are as follows:

The architecture for the multi-centric management is presented. The architecture extends the existing MR/C management framework which does two-level scheduling to solve the heterogeneity problem of MR/Cs and executes two-level scheduling strategy combined with centralized and decentralized decision-making to tackle the distribution issue of MR/Cs.

The methods to quantify the availability and the collaborative cost of the MR/Cs are given and discussed. The methods take the characteristics of manufacturing clouds (such as virtualization) and manufacturing tasks (such as collaboration) into full consideration.

A global optimization model for the MR/C allocation under the multi-centric architecture is proposed. Using the model, compared with the existing solution, the utilization of the MR/Cs would be more balanced, while the cost of the total collaboration would be reduced.

The rest of the article is organized as follows. In section “Related works,” we survey related works. In section ‘Coordinated management architecture of MR/C in CMfg,” a new framework for the multi-centric management is presented. In section “Availability and collaborative cost of MR/Cs,” we discuss the metrics for the availability and the collaborative cost of the MR/Cs. In section “Global optimization model for the MR/C allocation under the multi-centric architecture,” we propose a global optimization model for the MR/C allocation under the multi-centric architecture. In sections “Application example” and “Conclusion and future work,” we present a case study and the summary, respectively.

Related works

There are various kinds of MR/Cs in the manufacturing cloud. Assuming only one management center, most current research on the management of MR/Cs concerns how to integrate, monitor and allocate MR/Cs. We have proposed an MR/C management framework of manufacturing clouds in Lin et al.¹³ Based on the respective matured methods of virtualizing and managing computing resource, software (and their license), models, manufacturing facilities and capability,^14–16 we utilized different specialized modules to manage the virtual instances of different kinds of MR/Cs, and then on top of the modules, a management middleware of MR/C was presented to organize and allocate the whole MR/Cs, including the unified ID allocation and the lifecycle management of MR/Cs and the scheduling of tasks in a single global task queue. However, this research also assumes only one management center, so it should be further extended to support the multi-centric management.

Virtualization¹⁷ plays a vital role in CMfg. Li et al.⁵ thought that virtualization provides the way to encapsulate and abstract the heterogeneous, distributed MR/Cs to facilitate the unified management of MR/Cs in CMfg. After virtualization, the physical MR/C may have different mapping relationships with a virtual instance (The Service Oriented Architecture¹⁸ is employed to deliver services of virtual instances to users), including one-to-one, one-to-many and many-to-many relationships (Figure 2). For example, a machine tool typically has the one-to-one relationship with its virtual instance, because it usually could not process two more workpieces at the same time. A manufacturing capability can be divided into different number of virtual instances if measured in man-month or man-hour (one-to-many relationship), while a number of computers in a computing cluster can be virtualized into multiple virtual clusters in cloud computing (many-to-many relationship). However, manufacturing grid only supports the one-to-one mode,¹⁹ as it cannot efficiently manage the fine-grained MR/C like multi-cores on a computer without the latest virtualization technology.

Figure 2.

Relationships between the physical MR/Cs and virtual instances: (a) one to one, (b) one to many and (c) many to many.

Under the multi-centric architecture, cloud computing usually deploys application services in several data centers, so that users’ requests can be routed to the most suitable center.^20,21 There are two practical ways to schedule tasks in a cloud: the centralized scheduling method based on the primary and standby architecture and the distributed, decentralized one.²² The former method sets one main center to schedule all tasks. All users request services from this center. When the main center or its network breaks down, the standby center will automatically perceive such situations (e.g. through monitoring heartbeats) and deliver services to users instead. As for the second method, all sub-centers operate in the peer-to-peer way: each sub-center is responsible for the scheduling of tasks from the nearby users and will process the tasks to the best of its ability or else will send the request for the MR/C to other sub-centers. Basically, the first method bases on the single center architecture, probably leading to low QoS (e.g. responding time) when the MR/Cs are large-scale and distributed. The second method does not consider the cost of remote cooperation and the global optimizing of MR/C allocation, so that the interest of users and providers of MR/Cs cannot be well-balanced.

Actually, this is the distributed scheduling problem and the above methods are only a subset of the solutions. Toptal and Sabuncuoglu²³ proposed a systematic classification framework of the distributed job scheduling problems from the aspects, such as information flow structure, communication mechanism, local agent types, assignment method, schedule generation, objection and machine environment. The distributed job scheduling can be done in the centralized way (with or without an independent, global scheduling center) or the decentralized way (with or without the coordinators), according to the information flow structure. Besides, the distributed job scheduling can be done hierarchically. The introduction of multi-agent technology²⁴ greatly promotes the development of distributed scheduling. The multi-agent system can make optimal decisions through complex interactions between distributed, autonomous agents. Kamsu-Foguem and colleagues^25,26 proposed methods to fuse the knowledge from different sources to support collaborative decision-making in industrial maintenance, including knowledge formalization of domain vocabulary to improve the communication and knowledge sharing with conceptual graphs formalism, multi-expert knowledge management with the transferable belief model to support collaborative decision-making and a variant of the case-based reasoning mechanism with a process of solving new problems based on the solutions of similar past problems. However, these methods cannot be used directly to solve the domain problems in CMfg and new solutions should be developed to satisfy the application requirements.

Coordinated management architecture of MR/C in CMfg

To manage MR/Cs in CMfg, we propose a two-layer coordinated management framework (Figure 3) by extending our previous management architecture.¹³

Figure 3.

Coordinated management architecture of MR/C in CMfg.

First, the basic layer of the architecture is based on our previous framework, which primarily aims to shield the heterogeneity of MR/Cs and manages MR/Cs in a single center.¹³ The old framework has two levels. The bottom level employs different management systems to manage different kinds of MR/Cs (e.g. computing resource, software, models, facilities and capabilities). Each kind of MR/Cs is usually heterogeneous, and the heterogeneous physical MR/Cs are virtualized as virtual instances and managed by a management system which can be the model management system, the software management system, the capability management system and so on (as shown in Figure 3). At the upper level, the internal scheduling module is responsible for selecting and organizing different kinds of virtual instances to deliver composite services according to the state information of virtual instances and the composition relationship between different kinds of MR/Cs. For example, the virtual desktop service which can provide users virtual product–design environments is built by composing virtual software and virtual machines. There are multiple management centers in a manufacturing cloud, but each center can manage MR/Cs in such similar way.

Based on the basic layer, the high layer mainly addresses the distribution problem of MR/Cs also in two levels. According to the design, every management center in the CMfg system only accepts the service requests nearby. As collecting the state information from all virtual instances globally can easily cause the efficiency problem, and every management center only maintains the MR/C categories of other management centers and asks for the availability information of related MR/Cs from corresponding management centers according to the accepted tasks. Then, at the top level, the requested management center will make the optimal decision according to the whole availability information and the cooperation cost of related MR/Cs and route the sub-tasks to the optimal centers; at the bottom level, the chosen centers will accept the allocated sub-tasks and optimally select the suitable virtual instances from their inside MR/Cs to implement the sub-tasks independently. Thus, we can see that the framework combines the advantages of the centralized and decentralized decision-making: a management center performs the centralized decision-making by acquiring essential information from all management centers and scheduling its tasks globally and optimally to suitable management centers when it accepts tasks from users; each management center in CMfg can behave like this, so from this aspect, this is the decentralized decision-making which makes the cloud more robust and reliable.

The overall coordination process of the framework is as follows:

Step 1. An MR/C management middleware accepts a task from the upper application by the service-requesting interface. The task may consist of many sub-tasks, each requesting certain amount of certain kinds of MR/Cs.

Step 2. Through the coordination module, the MR/C management middleware asks for the availability information of related MR/Cs from corresponding management centers’ MR/C management middleware and negotiate the cooperation cost between MR/Cs with them on demand.²⁷ The “related MR/Cs” refer to the MR/Cs that may potentially be used by the accepted task.

Step 3. Every MR/C management middleware updates the state information of its MR/Cs periodically via the monitor module¹⁴ and calculates the availability of MR/Cs on demand.

The cooperation cost of MR/Cs between two centers is determined by two middleware through negotiations.

The MR/C management middleware makes the optimal decision using the mathematic model shown in section “Global optimization model for the MR/C allocation under the multi-centric architecture and intelligent algorithms.”

Step 4. The MR/C management middleware then allocates the sub-tasks to the optimal centers according to the decision. The management middleware in the chosen centers will reserve and lock the virtual instances for the sub-tasks.

Step 5. The MR/C management middleware in the chosen centers schedules and executes the sub-tasks using the internal scheduling module, and then returns the execution progresses or results.

Availability and collaborative cost of MR/Cs

Availability of MR/Cs

We use the availability of MR/Cs, because there are only limited available MR/Cs in CMfg.²⁸ To measure the availability of MR/Cs is the first step toward optimized allocation of MR/Cs; otherwise, the cloud could not know whether and when there exist idle MR/Cs for new tasks. The availability can be measured according to two main factors:

(a) Virtualization

The availability of MR/Cs mainly depends on how much the physical MR/Cs remain, how much the requested virtual MR/C instances are expected to occupy and how many virtual MR/C instances could be virtualized from the left physical MR/Cs. There exist one-to-one, one-to-many and many-to-many relationships between the physical MR/Cs and the virtual MR/C instances. In the case of one-to-one relationship such as for the machine tool, the remaining physical MR/Cs could be measured by the proportion of the idle ones; in the case of one-to-many relationship such as for the manufacturing capability, the left physical MR/Cs could be measured by the proportion of the idle working hours; in the case of many-to-many relationship such as for the high-performance cluster, the remaining physical MR/Cs could be measured by the proportion of the idle CPU and memory (weighted sum). Meanwhile, the requested virtual MR/C instances could be measured by a few key parameters (e.g. working hour and station number) in the similar way, and we define P to represent the proportion.

(b) Task characteristic

The availability of MR/Cs also depends on task characteristics: manufacturing tasks can be finished in unpredictable time (e.g. we could not predict how much time a user would spend on a product design using the virtual desktop), can be finished in predictable time but without time limit (e.g. we could predict the execution time of batch jobs according to the job files), and need to be finished in predictable time and within designated windows (e.g. for the production order, the delivery time is given in the order, and we can determine the starting time by estimation). In this article, we assume that any kind of MR/Cs could only service manufacturing tasks in one mode.

Manufacturing tasks with unpredictable finishing time

If there are no enough physical MR/Cs left, the incoming manufacturing tasks have to wait in queue. It is easy for the management middleware to know the queue length N and the expected amount P of the requested virtual MR/C instances. Then, the availability of MR/Cs can be calculated as

\begin{matrix} Avail = \frac{1}{(\sum P_{i} + 1)}, N \neq 0 \\ Avail = 1, N = 0 \end{matrix}

(1)

Manufacturing tasks with predictable finishing time but no time limit

If there are no enough physical MR/Cs left and the finishing time of the current manufacturing tasks is predictable, we can know how long the incoming manufacturing tasks have to wait in queue. It is easy for the management middleware to know the queue length N and the expected quantity P of the requested virtual MR/C instances and the execute time (predicted to be T) of each manufacturing task in the queue. Then, the availability of MR/Cs can be calculated as

\begin{matrix} Avail = \frac{1}{(\sum (P_{i} \times T_{i}) + 1)}, N \neq 0 \\ Avail = 1, N = 0 \end{matrix}

(2)

Manufacturing task with predictable time and designated windows

No matter whether there are enough physical MR/Cs left currently, we have to look for and reserve MR/Cs for these kinds of manufacturing tasks within designated windows; otherwise, it will affect the execution of their succedent processes and the subsequent tasks reserved in calendar.

It is easy for the management middleware to know the length N of the reservation queue and the expected amount P of the requested virtual MR/C instances and the execute time T_SPAN of each manufacturing task in the queue. For a new task, we assume that it requires start time and the expected ending time as T_ST and T_DUE, and the expected quantity of the requested virtual MR/C instances as P₀. Then, the availability of MR/Cs can be calculated as

\begin{array}{l} A v a i l = 1 / ((T_{D U E} - T_{S T}) \\ \times P_{0} / T_{D U E} - T_{S T} - \sum (P_{i} \times T_{S P A N i})) + 1) \end{array}

(3)

Collaborative cost of MR/Cs

The collaborative cost of MR/Cs possibly includes the cost of the logistics transportation in collaborative production and the cost of the business synchronization in collaborative R&D (measured by the weighted sum of network bandwidth and network latency c = α × Bandwidth + β × Delay). The greater the indicator, the higher the collaborative cost, and vice versa. Usually, in order to simplify the problem, we assume that the collaborative cost equals to zero if the collaborative MR/Cs are in the same management center; in order to avoid the influence of the dimension, we use the ratio of real collaborative cost to max collaborative cost between the same collaboration of MR/Cs instead of the absolute value.

Global optimization model for the MR/C allocation under the multi-centric architecture

Global optimization model

Usually, the MR/Cs for sub-tasks could be accessed from several centers. Then, the mathematical model could help us to find out the global optimized allocation of the MR/Cs by scheduling the sub-tasks to optimal centers.

We assume that there are M sub-tasks and N centers, and each sub-task could only be scheduled to a center. We define the vector for the center selection in equation (4) as X^T. X^T equals to $[Y_{1}^{T}]_{1 \times M}$ , in which $Y_{i}^{T}$ equals to $[y_{h}]_{1 \times N}$ . $Y_{i}^{T}$ is a special vector in which y_h equals to 1 if the sub-task i is scheduled to the center h; otherwise, y_h equals to 0. Consequently, $\sum h \cdot y_{h}$ equals to 1, in which h values from 1 to N.

X^{T} = [Y_{1}^{T}, Y_{2}^{T}, \dots, {[y_{1}, y_{2}, \dots, y_{h}, \dots y_{N}]}_{i}, \dots Y_{M}^{T}]

(4)

The objective function is designed to be the weighted sum of the cost for acquiring the MR/Cs (the availability of MR/Cs) and the cost of the collaboration between the chosen MR/Cs for the sub-tasks in different manufacturing clouds.

We define the matrix for evaluating the costs in equation (5) as E. E equals to [F_ij]_M _× _M, in which F_ij equals to [f_hk]_N _× _N, then the objective function is expressed as (X^T × E × X)/2 in equations (6) and (7). The value of F_ij and f_hk is shown as follows

E = [\begin{matrix} F_{11} & \dots & F_{1 j} & \dots & F_{1 M} \\ ⋮ & ⋮ & ⋮ \\ F_{i 1} & \dots & {[\begin{matrix} f_{11} & \dots & f_{1 k} & \dots & f_{1 N} \\ ⋮ & ⋱ & ⋮ & ⋮ \\ f_{h 1} & \dots & f_{hk} & \dots & f_{hN} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ f_{N 1} & \dots & f_{Nk} & \dots & f_{NN} \end{matrix}]}_{ij} & \dots & F_{iM} \\ ⋮ & ⋮ & ⋮ \\ F_{M 1} & \dots & F_{Mj} & \dots & F_{MM} \end{matrix}]

(5)

\begin{array}{l} \frac{1}{2} \times X^{T} \times E \times X = \frac{1}{2} \times {[\sum_{i = 1}^{M} Y_{i}^{T} \times F_{i j}]}_{1 \times M} \\ \times {[Y_{j}]}_{M \times 1} = \frac{1}{2} \times \sum_{j = 1}^{M} \sum_{i = 1}^{M} Y_{i}^{T} \times F_{i j} \times Y_{j} \end{array}

(6)

Y_{i}^{T} \times F_{i j} \times Y_{j} = {[y_{h}]}_{1 \times N} \times {[f_{h k}]}_{N \times N} \times {[y_{k}]}_{N \times 1} = \sum_{k = 1}^{N} \sum_{h = 1}^{N} y_{h} \times f_{h k} \times y_{k}

(7)

1. If i equals to j, $Y_{i}^{T} \times F_{ij} \times Y_{j}$ equals to the cost for acquiring the MR/C requested by sub-task i when sub-task i is allocated to a specific center. F_ij, which is on the diagonal, is designed in equation (8). Avail_ih means the availability of the MR/C requested by sub-task i in center h and can be calculated by equation (1) or (2) or (3). If the MR/C requested by sub-task i do not exists in center h, we set Avail_ih as a positive real number as small as possible. Consequently, the higher the availability, the smaller the cost for acquiring MR/Cs and (X^T × E × X)/2 will be

F_{ij} = [\begin{matrix} \frac{1}{Avai l_{i 1}} & \dots & 0 & \dots & 0 \\ ⋮ & ⋱ & ⋮ & ⋮ \\ 0 & \dots & \frac{1}{Avai l_{ih}} & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & \dots & 0 & \dots & \frac{1}{Avai l_{iN}} \end{matrix}]

(8)

2. If i does not equal to j, $Y_{i}^{T} \times F_{ij} \times Y_{j}$ equals to cost of the collaboration between the MR/Cs which is allocated to execute the sub-task i and sub-task j. F_ij, which is not on the diagonal, is designed in equation (9). c_ijhk means the cost of the remote collaboration between sub-task i executed in center h and sub-task j executed in center k. If sub-task i and sub-task j are executed in the same center, we assume the cost equals to zero. Consequently, the lower the cost, the smaller (X^T × E × X)/2 will be

F_{ij} = [\begin{matrix} 0 & \dots & c_{ij 1 k} & \dots & c_{ij 1 N} \\ ⋮ & ⋱ & ⋮ & ⋮ \\ c_{ijh 1} & \dots & 0 & \dots & c_{ijhN} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ c_{ijN 1} & \dots & c_{ijNk} & \dots & 0 \end{matrix}]

(9)

Obviously, if the weight is not considered for the moment, the objective function is composed of $\sum_{i = 1}^{M} Y_{i}^{T} \times F_{ii} \times Y_{i}$ and $\sum_{j = 1}^{M} \sum_{i = 1, i \neq j}^{M} Y_{i}^{T} \times F_{ij} \times Y_{j}$ . $\sum_{i = 1}^{M} Y_{i}^{T} \times F_{ii} \times Y_{i}$ equals to the cost for acquiring the MR/Cs when scheduling M sub-tasks to N centers, and $\sum_{j = 1}^{M} \sum_{i = 1, i \neq j}^{M} Y_{i}^{T} \times F_{ij} \times Y_{j}$ equals to the cost of the collaboration between the MR/Cs which is allocated to execute the sub-tasks.

We refine the model from three aspects:

1. Use the variance to present the dispersion degree of the cost for acquiring MR/Cs shown in equation (10). The variance is added into the objective function, so that the smaller the variance, the smaller the objective function will be. This means the availabilities of MR/Cs assigned to sub-tasks are roughly the same to avoid the severe waiting of collaborative sub-tasks for each other

\begin{matrix} D = \frac{\sum_{i = 1}^{M} (Y_{i}^{T} \times F_{ii} \times Y_{i} - E)^{2}}{M} \\ where E = \frac{\sum_{i = 1}^{M} Y_{i}^{T} \times F_{ii} \times Y_{i}}{M} \end{matrix}

(10)

2. Set the maximum acceptable cost of the remote collaboration as c_max, which represents the maximal acceptable cost of logistics transportation or business process synchronization and is a constraint of the global optimization model.

3. Set the factor ω₁ and (1 − ω₁) to reflect different weight of the availability and the remote collaboration cost for global optimized allocation of MR/Cs.

The mathematical model for the multi-centric global optimized allocation can finally be expressed as follows

\begin{matrix} Minimize ω_{1} \times (\sum_{i = 1}^{M} Y_{i}^{T} \times F_{ii} \times Y_{i} + D) \\ + (1 - ω_{1}) \times (\sum_{j = 1}^{M} \sum_{i = 1, i \neq j}^{M} Y_{i}^{T} \times F_{ij} \times Y_{j}) \\ Subject to \sum_{h = 1}^{N} y_{h} = 1, y_{h} = 0, 1, y_{h} \in Y_{i} \forall i \\ Y_{i}^{T} \times F_{ij} \times Y_{j} ⩽ c_{max} \forall i, j i \neq j \end{matrix}

(11)

In the objective function, we use $Balance = \sum_{i = 1}^{M} (((\sum_{h = 1}^{N} Avai l_{ih}) / N)) / (\sqrt{\sum_{h = 1}^{N} {(Avai l_{ih} - (\sum_{h = 1}^{N} Avai l_{ih}) / N)}^{2}})$ to present the utilization balance of services and $Collabrationcost = \sum_{j = 1}^{M} \sum_{i = 1, i \neq j}^{M} Y_{i}^{T} \times F_{ij} \times Y_{j}$ to present the total cost of collaboration.

Complexity analysis and solving algorithm

We first analyze the complexity of the model. The mathematical model for multi-centric global optimized allocation is designed to allocate the M sub-tasks to N centers; the size of the solution space (not the feasible space) is N^M. For the given M sub-tasks, the problem complexity is O(n^M), which means the problem has the polynomial complexity. For the given N centers, the problem complexity is O(Nⁿ), which means the problem has the index complexity. The later problem is a NP-hard problem, because we cannot find a feasible solution of the (most common) model and validate whether the solution is optimal in polynomial time.

Certainly, a NP-hard problem is not an insoluble problem, because the numbers of sub-tasks and centers are usually limited. Take five sub-tasks and three centers as an example: the size of the solution space is only 243. But when we have 10 sub-tasks and 5 centers, the size of the solution space will sharply increase to 9,765,625. In this case, the intelligent optimization algorithm should be adopted.²⁹ Otherwise, solving the model will only cost a lot of time, let alone the time taken by the negotiation and collaboration among centers. In this article, we will only discuss the algorithm construction based on the particle swarm optimization (PSO) algorithm. The concrete solving process will be omitted.

(a) Particle construction

If we directly use the vector for the center selection, the formula (4) as the particle encoding, as each element of the vector is either 1 or 0 and the sum of those N elements must be 1 (a sub-task could be assigned to only one center), there will be too many constraints on particle flight. So, we redesign a vector (the particle location) with only M elements shown in equation (12)

\begin{array}{l} X^{T} = [x_{1}, x_{2}, \dots, x_{i}, \dots x_{M}] w h e r e \\ x_{i} \in {1, 2, \dots, N} \forall i \in {1, 2, \dots, M} \end{array}

(12)

When calculating the particle fitness, we only need to execute the following decoding: x_i corresponds to $Y_{i}^{T}$ in the way that the x_ith component in $Y_{i}^{T}$ equals to 1 and other components equal to 0.

In addition, we define another vector with M elements to present the particle velocity shown in equation (13). v_max is the limit of ith element of the velocity. If v_max is too large, the search accuracy of particles is not enough so that the optimal solution will be easily missed. If v_max is too small, particles will easily fall into local search

V^{T} = [v_{1}, v_{2}, \dots, v_{i}, \dots v_{M}] where v_{i} \in [- v_{max}, v_{max}]

(13)

(b) Particle flight

We assume that there are K individuals in the particle swarm, and several arrays were defined as follows. (1) x[K][M] represents the mth location component of the kth individual. (2) v[K][M] represents the mth velocity component of the kth individual. (3) lBest[K][M] represents the mth component of the best historical location of the kth individual; gBest[M] represents the mth component of the global best historical location.

The flight velocity update method of kth particle’s mth velocity component is shown in equation (14),³⁰ where ω is the inertia weight, c₁ and c₂ are the acceleration constants

\begin{matrix} v [k] [m] = & ω \times v [k] [m] + c_{1} \times ran d_{1} () \\ \times (lBest [k] [m] - x [k] [m]) \\ + c_{2} \times ran d_{2} () \times (gBest [m] - x [k] [m]) \end{matrix}

(14)

The value of location component must be an integer from 1 to N, so the location update method of kth particle’s mth location component is defined in formula (15), lnt() is the rounding function

x [k] [m] = {\begin{array}{l} 0 & i f x [k] [m] + v [k] [m] ⩽ 0 \\ \ln t (x [k] [m] + v [k] [m]) & i f 0 < x [k] [m] + v [k] [m] < N + 1 \\ N & i f x [k] [m] + v [k] [m] ⩾ N + 1 \end{array}

(15)

Application example

Application background

The proposed methodology has been applied to the collaborative production for aerospace complex products shown in Figure 4. The data come from one big manufacturing factory for complex products.

Figure 4.

Collaborative production for aerospace complex products.

After proper simplification, the case involves three kinds of aerospace complex products. The manufacturing of each product has four similar processes, including main structure processing, launch tube production, processing of special parts for vehicles, system-level assembly and debugging. The main structure processing requires the capability of composite material processing; the launch tube production requires the capability of sheet metal welding; the processing of special parts for vehicles requires the capability of large stroke machining; the system-level assembly and debugging require the capability of final assembly. The capability requirements of each product in every process are shown in Table 1. Each element in the table represents the (T_DUE − T_ST) × P₀ in formula (3).

Table 1.

Capability requirement by each product in every process.

	Product1	Product2	Product3
Main structure processing	200H	80H	120H
Launch tube production	150H	60H	90H
Special parts of the vehicle processing	160H	64H	96H
System-level assembly and debugging	100H	60H	80H

In addition, the case involves three management centers which are located in Beijing, Xian and Wuhan, respectively. The capability of each center is shown in Table 2. Each element in the table represents the T_DUE − T_ST in formula (3). In this case, formula (3) would be used to calculate the availability of MR/Cs.

Table 2.

Capability volume of each center.

	Center1 (in Beijing)	Center2 (in Xian)	Center3 (in Wuhan)
Composite material processing capability	25,000H	30,000H	20,000H
Sheet metal welding capability	22,500H	20,000H	21,000H
Large stroke machining capability	23,000H	24,000H	0H
Final assembly capability	0H	0H	40,000H

Moreover, the collaborative costs between the main structure processing and the system-level assembly and debugging are shown in Table 3; the collaborative costs between the launch tube production and the system-level assembly and debugging are shown in Table 4; the collaborative costs between the processing of special parts for vehicles and the system-level assembly and debugging are shown in Table 5. The meaning of each table is defined by formula (9), and the dimensionless values of these costs in the tables denote the ratios of real values to maximum values that belong to the same kind of cost. The collaborative cost needs to be multiplied by the corresponding scale coefficient (SC) for different products

SC (Product 1) : SC (Product 2) : SC (Product 3) = 5 : 3 : 4

(16)

Table 3.

Collaborative cost between main structure processing and system-level assembly and debugging.

	Center1 (in Beijing)	Center2 (in Xian)	Center3 (in Wuhan)
Center1	0	0.2	0.2
Center2	0.2	0	0.5
Center3	0.2	0.5	0

Table 4.

Collaborative cost between launch tube production and system-level assembly and debugging.

	Center1 (in Beijing)	Center2 (in Xian)	Center3 (in Wuhan)
Center1	0	0.3	0.3
Center2	0.3	0	0.75
Center3	0.3	0.75	0

Table 5.

Collaborative cost between special parts of the vehicle processing and system-level assembly and debugging.

	Center1 (in Beijing)	Center2 (in Xian)	Center3 (in Wuhan)
Center1	0	0.4	0.4
Center2	0.4	0	1
Center3	0.4	1	0

The case is simplified in order to ignore the minor factors. We assume that the time window of the corresponding process overlaps completely (to ignore the calendar traversal). We also assume that any process of a product should only be done by the capability of one management center; all of the tasks are submitted to the management center in Wuhan, and the system-level assembly and debugging and delivery would be in Wuhan, so the capability of the system-level assembly and debugging in other centers are all set as 0; the large stroke machining capability in the Wuhan center is set as 0, so that at least one process should be done in other centers.

Considering the stochastic factors, the number of tasks of each product was randomized independently (uniform distribution). The range of randomization is between 20 and 100 in which 20 is the minimum production lot and 100 is the maximum production lot.

Result comparison

In the following part, we will compare the results of the proposed model and the distributed and decentralized scheduling model (the primary-standby centralized scheduling model basically has the single-centric architecture, so it is not considered in this part). The latter model (shown in Table 6) prefers to choose the local manufacturing capability and another center which has the most remaining manufacturing capability will be considered only when the local manufacturing capability is insufficient. In addition, the latter method does not take the collaboration cost into consideration.

Table 6.

Distributed and decentralized scheduling model.

function SELECTION(M, Req(MR/C), N, Avail(MR/C)) returns a vector of selection result

inputs: M, num of sub-tasks

Req(MR/C)[M], vector of MR/C type requested by Sub-task i (from 1 to M)

N, num of centers

Avail(MR/C)[M][N], matrix of the availability of the MR/C requested by Sub-task i (from 1 to M) in Center j (from 1 to N)

local variables: Result[M], vector of selection result

Index, index of the center for a choice

for i ← 1 to M do

if the local manufacturing capability is sufficient then continue

for j ← 1 to N do

index ← select the index where Avail(MR)[i][index] is maximum

Result[i] ← index

return Result

The experiment results will be compared in two aspects: the utilization balance of manufacturing capability and the total cost of collaboration. The details are presented in the last paragraph of section “Global optimization model.”

The results are compared by subtraction, as in equations (17) and (18)

Result 1 = \frac{(Balance 2 - Balance 1)}{Balance 2}

(17)

Result 2 = \frac{(Collaborationcost 1 - Collaborationcost 2)}{Collaborationcost 1}

(18)

Where Balance1 and Collaborationcost1 are the utilization balance and total collaborative cost of the comparison model, respectively; Balance2 and Collaborationcost2 are the utilization balance and total collaborative cost of this article’s model, respectively.

Experiments are conducted when ω₁ (shown in (11)) ranges from 0 to 1 (increment is 0.01), and each experiment is conducted 100 times randomly. Note that the optimized allocation results of the distributed and decentralized management model are independent of ω_1.

Experiments are designed with two different inputs: the first input is the task of product 1 and product 2 and the result is shown in Figure 5; the second input is the task of product 1, product 2 and product 3 and the result is shown in Figure 6. In Figures 5 and 6, the horizontal ordinate presents ω₁ (shown in (11)), and the vertical ordinate presents the comparative result of Balance and Collaborationcost.

Figure 5.

Results comparison 1 (input: task for two kinds of products).

Figure 6.

Results comparison 2 (input: task for three kinds of products).

From each figure, we can conclude that when ω₁ approaches 0, this article’s model mainly focuses on minimizing the collaboration cost and compared with the existing distributed and decentralized model, the cost decreases by 50% with almost the same utilization balance. In the case that local MR/Cs are insufficient, the compared model’s balance is a little higher than that of this article’s model because its policy to choose another center is favorable to balance.

When ω₁ approaches 1, this article’s model mainly focuses on maximizing the utilization balance, which increases by 20% than the existing model, but the collaboration cost augments so fast that it could be 30% higher than the existing model (for the latter model, the local MR/Cs are assigned with high priority and the collaborative cost inside one center is neglected, so it is advantageous in the collaboration cost).

No matter the input tasks are two or three kinds of products, in the interval ω₁∈(0.45, 0.85), both the balance and collaboration cost of this article’s model are better than the existing model. In fact, these two indicators are mutually exclusive, so we compromise ω₁ to make them optimal.

From the two figures, we can see that when the amount of requested products in the input manufacturing task increases, the local MR/Cs would be insufficient so that the existing distributed and decentralized model has to get the left requested MR/Cs from other centers, improving the balance of the MR/C utilization. Hence, when ω₁ tends to 1, the balance of this article’s model does not augment as fast as the existing model. As the existing method neglects remote collaboration cost, from the perspective of the total collaborative cost, this article’s model has very good performance. We made a comparison on the performance (Balance and Collaborationcost) of the two models, as shown in Table 7. The option of performance Balance includes good, fair and poor; and the option of performance Collaborationcost includes high, medium and low.

Table 7.

Comparative result analysis (set ω₁ equals to 0.6).

		Global optimization model	Distributed and decentralized scheduling model
Over capacity	Balance	Good	Poor
	Collaborationcost	Low	Medium
Insufficient capacity	Balance	Good	Fair
	Collaborationcost	Low	High

Other discussions

In fact when applying the methodology proposed in this article to practice, more complex factors need to be taken into account. First, the available amount of MR/Cs is variable due to the different working modes and status, Figure 7 is an example of variable amount of MR/Cs. Second, the execution of task hardly overlaps, and the assignments of every task should be recorded in the R/C calendars, because when a new task arrives, its assignment will depend on those calendars. Moreover, the operational failures and errors will lead to perturbation of spread,³¹ and as a result, the associated tasks in the execution or pending queue will be reassigned and more calculation will be needed. In addition, some other factors like vendor inventory management (VIM), back order, will enrich the diversity of management and will affect the assignment based on calendars.

Figure 7.

Availability curve of MC services.

We define a matrix shown in equation (19) to present the MR/Cs’ utilization in the period of $t_{1}$ to $t_{n}$ , where $v_{ik}$ means the expecting amount used by ith virtualized MR/C instance(corresponding to ith task) at the time $t_{k}$ ; m means the maximal virtualized MR/C instances at a time. If at the time $t_{k}$ the MR/C is allocated to complete p tasks (p < m), we have $v_{qk}$ which equals to 0 (p < q < m)

V = [\begin{matrix} v_{11} & v_{12} & \dots & v_{1 n} \\ v_{21} & v_{22} & \dots & v_{2 n} \\ ⋮ & ⋮ & \begin{matrix} ⋱ \end{matrix} & ⋮ \\ v_{m 1} & v_{m 2} & \dots & v_{mn} \end{matrix}]

(19)

The vector $[\begin{matrix} V_{1} & V_{2} & \dots & V_{n} \end{matrix}]$ presents the maximal amount of the MR/C at different time from $t_{1}$ to $t_{n}$ (the values are not a constant because of different working modes and status). $a V_{k}$ shown in equation (20) is the remaining amount. So far, we can get the fitted curve of availability named $Curv e_{avail} (t)$ . When a new task arrives, the curve can be directly used without calendar traversal; and when operating failures or errors happen, we only need to roll back and update the curve

a V_{k} = V_{k} - \sum_{j = 1}^{m} v_{jk}

(20)

We assume that we have N requests to an MR/C with their expecting amount $V o l_{req}$ , the First Come, First Served(FCFS) policy is involved to process those requests. We just discuss the manufacturing task with predictable time and designated windows. When $\int_{t_{start}}^{t_{end}} Curv e_{avail} (t) dt < Vo l_{req}$ , we can conclude that the task cannot be accomplished from $t_{1}$ to $t_{n}$ , and $A vail = 0$ if the task is not achievable with all measurements(overtime as an example) taken. Otherwise, the availability is calculated in equation (21)

A vail = \frac{\int_{t_{start}}^{t_{end}} Curv e_{avail} (t) dt}{t_{end} - t_{start}}

(21)

$Curv e_{avail} (t)$ should be updated once tasks are assigned using this article’s methodology. But the update method could be different such as uniform occupation and smallest fragmentation occupation. The inventory of service suppliers should be taken into account when the output is entity. Moreover, if it is off-season, the curve updating should be uniform; and if it is busy season, processing too much tasks at the same time should be avoided.

As we discussed, the factors like VIM, back order, will enrich the diversity of management. Regarding to the VIM, the ideal state of MR/C service consumers is zero stock and MR/C service providers also need to realize lead production as much as possible; regarding to the latter, if the back order is permitted, the production arrangement can ignore the time limit at some level. These factors can influence the calculation of Avail.

Conclusion and future work

CMfg integrates massive, widely distributed MR/Cs for sharing and operation, and it should optimally have multiple centers. Most related research assumes that there exists only one management center for all MR/Cs in a manufacturing cloud, but this could not satisfy the demands for high efficiency and QoS, especially in a large-scale manufacturing cloud. Although the multi-centric architecture in cloud computing is not rare (e.g. federated clouds) and its management methods are worth learning, there are lack of considerations for the constraint of limited MR/Cs and the cost of remote collaboration.

To address this issue, we first propose the architecture for the multi-centric management. The solution adopts a two-level scheduling strategy that combined the advantages of the centralized and the decentralized decision-making. Considering the special characteristics of manufacturing cloud and manufacturing tasks, we also put forward the methods to quantify the availability and the collaborative cost of MR/Cs systematically. Then, we propose a global optimization model for the MR/C allocation under the multi-centric architecture. The application example shows that the proposed methodology could achieve more balanced utilization of the MR/Cs and lower cost of the total collaboration, compared with the typical decentralized solution.

Our future works will be as follows:

Analyze the bound of the global optimization model, and do more research on the scalability of the problem;

Improve the flexibility of the optimized allocation which will select the most appropriate MR/Cs instead of the best performance ones to serve mass users;

Start with the reliability design for the disturbance processing and fault tolerance in the multi-centric management.

Footnotes

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work is financially supported by National Key Lab in Intelligent Manufacturing System Technology of Complex Product, Beijing Engineering Technology Research Center in Advanced Manufacturing System of Complex Product, the National 863 Plan (2015AA042101), the National Natural Science Foundation of China (grants 51522501 and 51475032), Beijing Natural Science Foundation (grant 4152032), Beijing Youth Talent Plan (grant 29201411) and the Fundamental Research Funds for the Central Universities in China.

References

Armbrust

Fox

Griffith

et al . A view of cloud computing. Commun ACM 2010; 53(4): 50–58.

Velte

Elsenpeter

Cloud computing, a practical approach. New York: McGraw-Hill, Inc., 2009.

Zhang

Wang

et al . Cloud manufacturing: a new service-oriented networked manufacturing model. Comput Integr Manuf 2010; 16(1): 1–7.

Zhang

Ren

et al . Further discussion on cloud manufacturing. Comput Integr Manuf 2011; 17(3): 449–457.

Zhang

Ren

et al . Typical characteristics, technologies and applications of cloud manufacturing. Comput Integr Manuf 2012; 18(7): 1345–1356.

Luo

Zhang

Tao

et al . A modeling and description method of multidimensional information for manufacturing capability in cloud manufacturing system. Int J Adv Manuf Tech 2013; 69(5–8): 961–975.

Thames

Rosen

et al . Towards a cloud-based design and manufacturing paradigm: looking backward, looking forward. In: Proceedings of the ASME 2012 international design engineering technical conferences and computers and information in engineering conference, Chicago, IL, 12–15 August 2012, pp.315–328. New York: American Society of Mechanical Engineers (ASME).

Greer

Rosen

et al . Draft—cloud manufacturing: drivers, current status, and future trends. In: Proceedings of the ASME 2013 international manufacturing science and engineering conference, Madison, Wisconsin, 10–14 June 2013, pp.1–10. American Society of Mechanical Engineers.

Tao

Ding

et al . Resources publication and discovery in manufacturing grid. J Zhejiang Univ Sc A 2006; 7(10): 1676–1682.

10.

Tao

Zhao

Yefa

et al . Correlation-aware resource service composition and optimal-selection in manufacturing grid. Eur J Oper Res 2010; 201(1): 129–143.

11.

Chang

Huang

Tsai

et al . Rapid access control on Ubuntu cloud computing with facial recognition and fingerprint identification. J Inf Hiding Multimed Signal Process 2012; 3(2): 176–190.

12.

Liu

. Geographic trough filling for internet datacenters. In: Proceedings of the IEEE 2012 INFOCOM, Orlando, FL, 25–30 March 2012, pp.2881–2885. New York: IEEE.

13.

Lin

Chai

. Research on key technologies of resource management in cloud simulation platform. In: Proceedings of the 23rd European modeling and simulation symposium, Rome, 12–14 September 2011, pp.508–515. Genova, Italy: DIPTEM University of Genoa.

14.

Tao

Zuo

et al . IoT-based intelligent perception and access of manufacturing resource toward cloud manufacturing. IEEE T Ind Inform 2014; 10(2): 1547–1557.

15.

Liao

et al . Representation and share of part feature information in web-based parts library. Expert Syst Appl 2006; 31(4): 697–704.

16.

Jian

Yan

et al . Aircraft tooling collaborative design based on multi-agent and PDM. Concurr Eng 2009; 17(2): 139–146.

17.

Barham

Dragovic

Fraser

et al . Xen and the art of virtualization. ACM SIGOPS: Oper Syst Rev 2003; 37(5): 164–177.

18.

Erl

SOA: principles of service design, vol. 1. Upper Saddle River, NJ: Prentice Hall, 2008.

19.

Tao

Zhang

Resource service management in manufacturing grid system. Hoboken, NJ: John Wiley & Sons, 2012.

20.

Nygren

Sitaraman

Sun

The Akamai network: a platform for high-performance internet applications. ACM SIGOPS: Oper Syst Rev 2010; 44(3): 2–19.

21.

Walley

Whitehead

It’s not easy being green. In: Welford

Starkey

(eds) The earthscan reader in business and the environment. 1994, pp.36–44. Association for Computing Machinery (ACM).

22.

Ranganathan

Foster

Simulation studies of computation and data scheduling algorithms for data grids. J Grid Computing 2003; 1(1): 53–62.

23.

Toptal

Sabuncuoglu

Distributed scheduling: a review of concepts and applications. Int J Prod Res 2010; 48(18): 5235–5262.

24.

Wooldridge

An introduction to multiagent systems. Chichester: John Wiley & Sons, 2009.

25.

Kamsu-Foguem

Noyes

Graph-based reasoning in collaborative knowledge management for industrial maintenance. Comput Ind 2013; 64(8): 998–1013.

26.

Potes Ruiz

Kamsu-Foguem

Noyes

Knowledge reuse integrating the collaboration from experts in industrial maintenance management. Knowl Based Syst 2013; 50: 171–186.

27.

Tao

Cheng

Zhang

et al . Advanced manufacturing systems: socialization characteristics and trends. J Intell Manuf. Epub ahead of print 18 February 2015. DOI: 10.1007/s10845-015-1042-8.

28.

Tao

Zhang

Venkatesh

et al . Cloud manufacturing: a computing and service-oriented manufacturing model. Proc IMechE, Part B: J Engineering Manufacture 2011; 225(10): 1969–1976.

29.

Tao

Laili

Liu

et al . Concept, principle and application of configurable intelligent optimization algorithm. IEEE Syst J 2014; 8(1): 28–42.

30.

Kennedy

Particle swarm optimization. In: Sammut

Webb

(eds) Encyclopedia of machine learning. New York: Springer, 2010, pp.760–766.

31.

Wang

Tang

et al . Distributed coordination scheduling technology based on dynamic manufacturing ability service. Comput Integr Manuf 2012; 18(7): 1563–1574.