Abstract
To meet the increasing demand for improving railway service quality while using the precious mobile resources reasonably and economically, this research proposes a hierarchical approach that integrates the model for constructing an efficient electric motor train unit circulation plan into the model for optimized timetable design without predefining timetable details. A simulated annealing algorithm for solving the timetabling (main) model is designed, in which the neighborhood system is pretreated. A special tree construction–based branch-and-bound algorithm is improved for solving the electric motor train unit circulation planning (sub)model. The results of the numerical experiment verify the effectiveness of the proposed method. Specifically, compared to the randomly generated initial solution, the optimal solution obtained by the proposed methods reduces the total travel time by 686 min and reduces the number of electric motor train units by 5. The number of electric motor train units needed by the proposed method is on average at least two less than the method that handles the problem in a sequential way. Railway operators can implement this approach for balancing the efficiency of timetable and the quality of electric motor train unit circulation plan within a reasonable computation time when scheduling trains in railways.
Keywords
Introduction
Train timetable and electric motor train unit (EMU) circulation plan are among the most important plans in the operation of high-speed railway systems. In the timetabling problem, the exact departure and arrival times of each train at each station along its route are determined. Certain operational rules such as track capacity limitation and achievement of the highest possible passenger service quality (e.g. travel time) should be considered. A number of studies related to this problem are available in various literatures. Caprara et al. 1 and Brännlund et al. 2 developed the Lagrangian relaxation–based heuristics to compute a timetable. The track capacity constraints were relaxed in order to make the problem to be handled more easily. Zhou and Zhong 3 proposed two solving algorithms: a branch-and-bound algorithm and a beam search algorithm to deal with the double-track train-scheduling problem with multiple objectives. Lee and Chen 4 presented an optimization heuristics to solve the train timetabling problem when considering the problems of train pathing. The method was able to deal with real-sized instances. Castillo et al. 5 proposed an optimization method to solve the train-scheduling problem for a double- and single-tracked bi-directional railway network. Some excellent techniques were incorporated into the method to speed up the optimization solution of the problem. Cacchiani et al. 6 modeled the problem of timetabling at a station. The model did well in considering both complex rules for correct junction traversal, and constraints on the maximum number of trains allowed for the simultaneous presence at a station. Chu et al. 7 studied the bi-objective optimal urban rail train scheduling. The presented method aimed at finding a good train timetable which achieves considerable service quality with as little operation cost as possible.
The EMU circulation planning problem needs to determine how to allocate the EMUs to the train trips with a given timetable under the condition that the EMUs should be maintained properly. Specifically, the circulation plan consists of a set of connection relationships between train trips. These connection relationships indicate which train trips each EMU should fulfill in each day and when/where to arrange the first-level maintenance for the EMUs. All of the EMUs are operated in a periodic manner according to the circulation plan. In railways, there are often several interrelated train routes. Each route corresponds to a part of railway line. It is more reasonable to utilize the EMUs throughout the interrelated routes than separating the EMUs into several groups and each for a different route (see, for example, Zhao et al. 8 ).
Due to the complexity of the timetabling problem itself, the EMU circulation plan is often ignored when finding an optimal timetable for trains. The computations of these two plans are handled in a sequential way. We refer to this method as the ordinary method in this article. The timetable is served as the input when finding EMU circulation plan. Peeters and Kroon 9 described a branch-and-price algorithm to deal with the problem of efficient circulation of train units within a certain scope of railway line given the timetable and the passengers’ seat demand. Alfieri et al. 10 presented a solution approach to determine the appropriate numbers of train units of different types together with their efficient circulation on a single line. The approach is based on an integer multi-commodity flow model. Cacchiani et al. 11 proposed a heuristics for the train unit assignment problem. The proposed method is based on the natural Lagrangian relaxation of a natural integer linear programming and turns out to be much faster in practice and still providing solutions of good quality. Abbink et al. 12 considered the train types and subtypes when dealing with the problem of allocating units of rolling stock to the train series, such that as many people as possible can be transported with a seat, especially during the rush hours. Shi et al. 13 designed a simulated annealing algorithm for the EMU circulation planning problem. The algorithm is based on the penalty function and 3-opt neighborhood structure for the circular permutation of all trains. Zhao and Tomii 14 introduced a probability-based local search algorithm for solving the connection of the trains and the generation of the maintenances of the problem after it was transformed into the traveling salesperson problem.
Besides the deterministic models for strategic scheduling of trains or circulation planning of the EMUs, there are also methods that focus on designing timetables or EMU circulation plans to handle the different types of disturbances, such as delays or any other unforeseen events occurring at an operational level.15,16 These methods are mainly based on robustness optimization or delay (disturbance) control. YL Wei et al. 17 proposed a powerful reliable controller to preserve system stability, accommodate the component faults, and maintain the preferable system performances using Markov chain to describe component failures commonly occurred in dynamical (fuzzy) systems. The various controlling approaches based on Markov models are able to handle different types of dynamic systems subjected to random abrupt changes in their structures.18–21
Since the timetable provides the premises for the EMU circulation plan, even a subtle change in the departure and arrival times of some of the trains would have a direct influence on the potential optimal EMU circulation plan. When the EMU circulation plan is found, the number of EMUs needed is determined accordingly. As shown in Figure 1, although the numbers of train trips in the timetable (a) and (b) are the same, the structures of these two timetables are different. As a result, the circulation plan in (a) consists of at least four circulation fragments (①-④). The circulation plan in (b) consists of at least five circulation fragments (①-⑤). A circulation fragment represents the train trips, connections of train trips, and first-level maintenance that an EMU should fulfill in one period (e.g. 24 h) of schedule. The whole circulation plan is formed by connecting all the fragments through the overnight connections. The number of the necessary EMUs is equal to the number of circulation fragments contained in a circulation plan.

Circulation fragments in two different circulation plans.
Therefore, the number of EMUs needed for fulfilling all the tasks of the train trips is not only related to how to build connection relationships between these train trips but also related to what the exact departure and arrival times of each train trip have been. It is reasonable to consider the EMU circulation plan when calculating the timetable for the train trips. The existing studies did not put enough emphasis on making a coordination optimization of the detailed timetable and EMU circulation plan. The particular interest of this article is to find an optimal timetable which is beneficial to efficiently utilize the EMUs. The main contribution of this article is that we propose an approach to optimize the timetable and EMU circulation plan in a unified way. The proposed hierarchical approach integrates the construction of an efficient EMU circulation plan into the optimized design of the timetable under the condition that the timetable details (all departure and arrival times, train order, the stations for overtaking operations, etc.) are not predefined. Furthermore, the algorithms for solving the main model and the sub model are efficiently designed.
The structure of this article is organized as follows: section “Problem illustrations” first describes the conceptual overview of the hierarchical framework proposed for the coordinate optimization of timetable and EMU circulation plan. Then, the space-time-network-based problem illustration is presented. Section “Formulation of the coordinate optimization framework” first defines the notations of the parameters and variables used in the proposed framework. Then, the main timetabling model and the EMU circulation planning submodel are formulated. Section “Solution method for the coordinate optimization framework” describes the procedure of the algorithm for solving the main model which is incorporated with the algorithm for solving the sub model. Section “Numerical example” presents the comparison of the results of the experiments with the proposed method and the corresponding methods in contrast. Section “Conclusion” summarizes the proposed method and mentions the future tasks of this research.
Problem illustrations
Conceptual hierarchical framework
There are two types of objectives in the problem of coordinate optimization of timetable and EMU circulation plan. First, a timetable with optimal efficiency should be found. Second, the quality of the EMU circulation plan calculated based on the found timetable is also optimal among those calculated based on other timetables. Although these two objectives are integrated in the single optimization problem, the variables used for the evaluation of the EMU circulation plan are separated from those variables used for the construction of the optimal timetable (as shown in Figure 2). The determination of these two types of variables is relatively independent. At the same time, the decision space of finding the optimal EMU circulation plan is determined by the decision space of finding the optimal timetable. Therefore, the computations in the proposed framework for coordinate optimization of the timetable and EMU circulation plan naturally consist of a timetabling model and an EMU circulation plan calculation model. The former is the main model and the latter is the submodel.

Framework for coordinate optimization of the timetable and EMU circulation plan.
Specifically, the computation process of the framework could be summarized as follows: at each iteration of the main model, a feasible timetable is generated without predefined details; the quality of the circulation plan based on the iterative timetable is evaluated through the submodel; the total objective value of the iterative timetable is compared with the best timetable found so far, and the best timetable is possibly updated accordingly; Finally, the balanced optimal timetable and the EMU circulation plan are found.
Space-time-network-based problem description
For each train with a certain physical route on the railway network, a space–time network is associated with it. The space–time network of each train consists of several vertex sets and three types of directed arc sets. The position of each vertex set on the vertical axis corresponds to the station along its route. Each vertex set consists of linearly arranged vertices which are parallel to the horizontal axis and represent the discretized timestamps that the train could depart from or arrive at the station. The first type of arc is the virtual arc. All of the departure vertices at the first station and the arrival vertices at the last station are connected with an artificial source vertex
Given a topology of a railway network where the railway is double track, a multi-graph could be constructed by combining the space–time network associated with each train to be scheduled (as shown in Figure 3). Let the connection arc denote the possible connection relationship between trains at a station. There are also train connection arcs in the multi-graph between each pairs of departure trains and arrival trains at stations such that the duration between the arrival time of a train and the departure time of another train is at least equal to the minimum connection time. The direction of a connection arc is from the arrival train to the departure train which means that after fulfilling the train trip and arriving at the station, the EMU will continue to fulfill the next train trip departing from the same station. The minimum connection duration of EMU at a station is the necessary criterion of the EMU dwelling time for performing the passengers’ alighting/boarding, EMU cleaning, and servicing activities before starting the next train trip.

Space–time multi-graph of trains.
For clarity purposes, Figure 3 only shows the example of the selected path and some possible range of candidate paths associated with the selected path for each train. Then, the problem of the coordinate optimization of the timetable and the EMU circulation plan aims to find a feasible path for each train and a set of connection arcs linking all the trains in the space–time multi-graph under the condition that the efficiency of the timetable and the quality of the EMU circulation plan are balanced. In order to generate a feasible circulation plan, when selecting the connection arc, the cumulative running time/distance of EMU should be considered. The cumulative running time/distance of EMU could not exceed the upper bound. Otherwise, maintenance must be arranged (in advance) for the EMU at the stations linked with an EMU depot. In case of arranging maintenance only, the connection arcs whose durations are longer than the necessary maintenance time can be selected. For example, in Figure 3, stations 1 and 2 are linked with an EMU depot. The maintenance can be arranged at these stations in the circulation plan.
Formulation of the coordinate optimization framework
Notations of parameters and variables
The parameters include the following: K is the set of trains;
The decision variables include the following: the binary variable
Formulation of the main model
The main model in the framework selects the set of space–time paths for each train under the condition of satisfying the safety rules. The timetable constructed based on the selected paths should have an optimal efficiency. The optimal quality of the circulation plan is another objective used to evaluate the obtained timetable. But the detailed selections of the train connection arcs and calculation of the objective value of the optimal circulation plan are realized through the solving of the submodel. Specifically, the main model is formulated as follows
Subjected to
Objective function (1) contains three components. The first component is the total travel time. The reduction in travel time would improve the service quality of the railway enterprise and the satisfaction of passengers. The second component is the total mean variance of intervals of each adjacent train departure times at each station
Else
Constraint (2) indicates that only one arc can be selected among those leaving the virtual starting vertex for each train. Constraint (3) is the flow balance constraints, which ensures to select a path for each train in its space–time network. Constraint (4) imposes that there are no conflicts between the space–time paths of trains. Constraint (5) imposes that the dwelling time of each train at each station should not be less than the lower bound. Constraint (6) imposes that the total extra waiting time of medium-speed trains due to being overtaken by high-speed trains should be within a certain scope. The extra waiting time is related to the utilization of the capacity of the route and can be determined through experimentation. This group of constraints could also be used as a supplement to the first component of the objective.
Formulation of the submodel
On the basis of the variables
Subjected to
The objective function (9) includes two components. The first component is the total connection time of all the trains. Then, the total number of EMUs
The second component is the times of the maintenance in the circulation plan. Constraints (10) and (11) impose that there is only one connection arc entering and leaving each train. Constraint (12) imposes that the connection arcs should be selected from the candidate arcs between trains. Constraints (13) and (14) impose that the accumulated traveling mileage and traveling time for an EMU after maintenance should be less than the corresponding upper bound. Intermediate variable W indicates the set of maintenance arcs. Intermediate variable
Solution method for the coordinate optimization framework
It can be seen from each of the model that they are both complicated integer programming problem. In order to ensure the speed in solving the proposed framework, which integrates these two models, a simulated annealing algorithm is designed. It is embedded with an improved branch-and-bound method. The neighborhood of the tentative solution when solving the main model in the simulated annealing process is predetermined, although the departure time of each train could be within the whole planning horizon. In fact, the experiment will show that this neighborhood designing mechanism has a great impact on ensuring the quality of the obtained solution. The procedure of the algorithm is as follows:
Step 1: initialization—set the initial temperature
Step 2: according to the number of elements in the set of high-speed trains
where
Step 2.1: for the tth train
Since the high-speed train has the highest rank and could not be influenced by other trains, the remaining part of the path of train
Step 2.2: for each train
Step 2.3: if the current section
Step 2.4: if
(a) The interval between
(b) The interval between the arrival time of train k and the arrival time of train

Three conditions for the overtaking.
In condition (b), since train k is currently not sure whether it needs to dwell at station
(c) The arc of train k and the arc of train
Step 2.5: update the temporary variable
Step 2.6: if
where
Otherwise, the corresponding arrival vertex and departure vertex of the arc to be selected for train k are determined as follows, and go to step 2.7
Step 2.7: update the temporary variable
Step 2.8: if the whole space–time path of train
where
If the space–time networks of trains in the multi-graph are abstracted into another kind of vertices, then the multi-graph could be transformed into a connection network. Figure 5 shows the connection network transformed from the multi-graph shown in Figure 3. Given a network with

Connection network transformed from the space–time multi-graph.
It can be seen from Figure 5 that the problem of calculating the EMU circulation plan is equal to the problem of finding a DCM1-T in the connection network such that the in-degree and out-degree of each vertex in the DCM1-T are equal to 1 and that the maintenance is taken into consideration.
Step 3: construct the connection network of the set of the train K. Set the maximum times of iteration of calculating the EMU circulation plan to be
Step 3.1: if
Since the problem of finding DCM1-T is NP-complete (see, for example, Alexandre and Abilio 24 ), the DCM1-TG algorithm is able to find an approximate lower bound for the EMU circulation plan. The circulation plan obtained by the DCM1-TG algorithm could not be feasible because the maintenance is not considered. In order to find an optimal feasible circulation plan, an improved branch-and-bound method is applied as follows
Step 3.2: according to the set of
where
Call the DCM1-TG algorithm on the basis of the set
Step 3.3: if
Step 3.4: if the circulation plan corresponding to
Step 3.5: if
Step 3.6: Update
Step 3.7: backtrack to the previous level—update
Step 3.8: update
Step 4: a new complete random solution of the framework
If
Step 5: if
Step 6: if the iteration termination condition, which could be controlled through the annealing temperature, is satisfied, the algorithm stops. Otherwise, update the temperature
Numerical example
To evaluate the effectiveness of the proposed method, numerical experiments are conducted on a corridor (shown in Figure 6). This corridor could also be viewed as a simple railway network since the train routes on this corridor are interrelated. There are totally six different train routes and there are both high-speed trains and medium-speed trains on each route. The hollow vertices represent the stations at which a high-speed train is scheduled to stop. The solid vertices represent the stations at which a medium-speed train is scheduled to stop. The shaded vertices represent the stations at which both high-speed train and medium-speed train are scheduled to stop.

Train routes in the basic test case.
Table 1 shows the data of the train operation scheme used in the basic experiment.
Train operation scheme for the basic test case.
Stations BJN, QFD, BBN, and SHHQ are able to fulfill the task of first-grade EMU maintenance. Other basic parameters for the construction of the EMU circulation plan are as follows:
When the total extra dwelling time of medium-speed trains generated due to being overtaken by high-speed trains and the total extra overtaking are viewed as penalties of the objective function of the main model, and the results of the initial solution and optimal solution obtained in the solving process of the proposed method are listed in Table 2. I-NLDT (O-NLDT) means the number of trains with long dwelling time in each route in the initial solution (optimal solution). I-R-NLDT (O-R-NLDT) means the ratio of I-NLDT (O-NLDT). I-NEO (O-NEO) means the number of trains with extra overtaking in the initial solution (optimal solution). I-R-NEO (O-R-NEO) means the ratio of I-NEO (O-NEO). I-HD means the initial solution’s homogeneity of the departure times of trains at the first stations. O-HD means the optimal solution’s homogeneity of the departure times of trains at the first stations.
Results of the initial solution and optimal solution in the basic case.
It can be seen from Table 2 that the proposed method is able to filter the solutions with excessive overtaking or dwelling time due to being overtaken. The iteration process of the proposed method is shown in Figure 7. The total connection time reduces from 6.3859 × 104 min of the initial solution to 5.7345 × ×104 min of the optimal solution. The total travel time reduces from 4.2701 × 104 min of the initial solution to 4.2015 × 104 min of the optimal solution. The corresponding number of EMUs needed reduces from 74 of the initial solution to 69 of the optimal solution.

Iteration processes of different solution methods for the basic case.
Figure 7 also makes comparison between the solving process of the proposed simulated annealing with managed neighborhood of first departure times (MD-AS) and the solving process of the general simulated annealing (AS) in which the neighborhood is unlimited. Obviously, the neighborhood of MD-AS is more reasonable than that of the AS.
Figure 8 displays the results of the branch-and-bound method obtained from some intermediate solutions in the process of solving the main model when the number of branches is set to be 4, 5, and 6. It can be seen from the figure that there are obvious differences between the results when the number of branches varies. Therefore, the searching space is broadened in the improved branch-and-bound method.

Results of the branch-and-bound method when the branching number varies.
It is worth noting that the scale of the test case (e.g. the number of trains or the mileage of the railway) in our article is close to the real-world situations. The running time of the proposed approach in the experiment is about several hours. Specifically, the path selection at each iteration of the main model is within a few seconds. The construction of the EMU circulation plan for each iterative timetable is within a few minutes. Since the timetable and EMU circulation plan are obtained at a strategic level, the computation time of the proposed approach is reasonable for implementation in the practical applications.
Figure 9 displays the total connecting time of the intermediate points in the process of solving the second experiment case. The second case is constructed by reducing the number of trains in the basic case. Other parameters remain the same. Three types of results are shown for each intermediate point. The first type is like the tight connection. It can be seen from this type of results that when the connection relationships between trains are constructed by some heuristic rules, then the total connection time could be too long. The second type is the previous branch-and-bound method. The third type is the improved branch-and-bound method. It can be seen from the figure that the improved branch-and-bound method has obvious potential to further enhance the quality of the optimal circulation plan for each intermediate point. This feature of the improved branch-and-bound method plays an important role in helping the proposed framework find a balanced optimal solution.

Comparison between the connecting times for intermediate solutions.
Other additional cases are constructed by reducing different number of high-speed trains and medium-speed trains in the basic case. Figure 10 makes the comparison between the total connection times of the random initial solution, the optimal solution obtained by the proposed method, and two optimal solutions obtained by the ordinary method. The ordinary method indicates that the optimal timetable and EMU circulation plan are found in a separated way. Specifically, the timetable with optimal efficiency is first obtained, and then the optimal EMU circulation plan is calculated based on this optimal timetable. Figure 11 makes the comparison between the total travel times of the random initial solution, the optimal solution obtained by the proposed method, and two optimal solutions obtained by the ordinary method. The connection times of the two solutions obtained by the ordinary method are calculated based on two optimal timetables whose efficiencies are nearly the same. It can be seen from Figures 10 and 11 that for each additional case, even if the total travel times of different timetables are close to each other, the connection times calculated based on them could have great difference. Figure 12 makes the comparison between the total objective values of the solutions of each case.

Comparison between total connection times for different cases.

Comparison between total travel times for different cases.

Comparison between total objective values for different cases.
As seen from Figure 12, in the tests conducted for different cases, the total objective values of the output solutions of the proposed approach are on average 5.32% better than the solutions of the ordinary method and 8.87% better than the randomly generated initial solution. It indicates that the proposed method has a positive influence on the coordinate optimization of the timetable and the EMU circulation planning than the method that handles the problems in a sequential way.
Table 3 lists the total circulation times (TCT) of the randomly generated initial solution, the optimal solution obtained by the proposed framework for coordinate optimization of the timetable and EMU circulation plan (FCO), and two solutions obtained by the ordinary method (OM) for each case. According to TCT and equation (18), the number of EMU (NEMU) needed to fulfill all the train tasks for each case could be calculated. Table 3 also makes comparison between the NEMUs of different solutions. According to the results, the NEMUs of the solutions of the proposed approach is up to five (at least three) less than the initial solutions and up to four (at least one) less than the solutions of the method that handles the problems in a sequential way. It indicates that the proposed method is able to help the railway system save the EMUs which are expensive equipment.
Comparison between TCTs and NEMUs.
TCT: total circulation times; NEMU: number of electric motor train unit.
Table 4 lists the connection times of the DCM1-T, the initial feasible circulation plan (IFC) the optimal circulation plans obtained by branch-and-bound method, and the improved branch-and-bound method corresponding to different solutions of each case. INS represents the initial solution. It can be further seen from the results that the improved branch-and-bound method has stronger strength in finding an optimal circulation plan that is equal to or much close to the connection time of DCM1-T than the basic branch-and-bound method. Specifically, the connection times of solutions obtained by the proposed method is on average 1.13% more than the corresponding DCM1-Ts, on average 22.24% less than the solutions obtained by a tight-connection-based heuristic method, and on average 2.73% less than the solutions obtained by the original branch-and-bound method. It is worth mentioning that since DCM1-T does not consider the maintenance of the EMUs, the connection time of DCM1-T provides an approximate lower bound of the circulation planning problem as the submodel has described.
Comparison between the connection times corresponding to different solutions of each case.
DCM1-T: Degree Constraint Minimum 1-Tree; IFC: initial feasible circulation plan; B&B: branch-and-bound method; M-B&B: improved branch-and-bound method.
Conclusion
In this article, a hierarchical framework is proposed for the coordinate optimization of train timetable and EMU circulation plan. The framework integrates the problem of timetabling and EMU circulation planning and consists of a main model and a submodel. The timetabling main model aims at finding an optimal timetable with balanced efficiency and the quality of the corresponding EMU circulation plan. For each iteratively generated timetable, the efficiency is computed directly in the main model and the quality of the EMU circulation plan is evaluated through the submodel.
A simulated annealing–based method which is embedded with an improved branch-and-bound algorithm is proposed for solving the problem. As the quality of the optimal solution of the proposed framework is not only related to the performance of the algorithm of the main model but also related to the performance of the algorithm of the submodel, each algorithm is designed effectively. The neighborhood system of the simulated annealing process is predetermined when solving the main model. This strategy helps the algorithm ignore the obviously unreasonable region when finding an intermediate solution. The improved branch-and-bound method enhances the capability of searching the optimal circulation plan in a wider space.
The numerical experiments show that for different test cases, the proposed method doses well in finding a timetable which not only has the optimal efficiency but also provides a good basis for the constructing of an optimal circulation plan. Furthermore, the improved branch-and-bound method is able to ensure that the circulation plan of each intermediate timetable reaches its approximate lower bound.
In our future research, we will extend the proposed framework that incorporates the robustness into the output solutions. How to preserve the stability of the timetable and the EMU circulation plan against the stochastic disturbances (e.g. Markov-type disturbances) needs to be further studied.
Footnotes
Academic Editor: Hamid Reza Karimi
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was financially supported by the Science and Technology Research and Development Program of China Railway Corporation (KTD16011531).
