Abstract
Research on multitask scheduling systems in factory environments is a popular topic in the field of intelligent manufacturing. Existing research mainly focuses on the optimization of automated guided vehicle (AGV) path planning and scheduling, emphasizing on the minimization of conflicts and deadlocks, multi-objective task scheduling, and metaheuristic algorithm optimization, while ignoring path stability and real-time path planning in dynamic environments. Therefore, this paper aims to address these issues to better handle dynamic changes in actual operating environments. This paper establishes a mathematical model with the optimization objective of minimizing the overall running time of material distribution tasks and proposes an improved ant colony algorithm to optimize the model. First, the concept of prior time is introduced to improve the traditional ant colony algorithm. The path of the ongoing task is introduced with a time calculation, and the occupancy time window of each grid point on the path is calculated. Based on this, the initial pheromone distribution on subsequent paths is altered dynamically, which accelerates the convergence of the ants to a collision-free path. Second, in the pheromone update stage, the method of calculating the pheromone increment in the traditional ant colony algorithm is modified. The original distance influence factor is changed to a time influence factor, which ensures that all tasks still have the minimum running time when calculating a collision-free path. Finally, through 30 sets of simulation experiments on material distribution tasks, it is shown that the proposed algorithm shortens the total running time by 15.14%, 12.87%, and 10.59% compared to two ant colony algorithms and one strategic multi-AGV scheduling algorithm, respectively, thus verifying the effectiveness of the proposed method.
Keywords
Introduction
In the context of warehouses and manufacturing plants, the transportation of materials and goods is crucial. 1 In this context, the unique industrial capabilities of AGVs have been demonstrated. 2 Through effective scheduling of AGVs, efficient transportation of materials and goods can be achieved, greatly improving production efficiency and reducing distribution costs. 3 This facilitates the unmanned transfer of materials in the factory production process, which is also an important way to achieve intelligent manufacturing.
In practical industrial production, a multi-AGV scheduling system for material transfer is an automated system that collaboratively manages multiple AGVs to jointly complete a series of relatively independent, complex, and difficult production processes that cannot be organized through traditional assembly line processes. Coordinated scheduling problems of multiple AGVs (CSMAs) have received close attention in the field of intelligent manufacturing research due to their complexity. In CSMAs, different AGVs undertake various transportation tasks and need to travel efficiently between different starting points to meet the actual needs of factory material transportation. 4 At the same time, these AGVs need to have the minimum time cost and the highest transportation efficiency. When faced with complex transportation tasks, staggered road networks, and numerous AGVs, a series of complex problems, such as AGV road selection and collision prevention, are encountered, which make it more difficult for a multi-AGV multitask scheduling system to achieve good performance and affect its efficiency and practicality. 5
In response to these problems, scholars worldwide have conducted extensive research. Deng et al. 6 effectively addressed the AGV collision and waiting time occupancy problems by introducing an improved genetic algorithm. Zhong et al. 7 proposed a mixed integer programming model for multidevice integrated scheduling that incorporates path optimization, integrated scheduling, and minimization of conflicts and deadlocks. Jiao 8 transformed the multi-AGV problem into an occupancy-based multivehicle path planning problem and applied a particle swarm algorithm to effectively solve the AGV scheduling problem. In addition, Ulusoy et al. 9 attempted to solve the collaborative scheduling problem for machine tools and AGVs in flexible manufacturing systems by combining time windows and genetic algorithms. Udhayakumar et al. 10 designed a hybrid algorithm for the FMS manufacturing environment with the optimization objectives of AGV load balancing, minimum travel time, and maximum utilization rate. This algorithm combines genetic algorithms and ant colony algorithms to solve the multi-objective task scheduling problem for AGVs. Zhao et al. 11 developed a multi-AGV scheduling algorithm based on predictive collision avoidance, using the A* algorithm for path planning and providing strategies for dealing with different types of collisions.
Among the numerous algorithms for solving scheduling problems, ant colony optimization (ACO) is an intelligent optimization method that simulates the foraging behavior of ants in nature. It has the advantages of positive feedback, parallel computation, high robustness, and multiple adjustable parameters, making it an effective method for solving multi-task scheduling problems. 12 In particular, in multi-task scheduling problems, the ACO algorithm can quickly converge to an optimal solution through its positive feedback mechanism. Its parallel computing feature allows multiple agents to search the solution space simultaneously, effectively improving the computational efficiency. Despite its strengths, the traditional ACO algorithm still has some notable drawbacks. First, its convergence speed is relatively slow, especially when solving large-scale complex problems, where the convergence time can be quite long. Second, the ACO algorithm is prone to getting stuck in local optima, stagnating near local optimal solutions and making it difficult to explore the global optimal solution. To address these issues, many scholars have proposed improvement strategies from different perspectives, including adjusting the algorithm structure, optimizing parameter selection, and determining the initial pheromone distribution. In terms of algorithm structure, Dorigo et al. 13 proposed an innovative metaheuristic ant colony optimization method that provides a general algorithm framework for solving complex problems. Relevant studies have also been conducted in China. For instance, Wu et al. 14 introduced rollback and death strategies to reduce the impact of ineffective pheromones on ant colony evolution, optimized the state transition rules, and improved the pheromone composition structure. Liu and Zhang 15 applied the elite strategy and the max-min strategy to the basic ant colony algorithm. By updating the pheromones of elite ants, they successfully balanced the solution speed and optimization ability. From the perspective of parameter optimization, Sahu et al. 16 considered using the length and usage time of the path traveled by the ants to calculate the pheromone increment for autonomous navigation tasks with humanoid robots. They used both static and dynamic path planning to verify the efficiency of this method in path planning problems. Akka and Khaber 17 adopted a new pheromone update rule and dynamically adjusted the evaporation rate to accelerate the convergence speed of the algorithm and expand the search space, thereby preventing premature local optimization. Zhu et al. 18 combined the artificial potential field method with ant colony algorithm, improving the slow convergence speed issue of the ant colony algorithm through the enhancement of heuristic functions and adaptive induced heuristic factors. Xiao et al. 19 combined the dynamic window method with ant colony algorithm and constructed an evaluation function based on an improved dynamic window algorithm, which enhanced the local optimization ability of the dynamic window algorithm. They also developed an efficient fusion algorithm to avoid the limitations of the two algorithms in solving the path. From the perspective of initial pheromone distribution, Ding et al. 20 incorporated the path information obtained from genetic algorithms into initial pheromones, achieving satisfactory optimization results. Chen et al. 21 utilized the guidance effect between a series of jump points obtained by the leapfrogging search algorithm to enhance the early convergence and reduce the search time issues of the ant colony algorithm. Wang et al. 22 used the initial path obtained from the artificial potential field method as heuristic information. Based on the zero theorem, they assigned different initial pheromones to grid points with different characteristics and set an iteration threshold to achieve a good global search ability. Deng et al. 23 used the initial results of particle swarm optimization as the initial pheromone distribution of the ACO algorithm and utilized the parallelizability between ant colony algorithms to achieve efficient parallel search among ants via distributed technology.
However, existing algorithms typically modify the initial pheromone distribution based on specific factors only once. This fixed distribution strategy is difficult to dynamically adapt to environmental changes during the path planning process, leading to decreased path stability and increased risk of conflicts. Moreover, they fail to adequately consider the cooperation between different vehicles, especially in dynamic environments, which limits the algorithm's flexibility and real-time performance, potentially causing path delays and inefficiencies. 24 To overcome these shortcomings, this paper proposes a strategy that dynamically alters the initial pheromone concentration distribution in the subsequent exploration stages based on prior information. This strategy improves the stability and real-time performance of path planning by dynamically adjusting the pheromone distribution through real-time monitoring of path occupancy times. In addition, the distance influence factor in the heuristic function of the pheromone concentration update stage is changed to a time influence factor so that when calculating collision-free paths, the method can still fully consider time factors to obtain paths with the shortest running time.
The remainder of this paper is organized as follows. Section ‘Problem Description’ introduces the model of the material distribution scenario and establishes a mathematical model for the scheduling system. Section ‘Algorithm Design’ first introduces the basic process of the traditional ant colony algorithm, then introduces the improved pheromone update method and the heuristic function of the improved ant colony algorithm proposed in this paper, and finally gives the basic process of the improved algorithm. Section ‘Simulation Test and Analysis’ first validates the performance improvement of the proposed improved ant colony algorithm through specific task groups that can cause space–time conflicts, and then specifically analyses and compares the total completion times of the proposed algorithm with two ant colony algorithms and one strategic multi-AGV scheduling algorithm under 30 task scenarios. Section ‘Conclusion’ summarizes and concludes the research.
Problem description
Problem scenario model
In an intelligent warehousing system, AGV cars are mainly responsible for material handling in the factory area. For a single AGV multitask scenario, it is only necessary to consider the starting point and the ending point of each task, calculate the shortest path from the starting point to the ending point, and complete the tasks in order. However, for multi-AGV multitask scheduling optimization, the situation becomes more complex and conflicts between vehicles need to be considered.
To quantify this problem, we first divide the factory area to establish a grid map model, as shown in Figure 1. The areas that vehicles cannot reach are represented by black grid cells.

Factory grid map model.
This grid map model divides the factory area into 20 × 20 units, where a certain number of AGVs are responsible for material handling tasks. When a task is assigned to an AGV car, it will calculate the optimal path based on the task starting point and ending point, and then transport the goods along the optimal path to the task ending point and wait for the next required task. In the case of multiple tasks assigned to multiple AGVs, space–time conflicts may occur when the calculated paths of different AGVs occupy the same grid unit at the same time. This will lead to conflicts between AGVs, forcing other AGVs to stop and wait to avoid collisions, thus it should be avoided as much as possible in practice. However, for a fixed map size, the more AGV cars there are, the more complex the transportation task becomes, and the higher the probability of AGV space–time conflicts, which affects the normal handling efficiency of AGVs.
Mathematical model
To achieve efficient collaborative scheduling of multi-AGV under multitask conditions in factories, it is necessary to establish the corresponding mathematical optimization model. The following are the basic assumptions of the model:
AGVs cannot run along the edge of the grid or into an occupied area of the grid but can only travel from inside the grid to inside the grid. Each AGV can only transport one object at a time. All AGVs run at the same speed. The acceleration during the starting phase and the deceleration during the stopping phase of the AGV are not considered. The times taken for AGV vehicles to load and unload goods are not considered. If the AGV moves diagonally, it will occupy two adjacent grids simultaneously, and the occupation time will be the time required to completely leave the current grid unit. When the AGV changes its direction of travel, due to its structure, it can turn on the spot with a turning radius of zero. AGVs can reach adjacent unoccupied grids from each grid and can reach up to eight adjacent grids.
Based on the problem scenario model established earlier, let the driving speed of the AGV be v, the side length of each grid point be r, and the length of the AGV be
Figure 2 illustrates four scenarios representing the different ways a vehicle can move from the previous grid into the current grid and subsequently enter the next grid. These scenarios correspond to four distinct occupancy times for the vehicle in the current grid, as well as the additional grids occupied. These also apply to motions in opposite directions, that is, for the movement of the vehicle in the reverse direction, the corresponding occupancy times and extra occupied grids remain the same. The grid occupation time

Vehicle routes at different locations.
From Table 1 and Formulas (1)–(4), we can calculate the occupancy time
Time taken for different routes.
That is, the longest completion time of the AGV is minimized while satisfying the constraints (1)–(4) and the basic assumptions of the model.
Algorithm design
Considering the complexity of the problem model established in the previous section, especially the need to simultaneously handle space–time conflicts between vehicles when solving this optimization problem, this undoubtedly increases the difficulty of solving the problem. The ant colony algorithm has demonstrated significant advantages in solving such problems through distributed collaborative optimization. Based on this advantage, we use the ant colony algorithm as our optimization method and introduce the concept of prior time to effectively improve the algorithm.
Traditional ACO algorithm
The basic idea of the ACO algorithm is to use the foraging behavior of ants as a neighborhood search strategy and optimize the search process of the strategy by simulating the behavior of ants releasing pheromones. Specifically, an ant releases pheromones into its surroundings as it searches the problem space. When other ants search, they will move in directions with high pheromone intensity to find better solutions.
Suppose there are K ants, and each ant needs to traverse different cities until it reaches the destination, here the cities correspond to the grid points in the factory grid map model. Let
When an iterative process is completed, all ants have reached the destination. To prevent the accumulation of pheromones in the wrong direction, it is necessary to introduce the evaporation factor
Formula (8) shows the pheromone update rule:
Prior-time based improved ACO algorithm
Although the traditional ACO algorithm performs well in many optimization problems, it shows limitations in the context of multi-AGV multi-task scheduling. First, it does not account for the time and space occupancy of vehicles during path planning, which can easily lead to space–time conflicts during actual operation, causing collisions or delays. Second, the traditional ACO algorithm primarily focuses on finding the shortest path, failing to adequately consider dynamic changes during task execution, such as real-time path adjustments to avoid potential conflicts. Additionally, the traditional algorithm tends to get trapped in local optima, making it difficult to find the global optimal path in complex multi-task environments.
To address the aforementioned issues of the traditional ACO algorithm in multi-AGV scheduling, this paper proposes an improved ACO algorithm based on prior time information. Unlike the traditional ACO algorithm, the improved version takes into account the operational status of other vehicles during the path planning process. By introducing the concept of prior time, the algorithm calculates and maintains the time windows for grid points occupied by each AGV on their current task routes. This prior information allows dynamic adjustments of the pheromone concentration distribution. When a grid point is already occupied by another AGV and may cause a time conflict, the algorithm will adjust the pheromone concentration on that path based on the severity of the conflict, thereby guiding the path planning of subsequent task-dispatching ants. Specifically, when calculating the path for the first task, the traditional ACO algorithm is used since there are no running vehicles on the map. However, when the algorithm performs path calculations for the
When calculating the path for a new task n, each iteration of the algorithm requires determining which grid points the ant can travel to from the current
In the pheromone update stage, the traditional ACO algorithm updates pheromones solely based on distance factors. The newly added pheromone content as shown in Formula (9) is determined by the pheromone constant Q and the travel distance
The steps of the prior-time based improved ACO algorithm are as follows:
Step 1: Initialize the task starting point
Step 2: Set the algorithm control parameters. Let the total number of ants be K and the maximum number of iterations be M.
Step 3: Start the
Step 4: Send the
Step 5: Calculate the adjacent grid points that ant k can travel to next, and compare them with the grid points occupied by the vehicles executing the tasks in
Step 6: If there is no same grid point, then calculate the path directly according to the traditional ant colony algorithm, and go to Step 8; Otherwise, calculate the entry time and departure time of the AGV at that grid point and compare them with the time in
Step 7: If there is a conflict, calculate the probability
Step 8: Calculate the state transition probability
Step 9: Check whether the selected next path point is the endpoint
Step 10: Increase k by 1. If
Step 11: Update the pheromone using Formula (14) based on the recorded grid occupancy time.
Step 12: Increase the current iteration number m by 1. If
Step 13: Output the path with the shortest grid occupation time and update the structure array
Simulation test and analysis
To verify the effectiveness of the proposed optimization model and solution algorithm for the multi-AGV multi-task collaborative scheduling system, a series of simulation tests were designed, covering multiple task sets, varying numbers of AGVs, and scenarios with potential space–time conflicts. By comparing the proposed algorithm with the traditional ACO algorithm, the improved ACO algorithm,
19
and the strategic multi-AGV scheduling algorithm which uses A* for path calculation,
11
the improvement in task completion time was quantified to validate the effectiveness of the proposed algorithm. The grid map of the factory area in Figure 1 was used as the experimental scene. The grid map was assigned values using a row-major order, meaning that the assignment starts from the first row, with the first grid cell assigned a value of 1, and each subsequent row has 20 more assigned grid cells to fully describe the terrain of the entire factory. MATLAB R2021a was used to write the algorithm code, and the algorithm was run on an Intel Core i5-11320H 3.20 GHz processor (16.00 GB RAM) with Windows 11 as the operating system. The algorithm parameters are set as follows: number of ants
Space–time conflict test
To demonstrate the optimization effect of the improved algorithm, two sets of tasks with intersecting paths in the map in Figure 1 were selected, as shown in Table 2.
Space–time conflict task list.
Two AGVs were employed, and the traditional ACO algorithm and the improved ACO algorithm proposed in this paper were used to calculate the scheduling path of each AGV, and a time axis was introduced for the two-dimensional calculation path, allowing a three-dimensional image to visually display the occupancy time at each grid point. The simulation results are shown in Figures 3 and 4.

Scheduling paths of the traditional ACO algorithm.

Scheduling paths of the proposed improved ACO algorithm.
By introducing a timeline, we can have a clearer understanding of the occupancy time of each grid point. As shown in Figures 3 and 4, the two paths calculated by the traditional ACO algorithm have a space–time conflict, resulting in vehicles needing to wait. In contrast, the proposed algorithm calculates nonconflicting paths for the two tasks, successfully avoiding waiting and reducing the time consumption of the tasks.
Figures 5 and 6 show the specific position change trends of the AGVs optimized by the two algorithms in the Y-coordinate direction and X-coordinate direction during the travel process. It can be seen that there is a crossing path in Figure 5(a) and (b) at 11.91 s–13.61 s; that is, the overlapping parts of the red line and blue line in (a) and (b) occur in the same time interval. In contrast, in Figure 6(a) and (b), there is no overlap of the red line and blue line in the same time interval. This result intuitively verifies the effectiveness of the proposed improved ACO algorithm in avoiding space–time conflict paths.

Changes in the AGV positions optimized by the traditional ACO algorithm. (a) Y coordinate change trend. (b) X coordinate change trend.

Changes in the AGV positions optimized by the proposed improved ACO algorithm. (a) Y coordinate change trend. (b) X coordinate change trend.
Multitask scenario test
To verify the optimization effect of the proposed algorithm under multitask conditions, we simulated 30 different sets of tasks
Simulation task list.
We tested and analyzed the impact of changes in the number of AGVs on the scheduling results. Using the traditional ACO algorithm, we adopted a strategy of waiting for the previous vehicle to pass when there was a space–time conflict. The scheduling task allocation scheme was as follows: first, all vehicles were assigned tasks separately, and each vehicle received only one task. Once a task was completed, the vehicle responsible for the task would accept the next unfinished task. Task dispatching continued until all tasks were completed. We conducted scheduling simulations for scenarios where the number of AGVs increased from 1 to 10 and determined the total time required to complete 30 dispatch tasks under different numbers of AGVs. The simulation results are shown in Figure 7.

Task completion times under different numbers of vehicles.
It can be seen that when the number of AGVs is greater than four, the decrease in the total task completion time gradually slows. This is because the increase in the number of vehicles will lead to an increase in space–time conflicts, thus in the case of 10 vehicles running simultaneously, the total task completion time does not decrease, but rather increases. We summarized the number of space–time conflicts generated under different numbers of AGVs, as shown in Figure 8.

Number of space–time conflicts under different number of vehicles.
Considering the relationship between the number of AGVs and space–time conflicts, the characteristic that the improvement of task completion efficiency gradually slows down or even deteriorates with the increase of vehicle numbers, and the impact of economic costs on AGV quantity selection, we selected five AGVs for testing, using the traditional ACO algorithm, the improved ACO algorithm, 19 the multi-AGV scheduling algorithm, 11 and the prior-time based improved ACO algorithm proposed in this paper. The parameters of each algorithm were set to the same values, and the optimization results are shown in Tables 4 to 7. The middle part of the table shows the task queue for each AGV, while the last column reflects the total time required for each AGV to complete the assigned tasks. This time includes the execution time of each task and the time required for movement between tasks. Here, the method used for calculating the path between tasks is the same as the method used for calculating each task. Through comparative analysis of the experimental results, we can see that both the improved ACO algorithm and the multi-AGV scheduling algorithm have shorter total task completion times than the traditional ACO algorithm under multitask conditions, while the prior-time based improved ACO algorithm proposed in this paper outperforms all three algorithms in improving task execution efficiency and reducing conflict rates.
Optimization results of the traditional ACO algorithm.
Optimization results of the improved ACO algorithm.
Optimization results of the multi-AGV scheduling algorithm.
Optimization results of the proposed algorithm.
The experimental results further show that in a typical scenario of 30 tasks and 5 AGVs, the proposed improved ACO algorithm achieves a total task completion time of 193.55 s, which is 15.14%, 12.87%, and 10.59% shorter than the 228.09 s taken by the traditional ACO algorithm, the 222.13 s taken by the improved ACO algorithm, and the 216.48 s taken by the multi-AGV scheduling algorithm, respectively. This is attributed to the consideration of the grids and time periods occupied by other running vehicles during the path calculation process of the proposed algorithm, as well as the adjustment of the pheromone update strategy. The traditional ACO algorithm and the improved ACO algorithm struggle with multi-AGV multi-task scheduling due to their inability to dynamically adjust path planning, which often leads to space–time conflicts and results in longer total task completion times. Although the strategic multi-AGV scheduling algorithm considers various AGV conflicts and their scheduling schemes, it also does not make dynamic path adjustments in advance based on these conflicts. In contrast, the proposed algorithm successfully avoids path conflicts and optimizes scheduling efficiency by introducing prior time and dynamic adjustment strategies, thus showing greater effectiveness and superiority in multi-task scenarios.
Conclusion
This paper investigates the collaborative optimization scheduling problem for multiple AGVs under multiple task conditions in factories and establishes a mathematical model for optimizing the scheduling system. In response to the potential occurrence of space–time conflicts during multitask execution, an optimization based on prior information is introduced on the basis of traditional ACO algorithms. By incorporating the time and space occupancy information of existing paths, the possibility of changing the pheromone concentration for subsequent path calculations is considered to determine whether it is necessary to detour to other paths, thus enabling dynamic adjustments. The introduction of the prior time concept enables the algorithm to dynamically perceive the occupancy status of other AGVs and adjust path planning accordingly to avoid space–time conflicts. This is a factor that existing ACO algorithms have not considered in multi-task scheduling. Moreover, existing ACO algorithms mainly optimize based on path distance, whereas the proposed algorithm in this paper incorporates a time influence factor. This ensures that path selection not only avoids conflicts but also minimizes vehicle travel time. These improvements enable the algorithm to generate paths that occupy less time when scheduling complex tasks.
Through simulation tests under multitask conditions, it is found that the proposed algorithm outperforms two ant colony algorithms and one strategic multi-AGV scheduling algorithm in avoiding space–time conflicts and improving scheduling efficiency, so it can better handle multitask scheduling problems in actual factories. Compared with the above algorithms, the proposed algorithm is more stable and reliable and can better meet the needs of practical applications. Despite these advantages, there are still some limitations in this study. First, the simulation tests were conducted on a simplified 20 × 20 grid map, which, while reflecting some issues present in real-world scenarios, does not encompass more complex factory environments. In more intricate scenarios, such as varying vehicle speeds and load capacities, the adaptability and effectiveness of the algorithm require further verification. Second, although the proposed algorithm has demonstrated high scheduling efficiency in the simulated environment, maintaining the real-time responsiveness of the algorithm and further integrating it with other optimization methods, such as deep learning, are major challenges that need to be tackled in future research when dealing with large-scale, frequently dynamically changing tasks.
Footnotes
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by Key R&D Program Project of Shandong Province (Grant No. 2024TSGC0012) and Shandong Province Central Guidance Local Science and Technology Development Fund Project (Grant No. YDZX2024125).
