Measuring temporal and spatial travel efficiency for transit route system using low-frequency bus automatic vehicle location data

Abstract

Although bus probe data have been widely adopted for examining transit route efficiency, their accuracy cannot be guaranteed in specific temporal and spatial segments where probe samples are inadequate. This study evaluates the feasibility of using automatic vehicle location data as probes for bus route travel time evaluation. Our techniques explore the minimum amount of transit automatic vehicle location data required to recover bus trajectories in various spatial–temporal dimensions along scheduled transit routes. First, a three-dimensional tensor is established to infer the link travel information that is not covered in the current time slots and the most recent short-term period. Then, a general form is proposed to calculate the local mean travel speed and the average link travel time in each time slot of the day. Finally, a case study is conducted using field automatic vehicle location data from buses running on a route corridor in Edmonton, Canada. Comparison with baseline approaches demonstrates the effectiveness and efficiency of low-frequency bus automatic vehicle location data as probes for measuring transit route efficiency. This work also supports the feasibility of using automatic vehicle location–equipped buses as customized buses that choose alternate paths based on the current transit efficiency of all routes.

Keywords

Transit reliability evaluation, bus probe, travel time estimation, trajectory reconstruction, tensor decomposition

Introduction

Estimating bus travel speed and travel time is important for transit service agencies to assess the performance of their operating units and to further improve schedules, route plans, on-time arrival reliability, and so on. Many solutions have been proposed for real-time vehicle speed and travel time estimation, for example, microwave detectors, radar detectors, camera sensors, and other devices deployed at fixed detecting locations.¹ However, because of the increasing cost, not all road segments are equipped with detecting sensors, and thus the range of these measurements is very limited.² Probe vehicle global positioning system (GPS) data are regarded as one of the most promising alternatives for transit travel time applications, especially now that transit vehicles are increasingly equipped with automatic vehicle location (AVL) systems that record bus positions along the route, which in turn provides a novel type of sensor for travel speed and travel time estimation. Unlike probe samples obtained from other sources, such as taxis, transit buses follow known routes and schedules, which enables a practical measurement of speed and travel time between two explicit locations.

Although using bus AVL sample data as probes offers a potentially practical way to obtain the transit travel speed and travel time on any route segment where GPS-equipped buses are active, the problem has not yet been well solved because of the following challenges: (1) road segments are traversed by transit buses only at scheduled times, so the sample size on each link is reduced significantly in some time slots, such as 2:00–4:00, which makes it difficult to examine the link travel time in every time slot; (2) transit AVL samples are reported fewer than two times per minute, and the AVL system produces erroneous and missing records when the GPS devices lose accuracy, which makes it difficult to infer the path actually traversed between two reported positions. These issues limit the usage of transit probes and call for more sophisticated methods for processing and recovering the missing transit samples.

In this article, we propose a general model for estimating the transit local time-mean travel speed and average link travel time based on bus AVL samples. Our technique estimates the transit travel speed field on consecutive sections along the bus route corridor in different time slots by constructing a three-dimensional (3D) tensor, which infers the missing speed information using the bus trajectory samples traversed in the current time slots and over the past time slots, as well as road network data. The contribution of this article is to estimate the dynamic time–space bus travel speed and travel time on scheduled transit routes by reconstructing the individual transit trajectories and allocating them into the temporal–spatial speed field. Our study also demonstrates that low-frequency transit probes can serve as a data source for effective travel time applications at minimum cost.

The remainder of this article is organized as follows. First, the most recent related studies are summarized and reviewed. Then, a general procedure is presented that identifies the set of calculations needed to implement the transit route travel time estimation. This is followed by sections presenting a case study, results and discussion, and conclusions.

Literature review

The use of probe vehicles for travel speed and travel time applications has recently attracted increasing attention from researchers seeking to implement more practical traffic applications.² However, the most significant problems are the sparse sample size and the low sampling frequency, which make it difficult to use probe samples to identify the segments traversed between sequential logged points. Especially in urban arterial areas, a low probe sampling frequency creates challenges in inferring the route traversed by the vehicle in the network, because more than two paths may exist between two recorded GPS locations.³–⁵

Considerable research on travel speed and travel time applications in urban areas has focused on improving estimation performance using low-frequency GPS probes, network geographic information, and other attributes. For example, Jenelius and Koutsopoulos³ proposed a model that integrates network patterns and historical traffic data and then examined the traffic flow on explicit road segments and the traffic delay at signalized intersections. The model can be solved in practice by maximum likelihood estimation. Fabritiis et al.⁶ improved link travel speed estimation by considering the traffic patterns on neighboring links, which have a short-term effect on the traffic status of the current segment. However, the model becomes more complex and less stable when scaled up to a large network area. In addition, the sparsity of the probe samples, that is, consecutive links not being completely covered by probes in each time slot, is a common issue that limits the usage of the model. Cathey and Dailey⁷ used individual transit bus probes to estimate travel speed based on a Kalman filter algorithm, which in turn provided the space-mean travel speed and link travel time. However, the proposed method was tested only with high-frequency probe data.

Other route travel time estimation methods allocate the travel time to each segment. Hellinga et al.⁸ divided the link travel time into three components: free-flow travel time, stopping time at traffic signals, and delay due to link congestion. If there are traffic signals along the link, signal-related delay is more likely to occur and is estimated based on a probability function. In large-scale network applications, the observed probe travel time was decomposed across the traversed route segments.⁹–¹³ Zheng and van Zuylen¹⁴ inferred the travel time of probe vehicles on traversed links with an improved multi-layer artificial neural network (ANN) model. Chakroborty and Kikuchi¹⁵ collected transit probe data on urban corridors to evaluate the sample size and data quality on each segment and demonstrated the possibility of using transit probe data for traffic monitoring purposes.

Although state-of-the-art technologies have been developed and the value of bus GPS data for various travel speed and travel time applications has been clearly demonstrated, several issues remain unresolved in the existing studies. On the one hand, large-scale travel time estimation requires a minimum probe sample size along the entire route in each time slot.¹⁶ The average historical link travel time cannot accurately represent the current situation when the traffic dynamics are complicated. On the other hand, how the travel speed and travel time of sparse samples on an individual segment depend on the traffic patterns of neighboring links has not been verified, which may lead to uncertain travel time estimates in variable traffic situations.¹⁷

Methodology

In this section, we set up a general scheme for transit route travel speed and travel time estimation based on bus AVL records. Figure 1 presents the algorithm framework, which consists of three major processes. The variables and notations for the major components and data attributes are represented by a vector $[n_{i}, t_{i}, l_{i}, T_{k}, r_{k}, s_{i}, v_{i}]^{T}$, where $[n_{i}, t_{i}, l_{i}]^{T}$ describes the transit AVL information, indicating the bus ID, GPS timestamp, and bus GPS location, respectively. The other parameters $[T_{k}, r_{k}, s_{i}, v_{i}]^{T}$ describe the transit route base map information, where $T_{k}$ denotes the $k$th time slot obtained by partitioning a day into equal time intervals, $r_{k}$ denotes the $k$th road segment, and $s_{i}$ and $v_{i}$ are the distance-into-trip and velocity-into-trip, respectively.

Figure 1.

Overall framework for transit route travel time estimation.

First, we project the raw bus AVL data (given as longitude and latitude) onto the referenced geographic road network by applying a path inference and map matching algorithm.¹⁸ Based on these processed sample data, the probe vehicle’s travel information can be examined on separate links at different times of day. A 3D tensor $A_{r}$ is then established to model this information; its three dimensions represent the road segment ID, time slot ID, and transit bus ID, respectively. Each entry in $A_{r}$ stands for the probe’s link travel speed in a time interval. If the day is divided into short time slots, the tensor $A_{r}$ tends to be sparsely filled with probe records. Moreover, transit bus locations are recorded only every 30 s on average, and some records are lost because of GPS signal failure. Second, to resolve the data sparsity problem, historical and current transit probe trajectories and route geospatial data are explicitly extracted from the transit AVL records and the road network data. The historical transit trajectories are described by another tensor $A_{h}$, which is factorized collaboratively with $A_{r}$ in order to fit the missing entries in $A_{r}$ (i.e. to fill in the travel information for the elements of $A_{r}$ without adequate probe records). Since the current bus travel time has features similar to those of buses that previously traversed the same and adjoining route segments, the historical tensor $A_{h}$ is integrated with the current tensor $A_{r}$ to increase the proportion of non-zero probe records, which in turn improves the accuracy of the travel time estimation. Based on the reconstructed tensor model, we are able to recover the travel time information over all road segments in the current time slot (stored in $A_{rec}$). Finally, we compute the average travel speed and travel time from $A_{rec}$ along the transit route and its query path.

Trip inference

The transit AVL system reports real-time vehicle locations based on the on-board GPS devices. However, these raw probe data cannot be used directly for travel time applications because the route length between two probe positions is unknown. The original sample data therefore have to be transformed into road-mapped samples, which transit AVL systems do not commonly provide. In this section, the trip inference algorithm is applied to extract the trip of each individual bus along the traversed route, as shown in Figure 2.

Figure 2.

Definition of road network and path shape points.

Figure 2 plots a sample pattern on a map of 23 Ave., Edmonton, Canada. The distance along the geodetic path carries the travel distance information. Given the vehicle AVL data and road network data, we can use the path inference algorithm to determine the transit route trip and the vehicle’s traveling distance-into-trip. A set of generic definitions is necessary to accomplish the path inference task. With reference to Figure 2, the notations are defined as follows:

Definition 1: probe sample. A probe sample $T_{r}$ is a group of raw probe records in time-ordered sequence, for example, $T_{r} : p_{1} \to p_{2} \to \dots \to p_{n}$, where each record $p_{i} = \{[n_{i}, t_{i}, l_{i}]^{T} \mid l_{i} = (lo_{n_{i}, t_{i}}, la_{n_{i}, t_{i}})\}$ consists of the bus ID, GPS timestamp, and bus GPS location (longitude and latitude), respectively.

Definition 2: path point interval (PPI). The PPIs form a dataset of road distances between successive connected reference points, which represent the geographical features (start points, end points, and points on curved sections) of a path.

Definition 3: probe trip. A probe trip is a group of processed probe samples matched with a set of traversed PPIs, for example, $r : r_{1} \to r_{2} \to \dots \to r_{n}$. The traveling distance of each individual sample is then the sum of the traversed PPIs. More precisely, a bus probe trip indicates an assignment of a transit vehicle to the scheduled route, which specifies the route length, the direction, and a set of positions of interest that the bus route passes through.

Definition 4: link travel speed. The parameter $v_{u, r, k}$ is defined as the travel speed of probe $u$ on the $r$th road segment during the $k$th time slot.
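The definitions above can be read as a small data model. The following Python sketch is one possible representation, not the data structures actually used in this study: the class name `ProbeRecord`, the helper names, the time unit, and the coordinate values are all illustrative assumptions.

```python
from dataclasses import dataclass
from itertools import accumulate


@dataclass(frozen=True)
class ProbeRecord:
    """One raw AVL record p_i = [n_i, t_i, l_i] (Definition 1)."""
    bus_id: str       # n_i
    timestamp: float  # t_i, seconds since midnight (assumed unit)
    lon: float        # longitude part of l_i
    lat: float        # latitude part of l_i


def build_probe_samples(records):
    """Group records by bus and keep each group in time order (probe samples)."""
    samples = {}
    for rec in sorted(records, key=lambda r: r.timestamp):
        samples.setdefault(rec.bus_id, []).append(rec)
    return samples


def cumulative_distance(ppi_lengths_m):
    """Distance-into-trip at each path point: running sum of PPI lengths (Definition 3)."""
    return [0.0] + list(accumulate(ppi_lengths_m))


# Made-up records and PPI lengths, for illustration only.
records = [
    ProbeRecord("bus_A", 60.0, -113.51, 53.45),
    ProbeRecord("bus_B", 45.0, -113.50, 53.45),
    ProbeRecord("bus_A", 30.0, -113.50, 53.45),
]
samples = build_probe_samples(records)
path_point_distance = cumulative_distance([120.0, 80.0, 200.0])
```

Here `samples["bus_A"]` holds the two records of that bus in time order, and `path_point_distance` gives the distance-into-trip of the four path points that bound the three PPIs.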

The transit AVL system produces real-time reports of the vehicle identifier, timestamp, and vehicle location (longitude and latitude). However, most applications using transit probe data require the vehicle’s location to be directly related to the distance along the geodetic path, which assigns the spatial–temporal transit probe samples to scheduled route locations and time slots. Figure 3 depicts the procedure of the trip inference algorithm, which consists of two steps: map matching and path inference. Map matching projects the probe GPS records onto the corresponding geographical road pattern in order to obtain the probe’s traveling distance along the field path. It is challenging to assign the matched points to the correct candidate link when the transit route is surrounded by several nearby links, i.e. when the GPS position is measured at the entrance of path branches or between two adjacent parallel links.

Figure 3.

A schematic of trip inference algorithm.

Figure 3 illustrates a practical method for map-matching the probe sample data to the consecutive corresponding PPIs and then creating the transit probe path.⁷,¹⁸ Each probe location is projected onto the closest segment. A simple algorithm based on the vector dot product calculates the length of the vector $\vec{d}$, which is the distance traveled by point $p$ along the direction of the segment $\overrightarrow{q_{1} q_{2}}$, as shown in Figure 3. The vector $\vec{d}$ is the projection of vector $\vec{w}$ onto the direction of vector $\vec{v}$, which follows from the dot product formula

\vec{w} \cdot \vec{v} = | w | | v | \cos \theta

(1)

where $\vec{v} = \vec{q}_{2} - \vec{q}_{1}$ and $\vec{w} = \vec{p} - \vec{q}_{1}$, $| w |$ and $| v |$ denote the magnitudes of the vectors $\vec{w}$ and $\vec{v}$, respectively, and $\theta$ is the angle between them. The projection of $\vec{w}$ onto $\overrightarrow{q_{1} q_{2}}$ is calculated as

\vec{d} = \frac{(\vec{w} \cdot \vec{v})}{(\vec{v} \cdot \vec{v})} \cdot \vec{v}

(2)
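As a minimal sketch of equations (1) and (2), the projection can be written in a few lines of Python with NumPy. The function name `project_onto_segment` is hypothetical, the coordinates are assumed to be planar (for example, metres in a projected coordinate system rather than raw longitude and latitude), and clamping the match to the segment ends is an added assumption that the text does not specify.

```python
import numpy as np


def project_onto_segment(p, q1, q2):
    """Return (distance along q1->q2, matched point) for a probe position p."""
    p, q1, q2 = (np.asarray(a, dtype=float) for a in (p, q1, q2))
    v = q2 - q1                      # segment direction vector
    w = p - q1                       # vector from segment start to the probe
    vv = np.dot(v, v)
    if vv == 0.0:                    # degenerate segment: both ends coincide
        return 0.0, q1
    s = np.dot(w, v) / vv            # scalar factor (w . v) / (v . v) in equation (2)
    s = min(1.0, max(0.0, s))        # assumption: keep the match on the segment
    d = s * v                        # projection vector d
    return float(np.linalg.norm(d)), q1 + d


# A probe 40 m off a 100 m east-west segment, 30 m from its start.
dist, matched = project_onto_segment(p=(30.0, 40.0), q1=(0.0, 0.0), q2=(100.0, 0.0))
```

In this example the probe is matched to the point (30, 0), so its distance along the segment is 30 m; adding the lengths of the PPIs already traversed would give the distance-into-trip.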

Map matching of the probe sample data is an essential step in using transit bus AVL data as a source for travel time estimation. The trajectories of the moving buses, expressed as distance from a reference point, are then portrayed in a time–space diagram based on the map-matched sample data, as shown in Figure 4, which plots the projected bus samples on 23 Ave., Edmonton from 13:00 to 17:30 on Monday, 4 July 2016. In Figure 4, the dotted lines in different colors record the transit bus locations on each trip at an average interval of 30 s. The red dashed lines indicate the scheduled bus stops along the bus route, and the black dotted lines indicate the fixed intersections on 23 Ave. This plot of the locations of each individual probe in time–space coordinates is important for understanding the sample’s dynamic information, especially for estimating transit travel speed, travel time, and delay. For example, the slopes of the trajectories clearly flatten at bus stops and intersections.

Figure 4.

Time series of the computed trip distance.

The slopes of the trajectories reveal how bus speeds vary with the road segment and the time slot. Based on the distance-into-trip, each entry $A_{r} (u, r, k)$ in Figure 1 is defined as

A_{r} (u, r, k) = v_{u, r, k} = \frac{d_{u, r + 1} - d_{u, r}}{t_{u, r + 1} - t_{u, r}}

(3)

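where $A_{r} (u, r, k)$ denotes the average speed of the $u$th bus traveling on the $r$th road segment in time slot $k$, and $d_{u, r}$ and $t_{u, r}$ are the distance-into-trip and the time at which the bus reaches the start of that segment.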
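To make equation (3) concrete, the sketch below computes link speeds from map-matched samples and writes them into a small tensor. It is an illustration under several assumptions that the text does not state: the time at which a bus crosses a path point is obtained by linear interpolation between neighboring samples, a speed is assigned to the time slot in which the bus enters the segment, unobserved entries are stored as zero, and each trip's distance-into-trip is strictly increasing. The function names and the sample values are hypothetical.

```python
import numpy as np

SLOT_SECONDS = 600  # 10-min time slots, as in the case study


def link_speeds(times_s, dist_m, path_points_m):
    """Equation (3) for one trip: mean speed on each segment between path points."""
    times_s = np.asarray(times_s, dtype=float)
    dist_m = np.asarray(dist_m, dtype=float)          # must be increasing
    path_points_m = np.asarray(path_points_m, dtype=float)
    t_cross = np.interp(path_points_m, dist_m, times_s)  # time at each path point
    seg_len = np.diff(path_points_m)
    seg_time = np.diff(t_cross)
    # Leave the speed at zero where the crossing times are not separable.
    speed = np.divide(seg_len, seg_time, out=np.zeros_like(seg_len), where=seg_time > 0)
    return speed, t_cross[:-1]


def fill_tensor(trips, path_points_m, n_slots, slot_seconds=SLOT_SECONDS):
    """Build A_r(u, r, k) with zeros for unobserved (bus, segment, slot) cells."""
    n_seg = len(path_points_m) - 1
    A = np.zeros((len(trips), n_seg, n_slots))
    for u, (times_s, dist_m) in enumerate(trips):
        speed, t_enter = link_speeds(times_s, dist_m, path_points_m)
        slots = (t_enter // slot_seconds).astype(int)
        for r in range(n_seg):
            if 0 <= slots[r] < n_slots:
                A[u, r, slots[r]] = speed[r]
    return A


# Two made-up trips over two 300 m segments, sampled every 30 s.
path_points = [0.0, 300.0, 600.0]
trips = [
    ([0, 30, 60, 90, 120], [0, 150, 300, 450, 600]),
    ([600, 630, 660, 690], [0, 300, 450, 600]),
]
A_r = fill_tensor(trips, path_points, n_slots=2)
```

With these inputs the first bus travels both segments at 5 m/s in slot 0, and the second bus travels them at 10 m/s and 5 m/s in slot 1; all other cells stay zero, which is how the sparsity discussed in the next section arises.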

Sparse trajectory reconstruction

We construct a tensor model $A_{r} (u, r, k)$ to describe the probe travel information. The tensor $A_{r}$ in Figure 1 is built from the transit GPS logger data and is very sparse, because in almost every time slot many road segments have not been traversed by a bus. Furthermore, transit locations are recorded only every 30 s on average, and some records are lost because of GPS signal failure.

An imputation method based on Tucker decomposition¹⁸ is applied in this study to fit the missing traffic measurements. Suppose that $A \in \mathbb{R}^{I_{1} \times I_{2} \times I_{3}}$ is a 3D tensor with missing values, which can be decomposed into the product of a core tensor $S \in \mathbb{R}^{R_{1} \times R_{2} \times R_{3}}$ and three principal factor matrices $X \in \mathbb{R}^{I_{1} \times R_{1}}$, $Y \in \mathbb{R}^{I_{2} \times R_{2}}$, and $Z \in \mathbb{R}^{I_{3} \times R_{3}}$. With a cost function that minimizes the fitting error, the tensor imputation based on Tucker decomposition can be presented as

F (S, X, Y, Z) \equiv \operatorname{argmin} \frac{1}{2} \| W * (A - S \times_{1} X \times_{2} Y \times_{3} Z) \|_{F}^{2}

(4)

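where $\times_{n}$ denotes the mode-$n$ product, $*$ denotes the element-wise product, and $W$ is a non-negative weight tensor of the same size as $A$, which can be defined as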

W_{i_{1}, i_{2}, i_{3}} = \begin{cases} 1, & \text{if } a_{i_{1}, i_{2}, i_{3}} \text{ in } A \text{ is known} \\ 0, & \text{if } a_{i_{1}, i_{2}, i_{3}} \text{ in } A \text{ is unknown} \end{cases}
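The weighted cost in equation (4) can be evaluated directly with NumPy. The sketch below is illustrative only: the function names are hypothetical, the tensor sizes and ranks are arbitrary, and the data are synthetic. The Tucker product $S \times_{1} X \times_{2} Y \times_{3} Z$ is written as a single `np.einsum` contraction.

```python
import numpy as np


def tucker_reconstruct(S, X, Y, Z):
    """S x_1 X x_2 Y x_3 Z: contract each core mode with its factor matrix."""
    return np.einsum("abc,ia,jb,kc->ijk", S, X, Y, Z)


def masked_cost(A, W, S, X, Y, Z):
    """Equation (4): half the squared Frobenius norm of the masked residual."""
    residual = W * (A - tucker_reconstruct(S, X, Y, Z))
    return 0.5 * np.sum(residual ** 2)


rng = np.random.default_rng(0)
S = rng.normal(size=(2, 2, 2))        # core tensor (R1, R2, R3)
X = rng.normal(size=(4, 2))           # I1 x R1
Y = rng.normal(size=(5, 2))           # I2 x R2
Z = rng.normal(size=(6, 2))           # I3 x R3

A_full = tucker_reconstruct(S, X, Y, Z)
W = (rng.random(A_full.shape) < 0.3).astype(float)  # 1 = known, 0 = unknown
A = W * A_full                                       # unknown entries stored as zero

cost_at_truth = masked_cost(A, W, S, X, Y, Z)
cost_at_zero_core = masked_cost(A, W, np.zeros_like(S), X, Y, Z)
```

Because the mask removes the unknown entries from the residual, the cost is zero at the factors that generated the data even though about 70% of the entries are missing, while a zero core leaves the full energy of the observed entries.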

It should be noted that the function $F (S, X, Y, Z)$ is non-convex, so a (local) minimum is sought by gradient-based optimization. For given $(X_{0}, Y_{0}, Z_{0})$, the gradient of $F$ with respect to the core tensor $S$ is computed as follows

\overset{\cdot}{F} = - (W * (A - S \times_{1} X_{0} \times_{2} Y_{0} \times_{3} Z_{0})) \times_{1} X_{0}^{T} \times_{2} Y_{0}^{T} \times_{3} Z_{0}^{T}

(5)

Setting $\overset{\cdot}{F}$ in equation (5) to zero yields the optimal core tensor $S_{0}$. However, Tucker-based decomposition has been shown to be highly indeterminate when applied to a large-scale value domain. Thus, a regularization method is proposed by setting up the tensors $B = W * A$ and $C = W * (S \times_{1} X \times_{2} Y \times_{3} Z)$. The function $F (S, X, Y, Z)$ is then equivalent to equation (6)

F (S, X, Y, Z) = \frac{1}{2} \| B - C \|_{F}^{2}

(6)

The partial derivatives of the regularized cost function $F$ are given below, where the subscript $(n)$ denotes the mode-$n$ unfolding of a tensor and $\otimes$ is the Kronecker product; a stationary point is reached when the partial derivatives are set equal to zero

\begin{matrix} \frac{\partial F}{\partial S} = (C - B) \times_{1} X_{0}^{T} \times_{2} Y_{0}^{T} \times_{3} Z_{0}^{T} \\ \frac{\partial F}{\partial X} = (C_{(1)} - B_{(1)}) (Z \otimes Y) S_{(1)}^{T} \\ \frac{\partial F}{\partial Y} = (C_{(2)} - B_{(2)}) (Z \otimes X) S_{(2)}^{T} \\ \frac{\partial F}{\partial Z} = (C_{(3)} - B_{(3)}) (X \otimes Y) S_{(3)}^{T} \end{matrix}
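The following sketch shows how these partial derivatives could drive a simple gradient descent. It is not the solver used in this study: the function names, the backtracking step rule, the initial step size, and the synthetic data are all assumptions made for illustration. The gradients are written with `np.einsum` on the full tensors, which avoids committing to a particular unfolding and Kronecker ordering convention, and one entry is checked against a finite difference.

```python
import numpy as np


def reconstruct(S, X, Y, Z):
    return np.einsum("abc,ia,jb,kc->ijk", S, X, Y, Z)


def cost(B, W, S, X, Y, Z):
    """Equation (6) with B = W * A and C = W * (S x_1 X x_2 Y x_3 Z)."""
    return 0.5 * np.sum((B - W * reconstruct(S, X, Y, Z)) ** 2)


def gradients(B, W, S, X, Y, Z):
    E = W * (W * reconstruct(S, X, Y, Z) - B)   # equals C - B for a 0/1 mask
    gS = np.einsum("ijk,ia,jb,kc->abc", E, X, Y, Z)
    gX = np.einsum("ijk,abc,jb,kc->ia", E, S, Y, Z)
    gY = np.einsum("ijk,abc,ia,kc->jb", E, S, X, Z)
    gZ = np.einsum("ijk,abc,ia,jb->kc", E, S, X, Y)
    return gS, gX, gY, gZ


def descend(B, W, params, steps=100, lr=1e-2):
    """Gradient descent; the step is halved until the cost actually decreases."""
    best = cost(B, W, *params)
    for _ in range(steps):
        grads = gradients(B, W, *params)
        step = lr
        while step > 1e-12:
            trial = tuple(p - step * g for p, g in zip(params, grads))
            trial_cost = cost(B, W, *trial)
            if trial_cost < best:
                params, best = trial, trial_cost
                break
            step *= 0.5
    return params, best


rng = np.random.default_rng(0)
shape, ranks = (4, 5, 6), (2, 2, 2)
A_full = reconstruct(rng.normal(size=ranks),
                     *(rng.normal(size=(i, r)) for i, r in zip(shape, ranks)))
W = (rng.random(shape) < 0.4).astype(float)     # synthetic observation mask
B = W * A_full

S0 = rng.normal(size=ranks)
X0, Y0, Z0 = (rng.normal(size=(i, r)) for i, r in zip(shape, ranks))

# Finite-difference check of one entry of dF/dX.
eps = 1e-6
Xp, Xm = X0.copy(), X0.copy()
Xp[0, 0] += eps
Xm[0, 0] -= eps
fd = (cost(B, W, S0, Xp, Y0, Z0) - cost(B, W, S0, Xm, Y0, Z0)) / (2 * eps)
analytic = gradients(B, W, S0, X0, Y0, Z0)[1][0, 0]

initial_cost = cost(B, W, S0, X0, Y0, Z0)
params, final_cost = descend(B, W, (S0, X0, Y0, Z0))
A_rec = reconstruct(*params)                    # imputed tensor, including unknown cells
```

The backtracking rule only guarantees that the cost does not increase; since $F$ is non-convex, the result is a local solution that depends on the starting point.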

Segment travel time estimation

This section presents the travel speed and travel time calculation; the estimator is driven by the reconstructed transit trajectories and produces estimates $\{v_{r}, t_{r} \mid 1 \leqslant r \leqslant D\}$ on sequential PPIs. Consider an array $v (u, r, k)$ $(u \in N, r \in D, k \in T)$ denoting the velocity of the $u$th bus on the $r$th road segment in the $k$th time section. Figure 5 illustrates a velocity field built from the probe trajectories of several buses traversing a route $D$ in the time slot $T$. From the reconstructed trajectories, each vehicle’s average speed $v_{u, d_{i}, t_{j}}$ on the link $d_{i}$ at time $t_{j}$ is used to approximate the instantaneous space-mean speed $v_{D, t_{j}}$, the local time-mean speed $v_{d_{i}, T}$, and the average time $\bar{T} (D, T)$ needed to travel through the route $D$ in the time slot $T$.

Figure 5.

Trajectories in space and time based on the inferred transit AVL data.

In Figure 5, the local time-mean speed $v_{d_{i}, T}$ of buses passing a cross-section $d_{i}$ during the period $T$ is defined as

v_{d_{i}, T} = \frac{1}{| \frac{T}{t_{j}} |} \sum_{u = 1}^{T / t_{j}} v_{u, d_{i}, t_{j}}, t_{j} \in T

(7)

The instantaneous space-mean speed $v_{D, t_{j}}$ over road section $D$ at a given moment $t_{j}$ is defined as

v_{D, t_{j}} = \frac{1}{| \frac{D}{d_{i}} |} \sum_{u = 1}^{D / d_{i}} v_{u, d_{i}, t_{j}}, d_{i} \in D

(8)

Then we can estimate the probe travel time by dividing the length of the road section $D$ by the instantaneous space-mean speed $v_{D, t_{j}}$

T^{*} (D, t_{j}) = \frac{D}{v_{D, t_{j}}}

(9)

where $T^{*}$ is the instantaneous travel time over the consecutive segments $\{D \mid d_{i} \in D\}$, computed from the trajectories of the probes currently moving within that space at the moment $t_{j}$, under the assumption that conditions do not change significantly during the period. We can further calculate the general travel time of the probe vehicles collected in the previous $T$ time slots and estimate the average route travel time as

\bar{T} (D, T) = \sum_{i = 1}^{D / d_{i}} \frac{d_{i}}{v_{d_{i}, T}}

(10)

where $d_{i}$ denotes the length of the segment from path point $i$ to path point $i + 1$. The travel time defined in equation (10) thus belongs to a path running through the whole probe velocity field. It is essential to identify the information in the $T$ time slots in order to determine $\bar{T} (D, T)$ precisely.
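Equations (7) to (10) reduce to row and column averages of the speed field. The sketch below assumes, for illustration, a reconstructed field `v[i, j]` that already holds one speed per segment $d_{i}$ and time step $t_{j}$ (for example, after averaging over buses); the function names and the numbers are hypothetical.

```python
import numpy as np


def time_mean_speed(v):
    """Equation (7): mean over time steps, one value per segment d_i."""
    return v.mean(axis=1)


def space_mean_speed(v):
    """Equation (8): mean over segments, one value per time step t_j."""
    return v.mean(axis=0)


def instantaneous_travel_time(seg_len_m, v):
    """Equation (9): route length divided by the space-mean speed at each t_j."""
    return seg_len_m.sum() / space_mean_speed(v)


def average_route_travel_time(seg_len_m, v):
    """Equation (10): sum of segment length over time-mean speed."""
    return np.sum(seg_len_m / time_mean_speed(v))


seg_len_m = np.array([400.0, 600.0])   # two segments, 1000 m in total
v = np.array([[8.0, 12.0],             # segment 1 speeds (m/s) at t_1, t_2
              [4.0, 6.0]])             # segment 2 speeds (m/s) at t_1, t_2

v_time = time_mean_speed(v)                            # [10., 5.] m/s
v_space = space_mean_speed(v)                          # [6., 9.] m/s
t_star = instantaneous_travel_time(seg_len_m, v)       # 1000/6 s and 1000/9 s
t_bar = average_route_travel_time(seg_len_m, v)        # 400/10 + 600/5 = 160 s
```

Equation (10) weights each segment by its own length and speed, whereas equation (9) applies one average speed to the whole route, so the two estimates generally differ when speeds vary strongly between segments.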

Case study

In this section, a case study illustrates the estimation of transit travel speed and travel time on the consecutive route segments of 23 Ave. in Edmonton, Canada. As shown in Figure 6, the test route is a 3.2-km stretch of a main urban corridor from the Tegler Gate bus stop to 111 Street, which includes seven bus stops and seven intersections (five signalized and two unsignalized). The speed limit is 60 km/h. We set up eight path points at the bus stops to separate the route into seven segments, so that the effect of stop delay is reduced to some extent when we evaluate the travel speed on each segment.

Figure 6.

Transit route on 23 Ave. with intersections and bus stops identified.

The transit AVL system broadcasts records containing the location (latitude, longitude), timestamp, and vehicle ID once every 30 s on average. In this case, a total of 5364 observations generated by 336 trips on 23 Ave. in Edmonton from 4 to 11 July 2016 are used as the samples. The total length of the trajectories is over 1407 km.

We map-matched the collected raw transit probe data to the scheduled route and estimated the bus travel information in the corresponding route segments and time slots. On this basis, we created two tensor models, $A_{r}$ and $A_{h}$, filled with the probe travel information in the current time slots and the historical time slots, respectively. The time interval in $A_{r}$ and $A_{h}$ is set to 10 min to balance the effectiveness and efficiency of the calculation.¹⁹ We removed the sample records of bus routes that traveled no more than 10 times per week on average, which may be due to noise or imperfect map matching. To test the performance of road travel time estimation with inadequate probe samples, we randomly removed positive values from the tensor $A_{r}$ in the current time slots. We then inferred these entries and compared the results with the original values. For the random missing case in the tensor $A_{r}$, the ratio of non-zero entries $R$ is calculated as

R = \frac{M}{N} \times 100\%

(11)

where $M$ is the number of non-zero entries and $N$ is the total number of entries in the tensor $A_{r}$. In addition to the transit AVL dataset, we used high-frequency GPS loggers (1 Hz) to record the transit bus locations as ground truth. In this study, the imputation performance was evaluated by the root mean squared error (RMSE) between the estimated results $t_{e}$ and the tested points $t_{t}$, which is defined as

RMSE = \sqrt{\frac{1}{N - M} \sum_{m = 1}^{N - M} {(t_{t}^{(m)} - t_{e}^{(m)})}^{2}}

(12)

where $t_{t}^{(m)}$ and $t_{e}^{(m)}$ are the $m$th elements of the known real value and the inferred value, respectively.
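The evaluation protocol around equations (11) and (12) can be sketched as follows. The helper names, the hold-out fraction, and all numbers are illustrative assumptions and are not taken from the case study data.

```python
import numpy as np


def nonzero_ratio(A):
    """Equation (11): percentage of non-zero entries in the tensor."""
    return 100.0 * np.count_nonzero(A) / A.size


def hold_out(A, fraction, rng):
    """Zero out a random fraction of the non-zero entries; return the copy and their flat indices."""
    observed = np.flatnonzero(A)
    n_hide = int(round(fraction * observed.size))
    hidden = rng.choice(observed, size=n_hide, replace=False)
    A_train = A.copy()
    A_train.flat[hidden] = 0.0
    return A_train, hidden


def rmse(t_true, t_est):
    """Equation (12): root mean squared error over the tested entries."""
    t_true = np.asarray(t_true, dtype=float)
    t_est = np.asarray(t_est, dtype=float)
    return float(np.sqrt(np.mean((t_true - t_est) ** 2)))


rng = np.random.default_rng(0)
A = np.zeros((2, 2, 5))                                   # 20 cells
A.flat[[0, 3, 5, 7, 11, 13, 17, 19]] = [8.1, 7.4, 9.0, 6.5, 10.2, 5.8, 7.7, 8.9]

ratio_before = nonzero_ratio(A)                           # 8 of 20 cells
A_train, hidden = hold_out(A, fraction=0.25, rng=rng)     # hides 2 observed cells
ratio_after = nonzero_ratio(A_train)                      # 6 of 20 cells

error = rmse([10.0, 12.0, 8.0, 9.0], [11.0, 12.0, 6.0, 9.0])
```

In an experiment of this kind, the imputed values at the positions in `hidden` would be compared with `A.flat[hidden]` through `rmse`.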

Results and discussion

In this section, the performance of the tensor-based method for inferring the missing probe travel time information in specific time slots and route segments is assessed. Based on the reconstructed trajectories, the local time-mean travel speed and the average travel time on each route segment over the time of day are then estimated and evaluated.

The results of the inferred probe travel time information are shown in Figure 7. The tensors $A_{r}$ and $A_{h}$ were built to model the missing cases in the recent and previous time slots. The statistics of non-zero entries in the tensor models $A_{r}$ and $A_{h}$ are shown in Figure 7(b). The non-zero entries represent the number of bus probe trajectories in each spatial–temporal unit. For instance, each road segment was traversed only 1–2 times in every time slot from 14:30 to 15:00. Thus, the average ratio of non-zero entries is very low in both the recent statistics (3.89%) and the historical statistics (5.02%). The number of trajectories increased rapidly after 16:00 (3–4 per time slot), and the average ratio of non-zero entries becomes higher in both the recent statistics (5.13%) and the historical statistics (9.2%). The results of trajectory reconstruction are shown in Figure 7(a), where the zero entries in the tensors $A_{r}$ and $A_{h}$ have been inferred. Based on the tensor-inferred travel speed information, the local mean travel speed on each segment over the time of day can be estimated, which in turn is used as an index of transit travel efficiency in the different time slots along each traversed segment. Our results indicate that the proposed method reconstructs the transit route travel time information accurately even when the ratio of missing measurements is high.

Figure 7.

(a) Reconstructed travel speed field, (b) statistics of non-zero entries in tensor, and (c) performance of trajectory reconstruction.

In addition, the performance of the tensor-based method is evaluated using the current probe sample set alone and using both the current and historical sample sets; the result is shown in Figure 7(c). Tensor-based decomposition usually performs well in inferring missing values (i.e. transit travel speed over road segments in each time slot), but it fails to converge as the percentage of missing entries increases. For example, during the period 14:30–15:00, the entries in $A_{r}$ were extremely sparse, and the RMSE of the inferred travel speed was 3.02 m/s when only the current dataset was considered. When the historical trajectories were added to the inference, the estimation performance improved. This analysis indicates that the missing transit trajectories can be reconstructed well by our proposed method. It should be noted that adding more trajectories to the tensor decomposition (TD) yields more accurate estimates when the traffic conditions of the previously traversed probe records are highly similar to the current traffic state. However, more time is required for sparse data reconstruction when richer probe information is used in the calculation.

Based on the reconstructed travel information, the local time-mean travel speed and the average travel time on each segment over time of day were estimated and evaluated, as shown in Figure 8.

Figure 8.

(a) Travel time estimation results, (b) performance of travel time estimation changing over time of day, and (c) performance of travel time estimation changing over route segments.

Based on the estimated transit local time-mean travel speed over the time of day, we estimated the transit travel time across the different segments, as shown in Figure 8(a). We compared the performance of travel time estimation based on the TD method with that of the trajectory-based simple concatenation (TSC) method, as shown in Figure 8(b) and (c). The TSC method examines the road travel time on each segment using mainly the probe trajectories that passed in the recent short-term period.¹⁹ If the probe records on a certain segment are missing or very sparse, it uses the previously estimated travel time instead, and it sums the estimated results on each segment to obtain the travel time along the traversed path.
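The TSC baseline described above can be approximated by a forward fill followed by a sum over segments. The sketch below is a simplified reading of that description, not the implementation from the cited work: the function name, the use of NaN to mark slots without probes, and the example values are assumptions.

```python
import numpy as np


def tsc_route_time(seg_time_s):
    """Route travel time per slot from per-segment observations.

    seg_time_s[r, k] is the observed mean travel time (s) on segment r in
    slot k, or NaN when no probe passed. A missing value is replaced by the
    estimate from the previous slot, then segment times are summed.
    """
    filled = np.array(seg_time_s, dtype=float)
    for k in range(1, filled.shape[1]):
        missing = np.isnan(filled[:, k])
        filled[missing, k] = filled[missing, k - 1]
    return filled.sum(axis=0)


observed = np.array([
    [40.0, np.nan, 50.0],     # segment 1: no probe in the second slot
    [120.0, 110.0, np.nan],   # segment 2: no probe in the third slot
])
route_time = tsc_route_time(observed)   # 160 s, 150 s, 160 s
```

A segment with no observation in the first slot stays NaN in this sketch. The fill also carries stale values forward during sparse periods, which is consistent with the larger errors reported for TSC when few buses are running.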

Figure 8(b) presents how the performance of our method and of TSC changes over the time of day. The TSC-based estimate on road segments deviates from the true travel time in free traffic hours, around 14:30–16:00; for example, the RMSE of the TSC-based estimate is 1.55 min during 14:30–15:00. In peak traffic hours, that is, 16:00–17:00, the error of the TSC-based estimate decreases and comes close to that of the TD-based estimate. This result illustrates that the TSC-based method fails to provide accurate estimates if the probe sample size does not meet the minimum requirement: when there are not enough transit buses traveling on the road segments, for example, during 14:30–16:00, the error of TSC is higher than in the other time slots. By contrast, the RMSE of our method increases only slightly during free traffic hours, which demonstrates the effectiveness of inferring the unobserved trajectories.

Figure 8(c) shows the results of the average travel time estimation on each segment along the path. The errors of both TSC and our proposed method decrease as the traveled distance increases, because a longer path involves more samples than a shorter subpath. In particular, the travel time of an individual segment is significantly affected by intersections and bus stops, which makes it difficult to estimate the segment travel time in dynamic traffic situations.

Conclusion

In this article, we have shown how to make effective use of sparse bus probe data to measure the transit route travel speed and travel time in different temporal and spatial blocks. These data provide good coverage of the current and historical contexts learned from bus trajectories and map data. We have also shown, using the 3D tensor model, that a great deal of unobserved transit route travel time information can be derived from the existing sparse data. A general estimation procedure was then proposed to evaluate the transit route efficiency at any point for a bus traveling on a known path. The experimental results demonstrate the advantages of our method.

In addition, transit probe data may prove to be a feasible data source for arterial travel time calculations. Transit probe data increase the probe sample size along the corridor compared with using general probes alone, which in turn makes the arterial travel time calculation more accurate. In future studies, because transit vehicles and general vehicles have different running behaviors, for example, delay and dwell time at bus stops, the bias between transit travel time and arterial travel time has to be investigated. Also, the travel times on different segments are assumed to be independent conditional on the state of the system, which can lead to incorrect estimates of the travel time variability on certain routes. Therefore, the proposed method should be improved by incorporating more factors related to transit reliability, such as traffic congestion and weather conditions.

Footnotes

Acknowledgements

The authors gratefully thank the City of Edmonton for providing the transit GPS records and road network data for the case study.

Handling Editor: Zhixiong Li

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was partially supported by the Jiangxi Provincial Fund for Visiting Scholar Development Plan (Grant No. 2016109), Jiangxi Provincial Department of Education Science Research Fund Project (Grant No. GJJ170420), and ECJTU “TIANYOU Talent” Development Program (Grant No. 201709).

ORCID iD

Liqun Peng
