Abstract
Real-time traffic speed is indispensable for many ITS applications, such as traffic-aware route planning and eco-driving advisory system. Existing traffic speed estimation solutions assume vehicles travel along roads using constant speed. However, this assumption does not hold due to traffic dynamicity and can potentially lead to inaccurate estimation in real world. In this paper, we propose a novel in-network traffic speed estimation approach using infrastructure-free vehicular networks. The proposed solution utilizes macroscopic traffic flow model to estimate the traffic condition. The selected model only relies on vehicle density, which is less likely to be affected by the traffic dynamicity. In addition, we also demonstrate an application of the proposed solution in real-time route planning applications. Extensive evaluations using both traffic trace based large scale simulation and testbed based implementation have been performed. The results show that our solution outperforms some existing ones in terms of accuracy and efficiency in traffic-aware route planning applications.
1. Introduction
Nowadays, public traffic is a serious problem in big cities. Billions of hours are wasted everyday on traffic congestions. In addition, emission caused by vehicles has become one of the major sources of air pollution. To tackle these problems, people have developed many intelligent transportation system (ITS) applications to reduce vehicle traveling time by route planning [1] and to reduce emission by driving advisory [2]. To develop such ITS applications, real-time traffic information is a key prerequisite. With accurate real-time traffic information as prior knowledge, the performance of route planning and emission reduction can be significantly improved.
How to obtain real-time traffic information is one of the major issues to be solved. Conventional methods leverage traffic monitoring infrastructures, such as video surveillance cameras [3], inductive loops [4], or the combination of them to collect traffic data [5]. The main problem of this approach is the low availability of these facilities, due to the high deployment and maintenance cost. One alternative approach is to analyze historical traffic data [6] to find out some statistical traffic patterns. Unfortunately, historical data can not always reflect current situation since many emergent events (e.g., traffic accident) or temporary events (e.g. road surface maintenance) can lead to significant traffic flow variation.
Recently, using vehicular crowdsourcing to collect floating car data (FCD) for speed estimation [7] has attracted the attention of both academic and industrial bodies. Compared with infrastructure based approaches, the data collection cost of crowdsourcing is significantly reduced by leveraging individual users' contribution. However, the data transmission cost is still considerably high due to the use of cellular data to collect FCD for centralized data processing. In many ITS applications (e.g., route planning), the real-time traffic information of a road segment is only interested by the nearby vehicles. In these scenarios, an in-network traffic speed estimation solution is especially promising. Moreover, in-network processing can also reduce the cost of long distance data transmission.
With the rapid advancement of wireless communication and embedded technology, using vehicular ad hoc networks (VANET) to collect traffic information becomes popular recently [8–11]. Traffic information can be collected through vehicle to infrastructure (V2I) or vehicle to vehicle (V2V) communication. However, it is not easy to collect floating car data using VANETs. Considering the unique characteristics of VANETs and the application scenario, several challenging issues are raised:
Network Disconnection. Vehicle mobility is highly dynamic, which can cause the network to be frequently disconnected. When using VANET to collect traffic information, an interesting contradiction can be found between traffic condition and network condition. Good traffic condition (high vehicle speed, low traffic density) usually leads to poor network condition (high packet loss, opportunistic network) and vice versa. The solution must tolerate the poor network conditions. Vehicle Speed Variation. Vehicles on roads do not travel using constant speed. The individual vehicle speed can have remarkable change in a short period. For example, a red light is in front, or a taxi stops at the request of passengers. The instant variation of floating car data can lead to inaccuracy to the traffic estimation algorithm. A good analysis model must be able to eliminate this variation. Insufficient Traffic Sampling. Due to the infrastructure-less nature of VANETs, it is not easy to monitor an area for a long period of time. Moreover, the fast-changing individual vehicle speed and fragile network topology can potentially interfere with the data collection. Hence, the data sampling is always insufficient. The traffic estimation algorithm needs to deal with insufficient traffic sampling condition.
To tackle these challenges, some infrastructure based solution has been proposed. These solutions utilize roadside unit (e.g., Wi-Fi access points [12], cellular base stations [13], etc.) and vehicle to infrastructure (V2I) communication to monitor the traffic condition of a fixed area. With the help of roadside infrastructure, continuous sampling of an area and caching collected data can easily be achieved. However, these solutions have the same deployment problem as those traffic facility based solutions. Other researches rely on the assumption that vehicles travel with constant speed [8, 14]. Without considering vehicle speed variation, some straightforward traffic estimation algorithms like time mean speed or space mean speed can be developed. However, this assumption does not hold in real world and it can lead to inaccurate traffic speed estimation results.
In this paper, we propose a novel in-network traffic information collection and estimation solution. We name it MICE or Model based Infrastructure-free traffic Collection and Estimation. In MICE, traffic information is collected using infrastructure-free approach. Then, traffic condition is estimated using in-network data processing and macroscopic traffic flow model. The application scenario is depicted in Figure 1. When a vehicle wants to know the traffic condition of nearby roads, it sends a traffic collection request using V2V communication to ambient vehicles on demand. Those vehicles that received the request send their information back to the original vehicle for traffic condition estimation.

Application scenario and system model: using V2V communication to collect real-time traffic information.
The contributions of this work can be summarized as follows:
We propose a floating car data collection protocol, which leverages V2V communication. It can collect floating car data in segment level granularity and it is adaptive to different network conditions. We adopt Underwood density-speed traffic flow model [15] developed by traffic scientists to predict the traffic speed. This model can eliminate the temporary speed variations to improve the estimation accuracy, efficient enough for in-network data processing. We apply the proposed solution to the shortest-time route planning application. A heuristic based path finding algorithm is proposed to show how the proposed solution works in real world applications. Extensive evaluations and analysis are conducted using both large scale simulation and testbed implementation. Evaluation results show that our solution outperforms some static and dynamic route planning solutions.
The remainder of this paper is organized as follows: related works are summarized in Section 2. The MICE problem is formulated in Section 3. The solution to MICE is presented in Section 4 in detail. In Section 5, an application of the proposed solution is introduced. The evaluations and results will be analyzed in Section 6. Finally, Section 7 concludes this paper.
2. Related Work
Several related works on using VANET to estimate traffic speed can be found in literature. We first introduce the works on traffic information collection, followed by existing traffic speed estimation solutions.
2.1. Traffic Information Collection
Solutions on traffic information collection can be classified by whether they rely on road side infrastructure. Road side infrastructure can simplify network protocol design since it is a convenient place to store, cache, and analyze data. Many infrastructure based solutions have been proposed [9, 10, 16, 17].
PeerTIS and Kyun queue are two typical infrastructure based solutions. PeerTIS [13] firstly constructs a peer to peer overlay using cellular networks among vehicles. A traffic information dissemination protocol is developed to share information among vehicles. The drawbacks of using cellular networks as infrastructures are the high communication cost. Kyun queue [18] proposed a novel solution to use RSSI to monitor the road traffic congestion. Specifically, it monitors the length of the waiting queue in front of a road intersection by observing the interference caused by stopping cars. The proposed solution is simple and yet effective. However, road side wireless nodes still need to be deployed as infrastructures in advance.
Infrastructure-free solutions only use communication among vehicles and no extra infrastructure needs to be deployed. Therefore, it is more flexible [8, 19]. MobSampling, Geocache, and V2R2 are three typical solutions of this type.
PGB [20] is an infrastructure-free data collection solution. The key idea is to allow receivers to determine forwarding or dropping the packet. With receiver-based decision, the overhead of controlling message (e.g., RTS/CST) can be eliminated, and the communication performance can be improved. The paper recommends using geographical information or signal strength to group the receivers, and only the receivers that satisfy the predefined conditions can forward the data.
MobSampling [21] uses infrastructure-free vehicular networks to collection traffic information. A role switching algorithm is developed to cache the data within certain geographical regions. When the vehicle with cached data moves out of the region, it is required to pass the data to another vehicle. Geocache [22] is a pull-based geocast protocol combined with caching mechanism to reduce the communication throughput. Both MobSampling and Geocache work effectively under dense traffic. Otherwise, they will fail if no nearby vehicle is available. V2R2 [23] uses unicast along the shortest path and multicast for the return trip to collect the traffic condition around navigation source and destination. However, V2R2 ignores the data collection of roads with sparse traffic, which may limit the application scenarios of the solution.
Different from existing infrastructure-free solutions, our solution is designed for both dense and sparse traffic scenarios. Moreover, our solution does not have the cost of establishing and maintaining in-network cache. Therefore, there is no “warm-up” phase at the very beginning.
2.2. Traffic Speed Estimation
After floating car data is collected, how to leverage these data to estimate traffic is another important issue. The traffic estimation algorithm must be designed according to the properties of the data. Generally, floating data can be categorized as dense data and sparse data, depending on the sampling frequency. Infrastructure based data collections are useful to obtain dense data, while infrastructure-free data collections can usually get sparse data.
Li et al. [14] proposed two traffic estimation algorithms: link-based algorithm (LBA) and vehicle-based algorithm (VBA). LBA estimates the road segment traffic only using vehicles at the beginning and the end of the road segments, while VBA utilizes all vehicles. The results are compared with the ground truth captured by video surveillance. Both LBA and VBA use vehicle speed to estimate traffic. The algorithms are suitable with dense data.
BeeJamA [24] is a heuristic solution for dense data. It is derived from the honey bee behavior to avoid traffic congestion. A density-speed model is used to rate the congestion. However, this approach requires data to be collected continuously from an area and therefore only works with the help of roadside infrastructure and V2I communication.
Besides BeeJamA, researchers have developed many other heuristic models like neural networks [25] based and genetic algorithm [26] based solutions for traffic estimation. However, these heuristic algorithms usually demand dense data as training set and also require centralized processing. Therefore, they are not suitable for the case of infrastructure-free VANET.
When dealing with sparse floating car data, different analysis models can be applied to estimate the traffic condition. Some straightforward solutions use basic statistic analysis techniques like mean vehicle speed [8] or mean pass time [17]. Unfortunately, these schemes only work under the assumption of constant vehicle speed. In real world, they may not be able to accurately reflect the real traffic condition (e.g., data are collected when the traffic light is red).
StreetSmart [19] is a congestion discovery solution using V2V communication. It uses distributed clustering to calculate traffic condition. This approach is useful for long term traffic monitoring since it needs to construct and maintain a cluster structure. Unfortunately, it is not flexible enough for on-demand traffic estimation.
VAN [8] has exactly the same functional requirement and the same traffic collection design concern with our work. It uses average speed to estimate the traffic condition and the candidate paths are bounded by packets' time-to-live attribute. VAN can be considered as the state-of-the-art route planning solution using infrastructure-free vehicular networks, and we will compare our solution with VAN in our evaluation.
Different from the aforementioned solutions, MICE does not require training dataset and only utilizes macroscopic traffic flow model to estimate traffic speed. This density-speed model can handle sparse data since it only relies on vehicle density, which varies slightly in a continuous time period.
3. Problem Statement
3.1. Preliminaries
In our research, we assume vehicles have navigation devices with GPS installed onboard. The road networks topology, vehicle's current position, velocity, and global time can all be retrieved via GPS devices. In addition, we further assume vehicles can communicate with each other using V2V communication standard like DSRC to form the vehicular networks.
Before we formulate the problem, some definitions shall be introduced first.
Definition 1 (road segment).
A road segment is a 5-tuple
Definition 2 (vehicular network).
Vehicular network is a wireless ad hoc network formed by vehicles. At a specific time, it can be defined as a graph
Definition 3 (floating car data snapshot).
A floating car data snapshot is the traffic state of a road segment at a particular point of time. It can be described as a vector of vehicles, their positions, and velocities. The floating car data snapshot of road segment r at time t is denoted as
Definition 4 (traffic condition).
Traffic condition of a road segment can be defined as a series of floating car data snapshots sampled for the road segment in consecutive time periods
Alternatively, it can also be transformed as a group of vehicles that have appeared on the road segment during the time period
With the latter representation, we can use space mean speed to denote the traffic condition as
This definition is reasonable because when people talk about traffic condition of an area, they usually refer to the average condition during a time period, and so does our definition.
Definition 5 (traffic density).
Traffic density k of a road segment is defined as the number of vehicles per unit length per lane. The inverse of density
Intuitively, the traffic density and floating data snapshot have some correlations. Researchers have conducted extensive evaluations and built several models to investigate the correlation. Both urban driving scenario [27, 28] and highway driving scenario [29] have been studied. Among all the models, Poisson arrival process has been widely adopted because of the versatility and simplicity. It has been used to model the arrival of cars at toll gate, traffic light waiting queue, and a road segment. In our research, we adopt this result. Therefore, given a traffic snapshot of a road segment, the corresponding traffic density can be calculated as follows.
Since we also assume vehicles follow Poisson arrival process, as a result, the distances among consecutive vehicles
The symbols and notations used in this paper are summarized in Summary of Notations.
3.2. Problem Formulation
With the above definitions and assumptions, the objective of MICE is to (1) collect and then (2) estimate the traffic condition of a given road segment.
The traffic information collection problem is a design issue. We need to design a network data collection protocol for VANET to collect traffic information. In practice, the traffic density may vary from time to time. Consequently, the network connection is not always available when traffic density is low. The design objective of the protocol is as follows:
Adaptive. The protocol should be adaptive to different traffic density. Specifically, it needs to handle network disconnections in sparse traffic condition. Infrastructure-Free. The protocol should not rely on any road side infrastructure or cellular station. Only V2V networks should be used. On-Demand. The protocol should not wait a period of time to warm up before it can function well. It must work on demand. Real-Time. The traffic information needs to be collected and estimated with low latency.
Since we do not utilize infrastructure or cache, the results of a data collection instance are a traffic snapshot, which can be used to estimate the traffic speed. The traffic speed estimation problem can be regarded as a data analysis issue. It can be formulated as follows.
Given
A single road segment Space mean speed A traffic snapshot
Objective. Find a mapping function
Equation (3) shows that we want to minimize the error between the estimated value and the calculated value (ground truth). In our research, we use traffic trace and our own testbed. In both cases, the ground truth of the traffic speed can be easily obtained.
4. Solution
In this section, we focus on how to collect and estimate traffic speed for a single road segment. In real world, estimating the traffic condition of a single road segment does not seem necessary, since it can be directly observed by the drivers most of the times. However, the solution introduced in this section can be easily extended to a road network composed by many connected road segments.
4.1. Traffic Information Collection
The traffic speed estimation request can be initiated by any vehicle that wants to know the surrounding information at any time. After the request is initiated, the request packets are forwarded to nearby vehicles hop by hop. To demonstrate how the protocol works, we use a single road segment, for instance.
The traffic density changes from time to time, causing VANET to be disconnected frequently. Based on the connection status of VANET, we can define four different states, as depicted in Figure 2. They are as follows:
Dense Connection. Vehicles on the same direction can be connected directly. Sparse Connection. Vehicles on the same direction can only be connected with the help of vehicles from the opposite side. Partial Disconnection. The vehicle does not have next hop candidate, but it is still connected with the previous hop neighbor. Total Disconnection. The vehicle does not have any neighbors, it is completely disconnected.

Four VANET connection states, depending on different vehicle density.
The design objective of the protocol is to be adaptive with all these connection states. Therefore, different packet forwarding strategies shall be considered for different state. The complete packet forwarding algorithm on the arrival of a new packet is shown in Algorithm 1.
(1) (2) (3) (4) Greedy forward (5) (6) Greedy forward (7) Feedback (8) (9) Feedback (10) Carry p (11) (12) Carry p (13) (14) (15) (16) Greedy forward p to (17) (18) Greedy forward p to (19) Feedback (20) (21) (22)
For dense connection, packets will be forwarded greedily like GPSR [30] to the vehicle in neighbor table which is the closest to road segment end and has the same direction as the road segment (line (4)). For sparse connection, packets have to be forwarded to a vehicle on the opposite direction (line (6)). Otherwise, the vehicle will carry the received packet (lines (10) and (12)) for a while and try again. The packet carrying period is set to one second in our implementation. It is also possible that current vehicle is on the opposite direction of the road segment. In this case, vehicle will never carry packet, since the longer time it carries the packet, the further from the road segment end it will be. In this case, the vehicle at the opposite side will firstly try to forward the packet to vehicles with the same direction as the road (line (16)) and then seek for another opposite directed vehicle, which is closer to the road segment end as an alternative (line (18)).
When designing the packet forwarding protocol, two widely adopted strategies can be considered: sender-based decision and receiver-based decision, depending on who determines the next hop destination. They have different strengths and weaknesses. Generally, solutions using receiver-based decision can reduce the overhead of controlling message by eliminating multiple handshakes. Differently, those using sender-based decision are more flexible and adaptive to the environment, because the sender usually collects information of all candidates and then makes decision. In our application scenario, the protocol needs to be adaptive to different traffic conditions and flexibility is the key requirement. Therefore, we decide to adopt sender-based decision.
To implement the sender-based decision, we design an RTS/CTS scheme to collect traffic information as well as build neighbor table

The RTS/CST scheme and forward strategy.
The data collection result can be treated as a snapshot of floating car data. After all vehicles on the road segment are collected or one of the disconnection states occurs, the current traffic speed will be estimated. Specifically, speed estimation algorithm is invoked if one of the following conditions is satisfied:
Packet reaches a new road segment or destination. Sparse connection or partial connection occurs. The specified time interval has elapsed.
Condition (2) has been shown in lines (7) and (9) of Algorithm 1. Condition (3) is used to handle the special case that the traffic density is low but current vehicle moves slowly (e.g., stop for red light or temporary parking).
After the speed has been estimated, the solution also uses greedy forwarding algorithms to send the estimated speed back to the source vehicle. When the source vehicle receives the estimated speed, it will store the data locally for further usage. It is possible to receive multiple feedbacks for a single road segment. In this case, the one with the latest time stamp will be kept.
4.2. Traffic Speed Estimation
The traffic speed estimation problem has been formulated in Section 3. If we look at the objective function in (3), it looks like an optimization problem. However, it can not be directly solved by any optimization methods, because we are using sparse data (traffic snapshot) to estimate the dense data (traffic condition). And the dense data is unknown but has correlations with the sparse data. To estimate the speed, we need to model the relationship between the data we have and the data we want to get.
In this research, instead of training a model by ourselves, which requires extensive dataset and computation, we decide to utilize existing macroscopic traffic models developed in traffic engineering. Macroscopic traffic model studies the relationship among density, flow, and speed. In our research, we select Underwood model [15], which is a typical density-speed model.
The reasons we choose Underwood density-speed model are twofold. Firstly, several properties can be obtained from the floating data snapshot, like speed, density, vehicle position, and so forth. We need to select suitable features to build the model. Many existing works selected speed [8, 14, 17]. However, we observe that the variance of speed within a period of time is significant. To determine the variance of the speed, we analyze the traffic trace data from Cologne (details are introduced in Section 6), and the VMR (variance-to-mean ratio) of speed is 8.7, while the VMR of density is 4.5. Figure 5 shows the distribution of speed and density. Therefore, density is more stable than speed and can better reflect the traffic condition in a time period. Secondly, many macroscopic density-speed models have been calibrated using site data, for example, Greenberg, Underwood, and Drake [31]. Different models have different strength and weakness. Underwood is more accurate for uncongested condition compared with other models. Judging from Figure 5(b), the probability of uncongested condition is much higher than congested condition.
The original Underwood model can be described as

Underwood density-speed traffic model with

The distribution of different vehicle properties.
When optimum traffic density
Another issue is how to determine current density from the collected floating car data snapshot. As described in (2), λ is an unknown value. Now, we want to estimate λ from the collected floating car data snapshot.
From the floating car data snapshot we collected, we can construct the vector of distances between adjacent vehicles as
Considering the traffic regulations, we assume free flow speed is the same as maximum allowed speed; that is,
It is also possible to fail to collect traffic information for a road segment. This is mainly caused by the sparse traffic. In this case, we assume the average vehicle distance is larger than
The speed for a specific road segment can be estimated using (8) and (9). The only variables in these equations are the distances among vehicles, which can be obtained from traffic information collection results.
5. MICE in Route Planning Application
The previous section introduced how to collect traffic data and how to estimate traffic speed for a single road segment in MICE. In this section, we apply the proposed solution to a typical and popular ITS application: real-time route planning. This application utilizes the estimated traffic speed to find a shortest-time route for a vehicle. To find the shortest-time route, traffic condition of multiple road segments needs to be collected and estimated.
However, designing such an application is not trivial. To find a global shortest-time path, traffic information of all roads is prerequisite. But considering a city with thousands of roads in real world, it is not feasible to collect information of all roads since the search space is extremely huge. To make the solution practical, we have to be tolerant with a suboptimal solution by collecting traffic information for part of the roads and then make decision.
Since the traffic information is unknown and should be estimated in real-time, the shortest-time route planning problem is a typical stochastic shortest path problem with recourse (SSPPR) in graph theory, which explores adjacent nodes in a graph until finding the destination. Before exploring a node, the weight of the associated edges is unknown. The problem has been proved to be NP-complete [34] and hence has no polynomial time solution unless P = NP.
Our objective here is not to propose a better approximation algorithm for this difficult problem. Instead, we just want to demonstrate the application of our proposed estimation solution in a specific scenario. To solve this problem, we designed a heuristic solution according to two common senses: (1) the longer the path is, the longer the time it may take. (2) It makes no sense to choose another longer path when shortest path is not congested. We name the candidate path selection algorithm Backoff-and-Fork or BnF in short. The basic idea is derived from the two common senses. For (1), we define a total length constraint, which means the total length of a candidate path can not exceed the threshold. For (2), we define a speed threshold, which means if the estimated speed of the shortest path road is slower than the threshold, other nearby alternative paths will be searched.
The complete BnF algorithm is shown in Algorithm 2. At the beginning, the algorithm will only choose the road segments on the shortest path as candidate paths (lines (1)–(3)). Then the data collection request packets are sent to all candidate paths. After that, whenever a packet arrives at a new road segment, it will collect traffic information of all neighbor road segments that have not been collected and then estimate speed (lines (7)–(9)). If the estimated speed is good enough, it will go on with the next road segments. However, if the speed is lower than the predefined threshold (line (10)), the algorithm will send a backoff packet to the previous road segment (backoff). Then, all the neighbor road segments that have not been selected as candidate path will be used as a starting point to build a new shortest path to end point (fork), and the newly built shortest path will be added to the candidate path set if the length has not exceeded the threshold (lines (11)–(13)). To avoid overlapping between the new and old shortest path, the paths connecting existing candidate paths are excluded when executing the shortest path algorithm (line (12)).
// Initialize. (1) (2) (3) // On packet arrive at road segments. (4) (5) (6) (7) (8) // Collect data on road rs. (9) EstimatedSpeed = (10) // Backoff and fork. (11) (12) (13) (14) (15) r. IsCandidate = True (16) (17) (18) (19) (20) (21)
Figure 6 can be used to demonstrate how the BnF algorithm works. The starting point and end point are S and E, respectively. In the initial phase, the shortest path

An example of Backoff-and-Fork algorithm.
The algorithm will terminate when the request packets arrive at the navigation destination.
6. Performance Evaluation
6.1. Experiment Setup
In this section, we evaluate the performance of MICE using both small-scaled testbed implementation and traffic trace based large-scaled simulation. We choose two evaluation methods because both of them have advantages and disadvantages. The traffic trace provides a realistic evaluation scenario. However, some parameters like the average vehicle density or speed are not configurable. Consequently, we can not use the trace to test the impact of these parameters. On the contrary, the testbed is more flexible, but the road topology and traffic scale are very limited. Figures 7 and 8 show the testbed, road network topology, and flow chart of the evaluation.

iTranSNet testbed and cologne topology.

Evaluation flow chart.
We implemented the MICE on iTranSNet [35]. The iTranSNet is a testbed with several programmable mini cars, which is modified from toy cars by ourselves. The moving speed of the toy cars is very slow, approximately 0.4 m/s, and we can control the speed with program. The size of iTranSNet platform is 3 m × 6 m. The road network topology of iTranSNet is illustrated in Figure 7(a). The mini cars are controlled by onboard MICAz wireless sensor nodes. The MICAz sensor nodes can also communicate with each other using IEEE 802.15.4. The MICE algorithm is implemented on MICAz with TinyOS [36]. In our implementation, we set the transmission power to the lowest level (approximately 1 m) to enable multihop among vehicles. The positions of all mini cars can be determined by sensors on iTranSNet. The vehicles follow the traffic light control rules [37]. We run the experiments under several configurations: dense traffic (25 cars), sparse traffic (12 cars), and some intermediate values. The experiments are repeated 50 times with arbitrary route planning starting and end points.
For simulation, we use the traffic trace dataset from Cologne in Germany provided by [38], where detailed analysis of the trace can also be found. Traffic simulator SUMO [39] is used to simulate the traffic mobility and network simulator NS-3 [40] is used to simulate VANETs communication and run MICE algorithm. Table 1 shows other parameter settings for NS-3. The simulation process works as depicted by Figure 8(b): firstly, a subset of the traffic trace file (geographical region
NS-3 simulation parameters.
6.2. Traffic Collection Evaluation
To evaluate the performance of traffic collection protocol, we are interested in the network latency and the network overhead. The network latency is defined as
Figure 9 shows the traffic collection evaluation results. Two major factors are evaluated: path length and traffic density. The results (distance and time cost) of multiple evaluations are added together and plotted in Figure 9. From Figure 9(a), we can see that the latency increases as the path length. And for the same length, the latency for sparse traffic is usually larger than the case of dense traffic. This is caused by the frequent packet carrying in sparse traffic condition. Figure 9(b) shows that the total number of packets transmitted also increases as the path length. For the same length, more packets are transmitted in dense traffic condition. This is because of the RTS/CST scheme depicted in Figure 3. When the traffic is dense, after an RTS is sent, more CTS replies are generated and transmitted over the air.

Network evaluation results using iTranSNet testbed.
Figure 10 illustrates the traffic collection evaluation results using traffic trace. We run the experiments many times and sum up the distance and time consumption. As we can see, the increase trend of latency and the number of packets sent are similar as the results obtained by testbed. Moreover, the results are more convincing since the road condition, vehicle speed, and distribution are more realistic. The network latency of is less than 1 min/km. In real application scenario, the delay is acceptable.

Network evaluation results using traffic trace.
6.3. Speed Estimation Evaluation
To evaluate the performance of traffic speed estimation algorithm, we consider the impact of traffic density and time window length p in traffic condition (Definition 4). Data of a single road segment is collected and estimated. The road segment is selected randomly and the experiments are repeated 50 times. We compare our solution with VAN [8], which calculate the mean speed of collected vehicles to represent the traffic speed. The mean estimation error is calculated using (10). Therefore, the smaller the error is, the better the method works
Figure 11 illustrates the relationship between traffic density and the estimation error. The unit of density is the number of vehicles per meter per lane. From this figure, we can see that, in most of the cases, the estimation errors of MICE are smaller than VAN, which indicates that our method outperforms VAN in traffic speed estimation. The key insight is that, compared with mean speed used by VAN, the density-speed model used by MICE can better reveal the truth in most cases. However, the estimation error of MICE has a remarkable increase trend when the traffic density is high. There are two reasons for this. First, Underwood density-speed model we select performs better in low density scenario. Second, when vehicle density reaches 0.2 (headway distance is 5 m), the traffic is highly congested and the vehicles are almost stopped. Then, the variance is not significant, the mean speed method used by VAN can even better reflect the ground truth.

Traffic density and speed estimation accuracy.
Figure 12 shows the relationship between estimation error and time window size. The size of the time window affects the value of traffic condition mean speed

Time window size and speed estimation accuracy.
6.4. Route Planning Application
Finally, we evaluate the performance of different speed estimation solutions by putting them into route planning application. We calculate the shortest-time path using the estimated traffic speed from these solutions and then let the vehicles travel along the shortest-time path in SUMO or iTransNet testbed to obtain the actual time consumption. In this way, we can find out the performance of these solutions in real applications.
We firstly evaluate the successful collection ratio of road segments. The collection ratio is defined as

Collection ratio.
The performance of BnF algorithm is also evaluated. The evaluation is only performed with trace data since the results are not obvious on iTranSNet due to the small scale. We compare our heuristic solution with a spatial constrained broadcast solution, which collects traffic information of all roads within an ellipse with navigation starting point and end point as two foci of the ellipse. In the evaluation, we set the length threshold Θ to 3 times of the shortest path length and then increase the speed threshold

Candidate path.
Finally, we evaluate the route planning performance using different route planning algorithms. For route planning evaluation, we are interested in the effectiveness of route planning, that is, how much time can be saved, and the travel overhead, that is, how many extra distances have to be travelled in order to bypass the congested road segments. In the evaluation, we compare MICE with two static route planning algorithms: shortest path Dijkstra algorithm (SP, the weight of edges is
Given the same navigation starting and ending points, different route planning algorithms may generate either the same or different route planning results. Figure 15 shows the similarity and differences of the algorithm output. On iTanSNet testbed, the route planning results for SP, VAN, and MICE solutions are quite similar: about 60% of the total tries have the same result, and only 5% of total tries have completely different results. This is caused by the small scale. For traffic trace based simulation, only 30% of the results are identical. It can also be observed that the similarity between MICE and VAN is higher than the two static ones.

Overview of shortest-time path planning results.
The overall efficiency charts in Figures 16 and 17 reveal that our algorithm works better in dense traffic conditions, compared with spare ones. This conclusion does not contradict the results from Figure 11, because the shortest-time route planning algorithms can bypass dense road segments and select sparse ones automatically. In sparse traffic scenarios, the performance of the three algorithms is quite similar due to traffic congestion, although MICE indeed has some improvement. In contrast, MICE has about 25% time saving compared with Dijkstra algorithm and about 15% time saving compared with VAN. For the trace based simulation case in Figure 18, MICE has about 15% time saving compared with Dijkstra algorithm and about 5% time saving compared with VAN.

Performance results for path planning application in dense traffic using iTranSNet testbed.

Performance results for path planning application in sparse traffic using iTranSNet testbed.

Performance results for path planning application using traffic trace.
The time saved charts in Figures 16 and 17 use the same evaluation results as overall efficiency but use the time cost for the shortest path as baseline for better understanding. An interesting observation can be found in which, in the trace based simulation, sometimes the shortest-time algorithm spends even longer time than shortest path one. The result is consistent with our experiences. However, analyzing the result is out of the scope of this paper.
Finally, the distance increased chart shows the travel overhead to bypass the congested road. It also uses shortest path length as baseline. It is observed that the travel overhead is controlled within 10%. For the trace based simulation in Figure 18, it is interesting to discover that the overhead of MICE is only a little higher than the shortest-time algorithm and about 10% lower than VAN. In these figures, the improvement of MICE is not that obvious. As depicted in Figure 15, about 75% of the path planning results for VAN and MICE are the same. In order to reflect the real performance, we did not remove these duplicated results in the charts.
7. Conclusion
In this paper, we have proposed MICE, a traffic estimation solution by collecting and analyzing surrounding traffic information using VANETs in real-time. The traffic collection solution is infrastructure-free and scalable compared with conventional infrastructure based ones. We have also employed a novel density-speed traffic model to estimate traffic condition using the collected floating car data. Then, we have applied the proposed algorithm to a shortest-time route planning applications to demonstrate the performance of the solution. Extensive evaluations using both testbed based implementation and trace based simulation have been conducted. The results have shown that our solution can outperform some existing ones in terms of network transmission overhead, route planning effectiveness, and traffic overhead.
Footnotes
Summary of Notations
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
Acknowledgment
This research is partially supported by Microsoft Research Asia Accelerating Urban Informatics with Azure Program.
