Estimation Vehicular Waiting Time at Traffic Build-Up Queues

Abstract

Owing to the rapid growth of socioeconomic activities and the increased need for mobility, transportation problems such as congestion, accidents, and pollution have increased. However, improving the reliability of delay estimates and the real-time dissemination of information remains a challenge. An advanced border-crossing system that responds to changes in cross-border circumstances has become an urgent need. An automated system for queue-end monitoring is proposed, using image processing with transformed-domain and empirical mode decomposition (EMD) feature extraction. The performance of feedforward backpropagation artificial neural networks (ANNs) was evaluated and tested on a selected set of features. The experimental results showed that the discrete wavelet transform (DWT) based on Daubechies wavelets with level 2 decomposition accomplished vehicle recognition with a processing time of 2 sec and only 3 training epochs, with a best validation performance of 2.1053e-007. The use of EMD as a feature extractor accomplished vehicle recognition with a best validation performance of about 3.42e-09 and a processing time of 1 sec at epoch 3 of training, with a minimal percentage of error in recognizing each vehicle in the appropriate queue with the aid of the new concept of the roadside unit (RSU).

1. Introduction

Intelligent transportation systems (ITS) offer potential solutions to the growing congestion problems in major urban areas caused by increasing mobility demands, which directly and adversely affect level of service, transportation costs, commerce, tourism, and the environment. ITS involve the use of information technology, such as image processing and artificial neural networks, to solve transportation problems.

In this paper, we propose a novel ITS that aims at estimating the waiting times of vehicles stuck at “traffic buildups.” These buildups happen, for example, at traffic signals, border crossings, and “work zones” on highways and arterials. Estimating this “waiting time” enables ITS applications and actions that mitigate issues related to the environment, international trade, and safety. For example, reducing queue length at traffic signals significantly cuts $\mathrm{CO}_{2}$ emissions inside cities, as car engines spend less time idling. The characterization of waiting time at these queues also helps adjust the timing of traffic signals and is a basic component of adaptive traffic signal control.

We propose a cost-effective infrastructure-based system that accurately and dynamically detects vehicular waiting times at such queues. The system requires the deployment of cost-effective roadside units (RSUs), which are used as vehicular sensors. We calculate the vehicular waiting time by coordinating the operations of the RSUs and exchanging information among them. Each RSU has consumer-grade equipment consisting of a camera and a wireless communications module (e.g., a Wi-Fi module). The system relies on processing individual images (not videos) taken by the RSU cameras. Each RSU uses artificial neural networks (ANNs) to identify vehicles and its wireless communications module to exchange information with other RSUs, which enables the calculation of vehicular waiting times.

The rest of this paper is organized as follows. Section 2 is the literature review; Section 3 describes the proposed system with respect to its composition and mechanism; Section 4 introduces feature extraction using transformed domain algorithms; Section 5 introduces the empirical mode decomposition algorithm; Section 6 introduces the backpropagation feedforward artificial neural network used for recognition; Section 7 introduces the basic idea for calculating the waiting time; Section 8 presents the experimental results; and Section 9 is the conclusion and future work.

2. Literature Review

Previously proposed systems involve many technologies, such as video image processing (VIP) and wireless sensor networks (WSNs) used as road sensors. One example is the advanced warning system (AWS), which was designed as an automated system to improve tunnel safety; it reduces the potential for both primary and secondary collisions and reduces incident response times, producing an incident management system [1, 2]. The system has some key objectives: (i) provide a means of automated real-time advance notice, based on video frames, to motorists entering the tunnel of queues or lane blockages that may be beyond their sight; (ii) provide a means of automatic dissemination of information on overall system events, preferably via automatic email; and (iii) provide a means of remote Ministry LAN access to monitor and manually override when required [3]. The main advantage of our proposed system is the use of a small number of images in the recognition process instead of videos; we also focus on recognition with a small number of salient features. By comparison, state-of-the-art deployments rely on an imaging camera to collect traffic flow information, measure queue length, and estimate delay from it. For example, a camera may use an ultrasonic vehicle detector that detects vehicle presence from the time difference of the reflection of an ultrasonic wave fired from above the road surface to just under it. The number of vehicles passing in a unit of time is used to calculate queue length and delay. Another example is an ITV camera that can be installed along the side of the road or on light posts at road intersections. The images taken by this camera are processed, and features are extracted to calculate queue length. The camera is installed to cover a specific road area (around 150 m of road length). More expensive cameras can cover larger areas. However, a single camera generally has poor visibility [4].
Our proposed system has the following advantages: (i) it is highly efficient and operates in real time, through in-network processing of individual images as opposed to videos, which directly affects its processing time and communication speed; (ii) it is cost effective, as RSUs are built using consumer-grade technologies and open source tools; and (iii) it is highly reliable, as there is no single point of failure (i.e., a centralized server) in the system. Data processing is performed at many RSUs, so the failure of one RSU does not significantly affect the operation of the system.

3. The Proposed System

Figure 1 shows the main components of a deployment of the proposed system and the practical aspects of its deployment. There are three main stages: (i) data acquisition, which captures vehicle images of the queue of interest using the camera module connected to each RSU; (ii) feature extraction, which extracts the most salient features in each image using transformed-domain features such as the discrete Fourier transform (DFT), the discrete cosine transform (DCT), and the discrete wavelet transform (DWT) based on the Daubechies mother function; and (iii) recognition using ANNs trained with the feedforward backpropagation algorithm.

Figure 1

Main components of deployment.

The system is composed of RSUs deployed on the side of roads in areas around border crossings, checkpoints, or highway work zones. RSUs can be programmed to operate as data sensors. Each RSU is configured to run OpenWRT (http://www.openwrt.org/), a Linux distribution for embedded devices that provides a Software Development Kit (SDK) used to compile custom code into a package to be installed on different RSUs.

For the purposes of detecting waiting time, we interface cameras to RSU through USB ports. We extended the camera software required to drive hardware modules. With this setup, each RSU, and its attached camera, is controlled to take snapshots of vehicles on the road. RSUs process the images taken, as described below, to detect vehicles through a feature extraction module. RSUs communicate amongst themselves using Wi-Fi to forward data related to traffic queues and vehicle waiting times. This augments each RSU with spatial and contextual characteristics of surrounding environments as explained below. This wireless infrastructure enables routing of information in a multihop manner.

An RSU is a stationary access point (AP) of a wireless mesh network (WMN). RSUs exchange data packets over, possibly, multiple hops. We use optimized link state routing (OLSR), a proactive routing protocol that maintains an up-to-date routing table. APs exchange OLSR HELLO messages periodically to build and maintain this table. This dynamic method of building the table enables APs to self-configure and establish a WMN. HELLO messages advertise the one-hop interfaces of each AP. The periodic exchange of HELLO messages also enables the WMN to recover from a failed link or node. The system architecture does not require all RSUs to be connected to the Internet.

In general, a special type of RSU, called a gateway, allows integration with other network types (e.g., the Internet). The gateway receives and forwards the information using TCP/IP over the Internet, where packets are rerouted to reach the server. RSUs and gateways self-configure to identify their roles.

For example, in Figure 2, three vehicles (a red vehicle followed by a green vehicle, followed by a blue vehicle) constitute a platoon. RSU₁, through its installed camera, captures images of the vehicular platoon, which are used to build an ANN. Information on the resulting structure of the trained ANN is forwarded to RSU₂. RSU₂ takes images of vehicles. Each image is processed by the trained ANN until the platoon of vehicles is detected and recognized. The process is repeated for RSUs down the road. Knowing the distance between RSUs and assuming time synchronization between them, the travelling speed and waiting times of vehicles can be estimated and calibrated.

Figure 2

System deployment.

An important design parameter is the use of an NN-expiration timer. The trained NN is considered useless after the expiry of this timer. In the previous example scenario, RSU₂ (or the RSU downstream) cannot continue looking for the platoon forever.

This can be due to cars in the platoon changing their positions (e.g., blue vehicle followed by red and then green). This may cause the NN not to recognize the platoon. The expiry timer resets the algorithms and gets the RSU to restart image capturing, feature extraction, and platoon recognition.

This does not affect the overall ability of the system to estimate waiting times, as this ability depends on averaging the waiting time over many images taken during a specific period of time. One or two failures should not affect the overall average. In addition, our objective is to estimate waiting times on the scale of minutes, so accuracy on the scale of seconds is of little relevance.
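The NN-expiration timer described above can be sketched as follows. This is only an illustration under stated assumptions: the class name `ExpiringModel`, the `ttl_seconds` parameter, and the injectable clock are hypothetical and do not come from the deployed system.

```python
import time


class ExpiringModel:
    """Wraps a trained model with an expiration deadline (hypothetical helper)."""

    def __init__(self, model, ttl_seconds, clock=time.monotonic):
        self.model = model
        self._clock = clock  # injectable so the timer can be tested
        self._deadline = clock() + ttl_seconds

    def is_expired(self):
        """True once the expiration timer has run out."""
        return self._clock() >= self._deadline

    def get(self):
        """Return the model, or None once the timer has expired."""
        return None if self.is_expired() else self.model
```

When `get()` returns `None`, an RSU would discard the network and restart image capturing, feature extraction, and platoon recognition.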

By default, RSU₁ sends the trained NN to RSU₂ to start the recognition process. If RSU₂ has failed for any technical reason (software or hardware failure), RSU₁ automatically resends the NN to the next RSU in the direction of the queue. In our case, if RSU₂ has failed, RSU₁ resends the trained NN to RSU₃, not RSU₀, as shown in Figure 3.

Figure 3

RSU unit failure scenario.
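The failure scenario of Figure 3 amounts to picking the first working RSU downstream of the sender. A minimal sketch follows, assuming the RSUs are represented as a list of liveness flags ordered along the direction of the queue; this representation and the function name `next_live_rsu` are hypothetical.

```python
def next_live_rsu(sender_index, alive):
    """Return the index of the first live RSU downstream of the sender, or None.

    `alive` is a list of booleans ordered along the direction of the queue
    (an assumed representation; the paper does not specify one).
    """
    for idx in range(sender_index + 1, len(alive)):
        if alive[idx]:
            return idx
    return None
```

With `alive = [True, True, False, True]`, the call `next_live_rsu(1, alive)` skips the failed RSU₂ and selects RSU₃; upstream units such as RSU₀ are never considered.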

4. Image Processing and Feature Extraction Using Transformed Domain (Frequency and Wavelet Domain)

In the proposed system, three categories of transformed-domain feature extractors are used: (i) the discrete wavelet transform (DWT), (ii) the discrete cosine transform (DCT), and (iii) the fast Fourier transform (FFT) as an efficient way of computing the discrete Fourier transform (DFT). For FFT, DCT, and DWT, we make the following mathematical assumptions.

(i)

Each vehicle image is a 2D matrix which can be expressed in spatial domain as a matrix of size $x \times y$ and in transformed domain as a matrix of size $u \times v$ .

(ii)

$x$ or $u = 0, 1, 2, \dots, M - 1$ and $y$ or $v = 0, 1, 2, \dots, N - 1$ .

(iii)

[ $M, N$ ] is the size of each image.

A DFT decomposes a sequence of values into components of different frequencies. This operation is useful in many fields but computing it directly from the definition is often too slow to be practical. Fast Fourier transform (FFT) is a way to compute the same result more quickly, so FFT is an efficient algorithm to compute DFT [5]. 2-D FFT for each vehicle image is calculated through the following equation:

\begin{matrix} F (u, v) = \sum_{x = 0}^{M - 1} \sum_{y = 0}^{N - 1} f (x, y) e^{- i 2 π (\frac{x u}{M} + \frac{y v}{N})} . \end{matrix}

(1)
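As an illustration of equation (1), the sketch below evaluates the 2-D DFT directly from the definition and then reduces the coefficient magnitudes to one mean value per column. It uses only the Python standard library and is far slower than an FFT. The function names and the column-wise averaging layout are assumptions made for this example, not the authors' implementation.

```python
import cmath


def dft2(image):
    """Naive 2-D DFT of a list-of-rows image, following equation (1)."""
    M, N = len(image), len(image[0])
    out = [[0j] * N for _ in range(M)]
    for u in range(M):
        for v in range(N):
            total = 0j
            for x in range(M):
                for y in range(N):
                    angle = -2 * cmath.pi * (x * u / M + y * v / N)
                    total += image[x][y] * cmath.exp(1j * angle)
            out[u][v] = total
    return out


def column_mean_magnitudes(coeffs):
    """Mean of |F(u, v)| over u for each column v (assumed feature layout)."""
    M, N = len(coeffs), len(coeffs[0])
    return [sum(abs(coeffs[u][v]) for u in range(M)) / M for v in range(N)]
```

For the 2×2 image `[[1, 2], [3, 4]]`, the coefficients are F(0,0) = 10, F(0,1) = -2, F(1,0) = -4, and F(1,1) = 0, so the two column features are 7 and 1.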

A DCT expresses a finite sequence of data points in terms of a sum of cosine functions oscillating at different frequencies. The DCT, and in particular the DCT-II, is often used in image processing [6]. The DCT is given by the following equation:

\begin{array}{l} C (u, v) = α (u) α (v) \sum_{x = 0}^{M - 1} \sum_{y = 0}^{N - 1} f (x, y) \cos [\frac{π (2 x + 1) u}{2 M}] \\ \times \cos [\frac{π (2 y + 1) v}{2 N}] . \end{array}

(2)
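Equation (2) can likewise be evaluated directly from its definition. The text does not define the scaling factors α(u) and α(v); the sketch below assumes the common orthonormal choice, α(0) = sqrt(1/M) and α(u) = sqrt(2/M) for u > 0 (with N in place of M for v). The function name `dct2` is hypothetical.

```python
import math


def dct2(image):
    """Naive 2-D DCT-II of a list-of-rows image, following equation (2)."""
    M, N = len(image), len(image[0])

    def alpha(k, size):
        # Assumed orthonormal scaling; the text does not define alpha.
        return math.sqrt(1.0 / size) if k == 0 else math.sqrt(2.0 / size)

    out = [[0.0] * N for _ in range(M)]
    for u in range(M):
        for v in range(N):
            total = 0.0
            for x in range(M):
                for y in range(N):
                    total += (image[x][y]
                              * math.cos(math.pi * (2 * x + 1) * u / (2 * M))
                              * math.cos(math.pi * (2 * y + 1) * v / (2 * N)))
            out[u][v] = alpha(u, M) * alpha(v, N) * total
    return out
```

Under this scaling, a constant 2×2 image of ones has all its energy in C(0,0) = 2, and the remaining coefficients vanish up to rounding.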

DWT has gained widespread acceptance in image compression. The most commonly used wavelets were formulated by the Belgian mathematician Ingrid Daubechies in 1988. This formulation is based on the use of recurrence relations to generate progressively finer discrete samplings of an implicit mother wavelet function [7].

The DWT of each vehicle image is calculated by passing it through a series of filters: a low-pass filter and a high-pass filter. The outputs are then divided into (i) detail coefficients (d), from the high-pass filter, and (ii) approximation coefficients (a), from the low-pass filter. The two filters are related to each other and are known as a quadrature mirror filter. Since half of the frequencies of the image have now been removed, half of the samples can be discarded according to the Nyquist rule [8].

There are three different detail coefficients: vertical (V), horizontal (H), and diagonal (D). The obtained detail coefficients are passed through three different thresholds in order to reduce the number of coefficients and remove nonsignificant values. The 2-D DWT detail coefficients of order $j$ can be estimated through the following equation:

\begin{matrix} W_{φ}^{i} (j, u, v) = \frac{1}{\sqrt{M N}} \sum_{x = 0}^{M - 1} \sum_{y = 0}^{N - 1} f (x, y) φ_{j, u, v}^{i} (x, y), \end{matrix}

(3)

where $i$ denotes the detail coefficients, $i \in \{H, V, D\}$ (horizontal, vertical, and diagonal), and $φ_{j, u, v}^{i} (x, y) = 2^{j / 2} φ^{i} (2^{j} x - u, 2^{j} y - v)$ is the wavelet function.

The 2-D DWT of order (j) based approximate coefficients can be estimated through the following equation:

\begin{matrix} W_{ϕ} (j_{0}, u, v) = \frac{1}{\sqrt{M N}} \sum_{x = 0}^{M - 1} \sum_{y = 0}^{N - 1} f (x, y) ϕ_{j_{0}, u, v} (x, y), \end{matrix}

(4)

where $ϕ_{j_{0}, u, v} (x, y) = 2^{j_{0} / 2} ϕ (2^{j_{0}} x - u, 2^{j_{0}} y - v)$ is the scaling function.
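To make the four sub-bands concrete, here is a minimal one-level 2-D Haar (db1) decomposition in plain Python. It is a sketch, assuming an image with even height and width and an orthonormal scaling of 1/2 per 2×2 block. Sign and naming conventions for the detail bands differ between libraries, so the labels here are assumptions. The function name `haar_dwt2` is hypothetical.

```python
def haar_dwt2(image):
    """One-level 2-D Haar DWT of an image with even height and width.

    Returns (approximation, horizontal, vertical, diagonal) sub-bands,
    each half the size of the input in both directions.
    """
    rows, cols = len(image), len(image[0])
    approx, horiz, vert, diag = [], [], [], []
    for r in range(0, rows, 2):
        ra, rh, rv, rd = [], [], [], []
        for c in range(0, cols, 2):
            p, q = image[r][c], image[r][c + 1]
            s, t = image[r + 1][c], image[r + 1][c + 1]
            ra.append((p + q + s + t) / 2.0)  # low-pass in both directions
            rh.append((p + q - s - t) / 2.0)  # top row minus bottom row
            rv.append((p - q + s - t) / 2.0)  # left column minus right column
            rd.append((p - q - s + t) / 2.0)  # diagonal difference
        approx.append(ra)
        horiz.append(rh)
        vert.append(rv)
        diag.append(rd)
    return approx, horiz, vert, diag
```

Applying the function again to the approximation band would give a second level of decomposition with the same Haar filters.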

5. Image Processing and Feature Extraction Using Empirical Mode Decomposition (EMD)

The key part of the method is the empirical mode decomposition, with which any complicated dataset can be decomposed into a finite and often small number of intrinsic mode functions (IMFs) that admit well-behaved Hilbert transforms. This decomposition method is adaptive and, therefore, highly efficient. Since the decomposition is based on the local characteristic time scale of the vehicle data, it is applicable to nonlinear and nonstationary processes. With the Hilbert transform, the IMFs yield instantaneous frequencies as functions of the spatial domain parameters $(u, v)$ that give sharp identifications of embedded structures. The final presentation of the results is an energy-frequency distribution over the spatial domain parameters, designated as the Hilbert spectrum [9].

There are several reasons for using EMD to analyze vehicle images in the training and testing processes. They may be summarized as follows.

(i)

It does not assume a prior basis function for the decomposition and thus it is fully adaptive.

(ii)

It can separate non-stationary oscillations.

(iii)

It does not require spurious harmonics to represent nonlinear data.

(iv)

It can give a meaningful instantaneous frequency representation.

EMD decomposes data based on its IMFs instead of a set of predefined basis functions. It was proposed by Huang et al. in 1998 [9]. It can adaptively extract the oscillatory modes at each time from a complex signal; namely, it can decompose the signal into a finite (often small) number of IMFs. In addition, EMD is especially suited for analyzing nonlinear and nonstationary data sequences [10, 11]. Any IMF must satisfy two conditions: (i) in the whole dataset, the number of extrema and the number of zero crossings must be equal or differ at most by one; (ii) at any point, the mean value of the envelope defined by the local maxima and the envelope defined by the local minima is zero. The Hilbert-Huang transform (HHT) consists of two processes: performing EMD of the signal and calculating the Hilbert spectrum of the resulting IMFs. From these spectra, a time-frequency representation of the IMFs can be determined. The signal $x (t)$ can be expressed as

\begin{matrix} x (t) = Re \sum_{i = 1}^{n} a_{i} (t) e^{j \int w_{i} (t) d t}, \end{matrix}

(5)

where $w_{i} (t) = d θ_{i} (t) / d t$ is the instantaneous frequency, $a_{i} (t)$ is the instantaneous amplitude, and $θ_{i} (t)$ is the instantaneous phase. However, the notion of the instantaneous frequency has been highly controversial. An intrinsic mode function (IMF) is a function that satisfies two conditions.

(i)

In the whole data set, the number of extrema and the number of zero crossings must either be equal or differ at most by one.

(ii)

At any point, the mean value of the envelope defined by the local maxima and the envelope defined by the local minima is zero [12].
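Condition (i) is easy to check numerically on a sampled 1-D signal. The helper names below are hypothetical, and the sketch makes simplifying assumptions: only strict extrema are counted, and samples that are exactly zero are not treated as crossings.

```python
def count_extrema(signal):
    """Count strict local maxima and minima of a 1-D sequence."""
    count = 0
    for i in range(1, len(signal) - 1):
        left, mid, right = signal[i - 1], signal[i], signal[i + 1]
        if (mid > left and mid > right) or (mid < left and mid < right):
            count += 1
    return count


def count_zero_crossings(signal):
    """Count sign changes between consecutive samples."""
    return sum(1 for a, b in zip(signal, signal[1:]) if a * b < 0)


def satisfies_count_condition(signal):
    """IMF condition (i): extrema and zero-crossing counts differ by at most one."""
    return abs(count_extrema(signal) - count_zero_crossings(signal)) <= 1
```

An oscillation around zero such as `[0.5, 1, 0.5, -0.5, -1, -0.5, 0.5, 1, 0.5]` passes the check (three extrema, two zero crossings), whereas a wave riding on a positive offset such as `[1, 3, 2, 3, 1]` fails it (three extrema, no zero crossings).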

The following basic steps describe the complete sifting algorithm used to extract the IMFs from each captured image after moving from time-domain analysis to 2-D spatial-domain analysis; this can be considered a new contribution in feature extraction [9, 13].

Step 1.

Calculate the upper and lower envelopes of the vehicle image $x (u, v)$ and their mean value $m_{1} (u, v)$ , where

\begin{matrix} x (u, v) = Re \sum_{i_{1} = 1}^{n_{1}} \sum_{i_{2} = 1}^{n_{2}} a_{i} (u, v) b_{i} (u, v) e^{j \int \int w_{i_{1}} (u, v) w_{i_{2}} (u, v) d u d v} \end{matrix}

(6)

$w_{i_{1}} (u, v) = d^{2} θ_{i_{1}} (u, v) / d u d v$ and $w_{i_{2}} (u, v) = d^{2} θ_{i_{2}} (u, v) / d u d v$ are the instantaneous frequencies, $a_{i} (u, v)$ and $b_{i} (u, v)$ are the instantaneous amplitudes, and $θ_{i_{1}} (u, v)$ and $θ_{i_{2}} (u, v)$ are the instantaneous phases.

Step 2.

Calculate $h_{1} (u, v) = x (u, v) - m_{1} (u, v)$ .

Step 3.

Check if $h_{1} (u, v)$ satisfies the IMF properties.

Step 4.

If not, use $h_{2} (u, v) = h_{1} (u, v) - m_{2} (u, v)$ to obtain new h, where $m_{2} (u, v)$ is found from $h_{1} (u, v)$ as in Step 1.

Step 5.

Continue until an $h_{k} (u, v)$ satisfies the IMF properties. When done, $c_{1} (u, v) = h_{k} (u, v)$ is the first IMF.

Step 6.

Treating the residue $r (u, v) = x (u, v) - c_{1} (u, v)$ as the new signal, continue from Step 1 to get the higher IMFs, up to $c_{n} (u, v)$ .
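Steps 1 and 2 can be illustrated with a deliberately simplified 1-D sketch. The assumptions are significant: the paper works on 2-D images, and EMD implementations normally build the envelopes with cubic splines, whereas this example uses piecewise-linear envelopes pinned to the end samples. The names `sift_once` and `_interp` are hypothetical.

```python
def _interp(points, n):
    """Piecewise-linear interpolation through sorted (index, value) points."""
    out = []
    k = 0
    for i in range(n):
        # Advance to the segment that contains sample i.
        while k + 1 < len(points) - 1 and points[k + 1][0] <= i:
            k += 1
        (i0, v0), (i1, v1) = points[k], points[k + 1]
        out.append(v0 + (v1 - v0) * (i - i0) / (i1 - i0))
    return out


def sift_once(x):
    """One sifting step: subtract the mean of the upper and lower envelopes."""
    n = len(x)
    ends = [(0, x[0]), (n - 1, x[-1])]  # pin both envelopes to the end samples
    maxima = [(i, x[i]) for i in range(1, n - 1) if x[i - 1] < x[i] > x[i + 1]]
    minima = [(i, x[i]) for i in range(1, n - 1) if x[i - 1] > x[i] < x[i + 1]]
    upper = _interp(sorted(maxima + ends), n)
    lower = _interp(sorted(minima + ends), n)
    mean = [(u + l) / 2.0 for u, l in zip(upper, lower)]
    return [xi - mi for xi, mi in zip(x, mean)]
```

For `x = [3, 4, 3, 2, 3, 4, 3, 2, 3]`, an oscillation riding on an offset of 3, one sifting step removes the offset in the interior: the samples at indices 3, 4, and 5 become -1, 0, and 1. Steps 3 to 6 would repeat this step until the IMF conditions hold, then continue on the residue.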

6. Recognition Using ANNs

A feedforward backpropagation algorithm is used for the recognition process in each case of feature extraction. The backpropagation algorithm trains a given feedforward multilayer neural network on a given set of input vehicle images with known classifications. When each entry of the sample set is presented to the network, the network examines its output response to the sample input pattern. The output response is then compared to the known, desired output, and the error value is calculated. Based on the error, the connection weights are adjusted. The backpropagation algorithm is based on the Widrow-Hoff delta learning rule, in which the weights are adjusted according to the mean square error of the output response to the sample input [14]. The set of sample patterns is repeatedly presented to the network until the error value is minimized. Algorithm 1 and Table 1 show the basic steps of the feature extraction and recognition processes for both the transformed domain and the EMD algorithm.
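The Widrow-Hoff delta rule that underlies backpropagation can be shown on a single linear unit. This sketch is not the multilayer network used in the paper: it trains one neuron with one weight per input plus a bias. The function name `train_lms` and its default learning rate and epoch count are assumptions chosen for the example.

```python
def train_lms(samples, targets, lr=0.1, epochs=200):
    """Train one linear unit with the Widrow-Hoff (LMS) delta rule."""
    weights = [0.0] * len(samples[0])
    bias = 0.0
    for _ in range(epochs):
        for x, t in zip(samples, targets):
            y = sum(w * xi for w, xi in zip(weights, x)) + bias
            err = t - y  # desired output minus actual output
            # Move each weight in proportion to the error and its input.
            weights = [w + lr * err * xi for w, xi in zip(weights, x)]
            bias += lr * err
    return weights, bias
```

Trained on the samples `[[0.0], [1.0], [2.0]]` with targets `[1.0, 3.0, 5.0]`, the unit approaches the line y = 2x + 1. Backpropagation applies the same error-driven update layer by layer through differentiable transfer functions.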

Table 1

Basic feature extraction and recognition steps in all transformed domain case studies.

Case study (1) Transformed domain based FFT

Feature extraction

(i) Read, format (as a double), and convert (as a grey scale) vehicle images per queue.

(ii) Calculate FFT coefficients for each image.

(iii) Extract absolute value of FFT coefficients in matrix format.

(iv) Calculate mean values of extracted coefficients in the previous step.

(v) Rearrange mean value coefficients as a column matrix.

(vi) Concatenate the coefficients of the previous step of all vehicle images together in a matrix form.

Training NNs

(i) Specify input value from the concatenated matrix.

(ii) Specify target value with predefined indices.

(iii) Set the number of Neural Network hidden layers.

(iv) Start network

net = newff(input values, target values, [], {'logsig', 'hardlim'});

(v) Set properties of neural network as follows:

(1) Number of repeats
net.trainparam.epochs = 100,

(2) the accuracy needed
net.trainparam.goal = 0.001

(3) train ratio 70%
net.divideParam.trainRatio = 70/100,

(4) Validation ratio 15%
net.divideParam.valRatio = 15/100,

(5) Test ratio 15%
net.divideParam.testRatio = 15/100,

(vi) Learning the network using the previous properties
[net] = train (net, input values, target values)

(vii) Saving trained network for testing phase.

Testing NNs

(i) Read, format (as a double), and convert (as a grey) vehicle image of interest.

(ii) Find FFT coefficients for that vehicle.

(iii) Extract absolute value of FFT coefficients in matrix format.

(iv) Find mean values of extracted coefficients in the previous step.

(v) Rearrange mean value coefficients as a column matrix.

(vi) Apply testing: out = sim(net, mean values as a column matrix).

(vii) Switching for recognition decision

desired = round value of (out);

switch between (desired)

case (i)

msgbox(“match and this is the car with index i”)

error = i-out;

save error1.mat

otherwise

msgbox(“mismatch”)

end

Case study (2) Transformed domain based DCT

Repeat all the FFT phases with the following modifications.

(i) In feature extraction phase, calculate DCT instead of FFT with the use of all sinusoidal coefficients (no absolute values).

(ii) In Training NNs phase, no changes as we just use the extracted DCT coefficients.

(iii) In Testing NNs phase, we just use DCT instead of FFT with the use of all coefficients.

Case study (3) Transformed domain based DWT of Daubechies with level 1 (Haar) and level 2 decomposition

Feature extraction

(i) Read, format (as a double), and convert (as a grey scale) vehicle images per queue.

(ii) Extract horizontal, vertical, diagonal, and approximate wavelet coefficients of level 1 and level 2 decomposition.

(iii) Concatenate horizontal, vertical, and diagonal coefficients.

(iv) Find the transpose of the concatenated matrix.

(v) Calculate mean values of extracted coefficients in the previous step.

(vi) Rearrange mean value coefficients as a column matrix.

(vii) Concatenate the coefficients of the previous step of all vehicle images together in a matrix.

(i) In Training NNs phase, no changes as we just use the extracted wavelet coefficients.

(ii) In Testing NNs phase, we just use wavelet coefficients instead of FFT or DCT, using the horizontal, vertical, and diagonal coefficients extracted from the vehicle image of interest.

Algorithm 1: IMFs extraction based EMD.

IMFs based EMD

(i)

Read, format (as a double), and convert (as a grey scale) vehicle images per queue.

(ii)

Apply EMD technique for each image

(1)

Calculate both envelope and mean for each image

(2)

Subtract the mean from the image

(3)

Check if it satisfies the two conditions of the IMFs

(4)

If not, check again using new envelope and new mean

(5)

Continue until the two conditions are satisfied

(6)

This gives the first IMF; repeat on the residue to obtain the higher IMFs, up to the last IMF

(iii)

Rearrange IMFs as a column matrix

(iv)

concatenate the coefficients of the previous step of all vehicle images together in a matrix form

7. Estimating Vehicular Waiting Time

7.1. Time Calculation through Camera Coordination

The following steps describe how vehicular waiting time is calculated.

(1)

RSU₁ through its installed camera captures image scenes for vehicles then uses the images to start training and building an ANN.

(2)

RSU₁ registers the time $(T_{RSU 1})$ and location when a vehicle crosses the RSU module. The exact location of the RSU is fixed and known a priori, so the relative location of the vehicle with respect to the RSU module can be calculated.

(3)

In case of failure of an RSU, a resend scenario restarts the process, as shown in Figure 3, according to the direction of the queue, as discussed before.

(4)

A feed-forward back propagation neural network has been utilized and built with a differentiable transfer function which uses from 2 to 3 scenes for each vehicle.

Each NN uses transformed domain feature extractor to extract most salient features for the training process.

(5)

The resulting trained ANN structure and timing information is forwarded to RSU₂ via the RSU Wi-Fi interface and through its locally maintained routing tables via OLSR. We assume that RSUs are synced in time.

(6)

RSU₂, through its camera, captures image scenes of vehicles. The trained ANN is used to recognize vehicles.

The images taken are used to fine-tune the training of the ANN. The RSU registers the time of vehicle crossing by RSU.

The fine-tuned ANN and registered time are forwarded to RSU₃ via the RSU Wi-Fi interface.

(7)

As the vehicle is identified, the RSU registers the time of vehicle crossing by RSU₂, and the waiting time is given by the relation (waiting time = $T_{RSU 2} - T_{RSU 1}$ ).

The fine-tuned ANN and registered time are forwarded to RSU₃ via the RSU Wi-Fi interface. Algorithm 1 and Table 1 show how the RSU sends the NNs in case of activation.

(8)

The process is repeated for RSUs as shown in Algorithm 1 and Table 1. Simple timing calculations can provide information on the waiting times of vehicles as well as their average speeds. Averaging timing and speed information over a number of vehicles increases the accuracy of this information.

(9)

Finally, accurate vehicular waiting time information is forwarded via the gateway and the Internet to Traffic Management Center headquarters.
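The timing calculations in the steps above reduce to differences of synchronized timestamps. The sketch below assumes each recognized vehicle yields a pair of crossing times in seconds at two consecutive RSUs; the function names and the data layout are hypothetical.

```python
def waiting_time(t_first, t_second):
    """Waiting time of one vehicle between two RSUs, in seconds."""
    return t_second - t_first


def average_waiting_time(crossings):
    """Average waiting time over (t_rsu1, t_rsu2) timestamp pairs."""
    times = [waiting_time(t1, t2) for t1, t2 in crossings]
    return sum(times) / len(times)


def average_speed(distance_m, crossings):
    """Average speed in m/s over the same pairs, given the RSU spacing."""
    speeds = [distance_m / waiting_time(t1, t2) for t1, t2 in crossings]
    return sum(speeds) / len(speeds)
```

For two vehicles with crossings `[(0.0, 10.0), (5.0, 25.0)]` and RSUs 40 m apart, the average waiting time is 15 s and the average speed is 3 m/s. Averaging over more vehicles reduces the effect of an occasional recognition failure.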

7.2. Coordination Algorithm with Respect to Vehicle Speed

As shown in Figure 4, the following assumptions are made:

(1)

ANNs processing time = t (and it is a variable depending on the algorithm of interest),

(2)

predefined threshold expiry time = $t_{0}$ (constant),

(3)

distance between any two RSUs = $d$ (37–48 m), and the height of the lighting column is 4–5 m (infrastructure constants),

(4)

vehicle velocity = v,

(5)

time taken by a vehicle at RSU₁ to reach RSU₂ = $t_{1}$ , given by the relation

\begin{matrix} t_{1} = \frac{d}{v} . \end{matrix}

(7)

Figure 4

Proposed infrastructure.

8. Experimental Results

We ran simulation experiments to study the performance of the proposed ANN system based on DWT, DCT, FFT, and EMD. The objective of the experiments is to measure four performance indices that give a good view of a hardware implementation: (i) ANN performance, (ii) regression performance, (iii) vehicle recognition error (target value minus output value), and (iv) processing time.

In our simulations, we used the following parameters for all tested ANNs using the three feature extractors:

(i) number of hidden neurons = 10, (ii) train ratio = 70%, (iii) validation ratio = 15%, and (iv) test ratio = 15%.

The performance of all tested ANNs was evaluated through Tables 3 and 4 which introduce a comparison study between all transformed domain extractors with respect to (i) number of inputs, (ii) number of NNs iterations (epochs), (iii) processing time, and (iv) best validation performance and recognition error.

For the FFT based feature extractor, the best validation performance was achieved at epoch 4, and the regression is approximately equal to 1, with a processing time of 8 sec.

For the DCT based feature extractor, the best validation performance was achieved at epoch 2, and the regression is approximately equal to 1, with a processing time of 7 sec.

The DWT based feature extractor was used with different levels of decomposition, as follows.

(i)

Daubechies level one (db1), also called the Haar family: we found that the best validation performance was achieved at epoch 3, and the regression deviates from 1, with a processing time of 3 sec; this deviation causes some vehicle recognition error. For this level of decomposition, car 3 was misrecognized as car 2. Calculating this error, we found that it was about (target – output = 1.2957). We believe that the error increases in this family because the low level of decomposition yields less salient wavelet coefficients.

(ii)

Daubechies level two (db2): we found that the best validation performance was achieved at epoch 3, and the regression is approximately equal to 1, with a small processing time of 2 sec. We believe this high performance is due to the higher level of decomposition, which yields the most salient wavelet coefficients.

For the EMD based feature extractor, the best validation performance was achieved at epoch 3, and the regression is approximately equal to 1, with a processing time of 1 sec. This new feature extraction algorithm proved powerful and more efficient for hardware implementation because the sifting algorithm provides only five IMFs with a minimal percentage of recognition error, and we can easily discriminate between cars of the same color following one another.

Figures 5, 6, 7, 8, and 9 show the mean squared error achieved versus the number of epochs for the training, testing, and validation samples. The best validation performance achieved with FFT, DCT, DWT db1, DWT db2, and EMD is $2.19 \times 10^{-7}$ , $1.4657 \times 10^{-5}$ , 1.1192, $2.1053 \times 10^{-7}$ , and $3.42 \times 10^{-9}$ , respectively.

Figure 5

Network performance for the FFT based.

Figure 6

Network performance for the DCT based.

Figure 7

Network performance for the DWT of family db1 (haar) based.

Figure 8

Network performance for the DWT of family db2 based.

Figure 9

Network performance based EMD.

These results show that EMD and DWT db2 are the highest-performing algorithms among all tested feature extractors, and DWT db1 (Haar) has the worst performance.

Figures 10, 11, and 12 show the regression $(R)$ , which models the relationship between the target value $(T)$ of the train, validation, and test samples and the corresponding output $(Y)$ . EMD and DWT db2 give the best fit of the data, meaning that the sum of the squares of the distances between the line and the data points is minimal, so $(R = 1)$ . However, DWT db1 gives the worst fit of the data because of the deviation from the $Y = T$ line.

Figure 10

Regression for the DWT db1 (Haar) family (worst-case solution).

Figure 11

Regression for the DWT db2 family (best-case solution).

Figure 12

Network regression for the EMD based extractor.

In all cases, the regression is represented by the straight line $(Y = T)$, which reflects the higher accuracy and minimal error of the features extracted using EMD and DWT db2. The results illustrate how EMD and DWT db2 achieve their target results within an acceptable processing time and number of salient features.
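The regression value can be illustrated with a short sketch, assuming that $R$ is the Pearson correlation coefficient between targets and outputs, as neural network toolboxes commonly report it; the text does not state this definition explicitly. The sample targets and outputs below are hypothetical and are not taken from the experiments.

```python
import math


def linear_regression(targets, outputs):
    """Least-squares fit outputs ~ slope * targets + intercept, plus Pearson R."""
    n = len(targets)
    if n != len(outputs) or n < 2:
        raise ValueError("need two or more paired samples")
    mean_t = sum(targets) / n
    mean_y = sum(outputs) / n
    s_tt = sum((t - mean_t) ** 2 for t in targets)
    s_yy = sum((y - mean_y) ** 2 for y in outputs)
    s_ty = sum((t - mean_t) * (y - mean_y) for t, y in zip(targets, outputs))
    if s_tt == 0 or s_yy == 0:
        raise ValueError("targets and outputs must each vary")
    slope = s_ty / s_tt
    intercept = mean_y - slope * mean_t
    r = s_ty / math.sqrt(s_tt * s_yy)
    return slope, intercept, r


# Hypothetical targets and near-perfect network outputs (illustrative only).
targets = [0.0, 0.0, 1.0, 1.0, 0.0, 1.0]
outputs = [0.001, -0.002, 0.999, 1.001, 0.000, 0.998]

slope, intercept, r = linear_regression(targets, outputs)
print(f"Y = {slope:.4f} * T + {intercept:.4f}, R = {r:.6f}")
```

A slope close to 1, an intercept close to 0, and $R$ close to 1 correspond to outputs that lie along the $Y = T$ line; a fit that departs from these values would indicate the kind of deviation reported for DWT db1.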

We can summarize the results of the algorithms with respect to the processing time of the NNs as in Table 2.

Table 2

Results summary.

Algorithm	FFT	DCT	DWT db1	DWT db2	EMD
ANN processing time (t)	8 sec	7 sec	3 sec	2 sec	1 sec

Table 3

Transformed domain network specifications.

	EMD	FFT	DCT	DWT db1 (Haar)	DWT db2
Number of inputs	5	194	194	97	98
Number of iterations	3	4	2	3	3
Processing time	1 sec	8 sec	7 sec	3 sec	2 sec
Best validation performance	$3.42 \times 10^{-9}$	$2.19 \times 10^{-7}$	$1.4657 \times 10^{-5}$	1.1192 (high error)	$2.1053 \times 10^{-7}$

Table 4

Percentage of recognition error per vehicle.

Car index	ANNs based EMD	ANNs based FFT	ANNs based DCT	ANNs based DWT db1 (Haar)	ANNs based DWT db2
Car 1	$3.4372 \times 10^{-5}$	$6.8246 \times 10^{-4}$	0.0031	$2.3385 \times 10^{-4}$	$7.8221 \times 10^{-4}$
Car 2	$2.0880 \times 10^{-5}$	$5.6914 \times 10^{-4}$	0.0047	$7.7325 \times 10^{-5}$	$1.1399 \times 10^{-4}$
Car 3	$1.08935 \times 10^{-5}$	$4.7098 \times 10^{-4}$	$8.9165 \times 10^{-4}$	1.2957 (error)	$8.2116 \times 10^{-5}$
Car 4	$4.8734 \times 10^{-5}$	0.0020	0.0048	$9.6645 \times 10^{-5}$	$2.8829 \times 10^{-5}$
Car 5	$8.5992 \times 10^{-5}$	$3.3354 \times 10^{-4}$	0.0045	0.0073	$2.0427 \times 10^{-4}$
Car 6	$2.22527 \times 10^{-5}$	0.0059	0.0049	$7.5130 \times 10^{-5}$	0.0016
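To make the per-vehicle figures in Table 4 easier to compare, the following sketch computes the mean and worst-case recognition error for each extractor. The values are transcribed from Table 4; the dictionary layout and function name are assumptions introduced for illustration, and the derived means are not results reported by the authors.

```python
from statistics import mean

# Recognition error per vehicle (Car 1 to Car 6), transcribed from Table 4.
RECOGNITION_ERROR = {
    "EMD": [3.4372e-05, 2.0880e-05, 1.08935e-05, 4.8734e-05, 8.5992e-05, 2.22527e-05],
    "FFT": [6.8246e-04, 5.6914e-04, 4.7098e-04, 0.0020, 3.3354e-04, 0.0059],
    "DCT": [0.0031, 0.0047, 8.9165e-04, 0.0048, 0.0045, 0.0049],
    "DWT db1": [2.3385e-04, 7.7325e-05, 1.2957, 9.6645e-05, 0.0073, 7.5130e-05],
    "DWT db2": [7.8221e-04, 1.1399e-04, 8.2116e-05, 2.8829e-05, 2.0427e-04, 0.0016],
}


def summarize(errors):
    """Map each extractor to its mean and worst-case error over all vehicles."""
    return {
        name: {"mean": mean(values), "worst": max(values)}
        for name, values in errors.items()
    }


summary = summarize(RECOGNITION_ERROR)

# Print extractors from the lowest to the highest mean error.
for name in sorted(summary, key=lambda k: summary[k]["mean"]):
    stats = summary[name]
    print(f"{name:8s} mean={stats['mean']:.3e} worst={stats['worst']:.3e}")
```

The worst-case column makes the single large DWT db1 error on Car 3 visible, which a mean alone would partly hide.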

We make three basic assumptions that describe the relationship between the applied algorithm and the vehicle speed.

(a) Assume a low level of crowding (vehicle speed is 50 km/hr); then

\begin{matrix} t_{1} = \frac{d}{v} = \frac{43}{(50000 / 3600)} = 3.1 \text{ sec}. \end{matrix}

(8)

This means that FFT and DCT, whose ANN processing times (8 sec and 7 sec, Table 2) exceed $t_{1}$, will introduce a mismatch in the recognition of the vehicles $(t_{1} \leq t_{0})$.

(b) Assume a high level of crowding (vehicle speed approaches zero km/hr); then

\begin{matrix} t_{1} = \frac{d}{v} \rightarrow \infty . \end{matrix}

(9)

This means that the vehicle takes much longer than the expiry threshold time $(t_{1} \gg t_{0})$, so RSU₂ sends a message to RSU₁ to resend a new vehicle image.

(c) Assume no crowding (vehicle speed is ≥120 km/hr); then

\begin{matrix} t_{1} = \frac{d}{v} = \frac{43}{(120000 / 3600)} = 1.3 \text{ sec}. \end{matrix}

(10)

This means that EMD is the only algorithm suitable for vehicle recognition.
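The arithmetic in assumptions (a) to (c) can be checked with a short sketch. It assumes that $d = 43$ is a distance in metres (since $v$ is converted from km/hr to m/s) and that an extractor is usable when its ANN processing time $t$ from Table 2 is below the travel time $t_{1}$. The function names are hypothetical, and the expiry threshold $t_{0}$ is not modelled here.

```python
# ANN processing time per feature extractor, in seconds (Table 2).
PROCESSING_TIME_S = {"FFT": 8, "DCT": 7, "DWT db1": 3, "DWT db2": 2, "EMD": 1}

# Distance d from the worked examples; taken to be metres (an assumption).
DISTANCE_M = 43.0


def travel_time_s(distance_m, speed_kmh):
    """t1 = d / v with v converted from km/hr to m/s; infinite for a stopped vehicle."""
    if speed_kmh <= 0:
        return float("inf")
    return distance_m / (speed_kmh * 1000.0 / 3600.0)


def extractors_in_time(speed_kmh, distance_m=DISTANCE_M):
    """Extractors whose processing time t is strictly below the travel time t1."""
    t1 = travel_time_s(distance_m, speed_kmh)
    return [name for name, t in PROCESSING_TIME_S.items() if t < t1]


for speed in (50, 120):
    t1 = travel_time_s(DISTANCE_M, speed)
    print(f"{speed} km/hr: t1 = {t1:.1f} sec, in time: {extractors_in_time(speed)}")
```

On timing alone, DWT db1 (3 sec) passes narrowly at 50 km/hr, where $t_{1} \approx 3.1$ sec. The sketch does not account for the high recognition error reported for db1, so the timing test is only one of the two criteria.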

We can summarize these assumptions as follows.

Case 1. One has

\begin{matrix} \text{if } t < t_{1} < t_{0}. \end{matrix}

(11)

(i) Vehicle speed around $50 \text{ km/hr}$. Recommendation: use EMD or DWT db2.

(ii) Vehicle speed around $120 \text{ km/hr}$. Recommendation: only EMD is recommended.

Case 2. One has

\begin{matrix} \text{if } t_{1} > t_{0} \end{matrix}

(12)

(occurs when the vehicle speed approaches zero). Then, rerun the NNs with another vehicle for the waiting time calculation.

The two cases should satisfy the condition

\begin{matrix} t_{1} = t_{0} - t . \end{matrix}

(13)
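The decision logic of Cases 1 and 2 can be sketched as a small function. This is an illustration under stated assumptions: the text gives no numeric value for the expiry threshold $t_{0}$, so the 10 sec used below is hypothetical, as are the function name and the returned labels. Condition (13) is not modelled.

```python
def classify(t, t1, t0):
    """Pick an action from processing time t, travel time t1, and expiry threshold t0."""
    if t1 > t0:
        # Case 2: the vehicle takes longer than the expiry threshold.
        return "resend"
    if t < t1 < t0:
        # Case 1: recognition finishes before the vehicle covers the distance.
        return "recognize"
    # Neither case applies, for example when t >= t1.
    return "mismatch"


EXPIRY_THRESHOLD_S = 10.0  # hypothetical value of t0, not given in the text

print(classify(t=1, t1=1.3, t0=EXPIRY_THRESHOLD_S))           # EMD at 120 km/hr
print(classify(t=8, t1=3.1, t0=EXPIRY_THRESHOLD_S))           # FFT at 50 km/hr
print(classify(t=1, t1=float("inf"), t0=EXPIRY_THRESHOLD_S))  # stopped traffic
```

Checking the expiry threshold first means that a stopped vehicle always triggers a resend, regardless of which extractor is in use.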

9. Conclusion and Future Work

In this paper, we proposed a novel system for predicting vehicular waiting times at different traffic locations and conditions, including traffic signals, border crossings, and work zones. We described the main components of the system and several real-life deployment concepts. We also described the image processing algorithm used to determine these waiting times. We used Wi-Fi based wireless communication to coordinate and share information between the different parts of the system.

We used artificial neural networks to recognize vehicles and transformed domain based algorithms to extract image features. The performance results indicate that using Empirical Mode Decomposition as a feature extractor provides our system with the most salient features, producing a minimal vehicle recognition error within an acceptable processing time. The result is an advanced automatic waiting time prediction system that works on still images rather than video frames, which decreases the processing time, the complexity of the neural networks, and the communication overhead between system modules.

In future work, we will conduct new experiments using other feature extractors, such as invariant moments and independent component analysis (ICA), and other recognition methodologies, such as 2-D correlation, fuzzy logic, and genetic algorithms.

Acknowledgment

This research was supported by the Center of Research Excellence in Hajj and Omrah (HajjCORE), Umm Al-Qura University, Makkah, Saudi Arabia, under project number P1129, entitled "UQU-SENSE: A Crowd-sourced Data Management Platform for Intelligent Transportation Systems."
