Using Traffic Light Signal to Enhance Intersection Foreground Detection Based on Video Sensor Networks

Abstract

Foreground detection plays an important role in traffic surveillance applications, especially at urban intersections. Background subtraction is an efficient approach to segmenting background and foreground with the static cameras of a video sensor network. When modelling the background, however, most statistical techniques adjust the learning rate, a crucial parameter that controls the updating speed, based only on changes in the video sequence. This causes slow adaptation to sudden environmental changes. For example, a stopped car fuses into the background before it moves again, which lowers segmentation performance. This paper proposes an efficient way to address the problem by taking physical world signals at traffic junctions into account. It assigns an adaptive learning rate to each pixel by integrating the traffic light signal obtained from sensor networks. Combined with such physical world signals, the background subtraction method is able to adapt to changes in the outside world instantly. We test our approach at a real urban traffic intersection; experimental results show that the new method increases detection accuracy and is promising.

1. Introduction

Intelligent video surveillance, which aims to make traffic more intelligent and to reduce the number of vehicle accidents, is a well-studied area with both existing application systems and new approaches still being developed. Within this area, detecting objects at intersections is one of the most significant focuses of typical intelligent transportation systems (ITS) applications and the basis of high-level processing. At most real intersections, a single camera is not enough to monitor the whole scene. A video sensor network provides a large-scale, redundant set of video streams for observing intersections [1, 2]. Because some video-based traffic monitoring systems include high-level descriptions of both cars and their behaviours, continuous tracking results are important for high-level processing. Background subtraction is a widely used technique for foreground detection that compares an observed image with an estimated background image containing no objects of interest. Before this method can be used, several parameters have to be determined. Among these parameters, the learning rate is particularly critical to performance. If the rate is large, slow or stopped vehicles fuse into the background quickly, as Figure 1 shows. If the rate is small, the background is not updated in time. In a traffic junction scene in particular, vehicles frequently encounter congestion and stop-and-go motion when the light is red, and a reasonable learning rate becomes even more important. Most current methods adjust the learning rate by relying only on changes in the video sequence. As a result, they are often unable to adapt to changes in the outside world instantly. When the traffic light is red, this mainly causes the tracking of vehicles to be interrupted. Once the light turns green, the cars move again and new tracks are created for them.
In summary, this harms the continuous tracking of foreground objects and reduces the accuracy of some high-level understanding methods, such as [3, 4]. Unlike previous methods, this paper focuses on how to adjust the learning rate according to real-time, accurate physical world signals from other sensors. This technique keeps the tracking of vehicles at traffic junctions continuous.

Figure 1

(a) shows several vehicles stopped during a red light. (b) shows the corresponding detection results. As time passes, some of the foreground objects gradually disappear.

In this paper, we select the common traffic light as the external signal to improve foreground detection results. To divide input images into reasonable regions, road line detection results are used. Because the camera is static, road lines need to be detected only once in the whole monitoring period. A red light signal indicates that vehicles will slow down and stop within some frames. To prevent these vehicles of interest from blending into the background and losing their existing tracks, we decrease the learning rates of pixels in the red light region, while the rates of other pixels remain unchanged. A green light signal means that vehicles run through the intersection at normal speed, so normal learning rates are selected. Experimental results show that this kind of environment information can greatly improve the results of background subtraction and foreground detection.

We previously did some work on improving detection results by combining the video sequence with physical world information, published in [5]. This paper is an expanded version of [5]: it analyzes the method in more detail, adds more comparative experiments, and adds a quantitative evaluation of the experimental results. The remainder of this paper is organised as follows. Section 2 presents a compact review of important developments and existing improvements in background subtraction and foreground detection. Section 3 presents the proposed framework, outlines three classical methods, and illustrates how our approach makes these methods perform better. Section 4 gives further explanations of our method in a practical application scenario and presents the results of comparative experiments between previous methods and our method. The last section gives conclusions and future work.

2. Related Research

Numerous works have been devoted to the development of background subtraction for real-time video processing. Several surveys of this topic can be found in [6, 7]. Statistical tools provide a good framework for modelling the background of a complex traffic scene, and so many methods have been developed.

Among these methods, the Gaussian mixture model (GMM), first presented in [8], models the distribution of the values observed at each pixel by a weighted mixture of Gaussians. GMM is able to cope with the multimodal nature of many practical situations and gives good results when there are repetitive background motions, such as shaking leaves or rippling water. GMM is by far the most researched and applied method. Many enhanced algorithms [9] have been proposed over the years; reviews of them can be found in [7, 10]. The weakness of GMM lies in its strong assumption that the background is more frequently visible than the foreground and that its variance is significantly lower. Also, the initialization of the model and the estimation of the parameters are problematic and uncertain in different real-world environments. Therefore, traditional GMM-based methods with empirical values usually do not produce good background subtraction results. To avoid the difficulty of finding an appropriate shape for the probability density function, nonparametric methods that use kernel density estimation to model background distributions have been proposed. These methods build a histogram of background values for each pixel by collecting values sampled from the pixel's recent time window [11]. In [12], a Bayesian framework that incorporates multiple types of features for modelling complex backgrounds is proposed (we abbreviate the foreground detection method in [12] as FGD); it handles sudden once-off background changes effectively.

All the background subtraction methods mentioned above still share a common problem: how to select an adaptive learning rate. Background modelling methods with a global empirical learning rate are significantly penalized. Over the years, many papers have discussed the adaptive learning rate and proposed various solutions for tuning it based on local intensity changes [13, 14], feedback from different levels [15, 16], and so on. Based on GMM, [17] proposed a background subtraction method using a pixel-wise adaptive learning rate for object tracking. Unlike traditional methods that use the same empirical learning rate for all pixels, it assigns a learning rate to each pixel based on two parameters: one depends on the difference in pixel intensity between the background model and the current frame, and the other depends on how long the pixel has been classified as a background pixel. In [18], the learning rates for the mean and the variance terms are decoupled so as to avoid the saturation phenomenon and the degeneracy problem; an adaptive learning rate is used to update the mean and a semiparametric model is used for the variance. The authors of [19] use the time gap between moving and stopped objects to train the background model and obtain adaptive parameters for urban traffic video. Considering the slow learning of GMM in the initial phase, KaewTraKulPong and Bowden improved the update mechanism in the learning step and proposed a fast-learning Gaussian mixture model [20]. In [21], the enhanced Gaussian mixture model detects objects that come to rest after moving and adjusts the learning rate to improve the detection of moving objects with intermittent stops. The authors of [22] modulated the learning rate of the background model based on scene activity. In [23], an updating method with an adaptive learning rate (we abbreviate this method as GMMX) is proposed to accurately segment objects that move slowly or stop for a while.

Even though many background subtraction approaches with adaptive learning rates have been proposed and have indeed improved on the naive GMM, as mentioned above, they still have limits and are not well suited to foreground detection at intersections. In particular, we find that with the previous methods many stopped cars gradually fuse into the background and cannot be tracked again. Firstly, most of these methods [14, 15, 17, 18, 21–23] perceive sudden changes, such as illumination changes and background movements, based only on image information. Therefore, the accuracy of perception is difficult to guarantee. Secondly, statistical methods often need a period of time to confirm and learn new changes, during which many detection mistakes are generated. Thus, some instant adjustment mechanism is important. Thirdly, image processing for perception needs additional computation, which adds to the burden on the system's real-time performance. These problems motivate us to propose a new method that perceives environment changes and adjusts the learning rate by integrating the traffic light signal into video sensor networks.

3. Our Approach

To the best of our knowledge, no existing method utilizes the traffic light signal to enhance vision-based background subtraction at intersections. This paper proposes to use such physical world signals as the criteria for adjusting background model parameters. Background modelling methods have several parameters, among which the learning rate α, which controls the updating speed of the model, is particularly important. By adjusting α according to the traffic light signal, stopped foreground objects are kept from fusing into the background. We use GMM [8], GMMX [23], and FGD [12] as experimental subjects and traffic light signals as external information. How these methods are enhanced by our approach is described in detail below. Note that our method can also work with other existing background subtraction approaches applied to intersection scenes.

3.1. The Proposed Framework

Figure 2 depicts the whole proposed framework for using the traffic signal to enhance vision-based background subtraction. Model initialization, the first step, assigns all the parameters needed and initializes the background model. Then, the system gets a new input image from the video sensor network and simultaneously receives physical world signals. The next module adjusts the learning rate according to the traffic light signal. The following step is background subtraction, which is the same as in previous methods, and it outputs the foreground detection results. Meanwhile, the system updates the background model and waits for the next input image. The grey modules are newly added in the framework and distinguish it from previous methods.

Figure 2

Framework of the proposed background subtraction.

As wireless sensors become widely available and their costs come down, traffic control systems are being integrated with wireless sensor networks (WSN) in ITS [24]. Our method is designed to obtain traffic light signals from the WSN. For convenience, we set the traffic light signal manually in the experiments. The learning rate is adjusted according to the traffic light signal: when the light is red, a small constant is selected, and once the light turns green, the normal value is applied. Both values are determined empirically. The following subsections describe the improved methods in detail.
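The loop of Figure 2 can be sketched in a few lines. The sketch below is only an illustration under stated assumptions: a simple running-average background stands in for GMM, GMMX, or FGD, the signal is encoded as the strings "red" and "green", and the name `run_framework` and the threshold of 30 grey levels are hypothetical choices, not the implementation used in the experiments.

```python
import numpy as np


def run_framework(frames_and_signals, alpha_normal=0.005, alpha_red=0.0001,
                  threshold=30.0):
    """Yield one foreground mask per frame, following the loop of Figure 2.

    frames_and_signals: iterable of (grey frame as 2-D array, "red" or "green").
    A running-average background is used here as a stand-in model (assumption).
    """
    background = None
    for frame, light in frames_and_signals:
        frame = np.asarray(frame, dtype=float)
        if background is None:
            # Model initialization: the first frame seeds the background.
            background = frame.copy()
        # Added module: adjust the learning rate from the traffic light signal.
        alpha = alpha_red if light == "red" else alpha_normal
        # Background subtraction: large deviations are marked as foreground.
        mask = np.abs(frame - background) > threshold
        # Model update with the selected learning rate.
        background = (1.0 - alpha) * background + alpha * frame
        yield mask
```

With a small rate under a red light, a patch that stops in front of the camera stays in the mask for much longer than with the normal rate, which is the behaviour the added modules are meant to produce.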

3.2. Improved GMM Modelling

In the context of a traffic surveillance system, Friedman and Russell [25] proposed to model each background pixel using a mixture of three Gaussians corresponding to road, vehicle, and shadows. The model is maintained with an incremental EM algorithm for real-time operation. Stauffer and Grimson [8] generalized this idea by modelling the recent history of the colour features of each pixel, $\{X_{1}, \dots, X_{t}\}$, by a mixture of K Gaussians.

The intensity of each pixel in the RGB colour space is selected as the feature to classify. In the multidimensional case, the probability of observing the current pixel value is given by

\begin{matrix} P (X_{t}) = \sum_{i = 1}^{K} ω_{i, t} η (X_{t}, μ_{i, t}, Σ_{i, t}), \end{matrix}

(1)

where the parameters are the number of Gaussians K and a weight $ω_{i, t}$ associated with the $i$th Gaussian at time t, which has mean $μ_{i, t}$ and covariance matrix $Σ_{i, t}$. η is a Gaussian probability density function:

\begin{matrix} η (X_{t}, μ, Σ) = \frac{1}{{(2 π)}^{n / 2} {| Σ |}^{1 / 2}} e^{(- 1 / 2) {(X_{t} - μ)}^{T} Σ^{- 1} (X_{t} - μ)} . \end{matrix}

(2)

For computational reasons, Stauffer and Grimson [8] assumed that the RGB colour components are independent and have the same variances. So, the covariance matrix is of the form

\begin{matrix} Σ_{i, t} = σ_{i, t}^{2} I . \end{matrix}

(3)

The K Gaussians are sorted in descending order of the ratio $ω_{j} / σ_{j}$. The first B Gaussian distributions whose cumulative weight exceeds a certain threshold T are retained as the background distribution:

\begin{matrix} B = \arg \min_{b} (\sum_{i = 1}^{b} ω_{i, t} > T) . \end{matrix}

(4)

The others are regarded as foreground distributions. When a new frame arrives at time $t + 1$, a match test is made for each pixel. A pixel matches a Gaussian distribution if

\begin{matrix} {({(X_{t + 1} - μ_{i, t})}^{T} Σ_{i, t}^{- 1} (X_{t + 1} - μ_{i, t}))}^{1 / 2} < k σ_{i, t} . \end{matrix}

(5)

When a match is found with one of the K Gaussians, for the matched component, the update is done as follows:

\begin{matrix} ω_{i, t + 1} = (1 - α) ω_{i, t} + α, \\ μ_{i, t + 1} = (1 - ρ) μ_{i, t} + ρ X_{t + 1}, \\ σ_{i, t + 1}^{2} = (1 - ρ) σ_{i, t}^{2} + ρ (X_{t + 1} - μ_{i, t + 1}) {(X_{t + 1} - μ_{i, t + 1})}^{T}, \end{matrix}

(6)

where α and ρ are two learning rates. For the unmatched component, only the weight is replaced by

\begin{matrix} ω_{j, t + 1} = (1 - α) ω_{j, t} . \end{matrix}

(7)

When no match is found, the least probable distribution is replaced with a new one with initial parameters.
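The update rules (4) to (7) can be illustrated for a single pixel. The following is a minimal sketch, not the implementation of [8]: the function names are hypothetical, the match test is read as the Euclidean distance falling within k standard deviations (one common reading of (5) under the isotropic covariance of (3)), the "least probable" component is taken to be the one with the lowest $ω / σ$, and the weights are renormalised after a replacement. These three choices are assumptions.

```python
import numpy as np


def gmm_pixel_step(x, weights, means, variances, alpha, rho, k=2.5,
                   init_weight=0.05, init_sigma=30.0):
    """Update one pixel's mixture in place; return the affected component.

    x: (3,) RGB value; weights: (K,); means: (K, 3);
    variances: (K,) holding sigma^2 of the covariance sigma^2 * I from Eq. (3).
    """
    x = np.asarray(x, dtype=float)
    sigma = np.sqrt(variances)
    # Match test, read as |x - mu| < k * sigma (assumed reading of Eq. (5)).
    dist = np.linalg.norm(x - means, axis=1)
    matched = np.flatnonzero(dist < k * sigma)
    if matched.size == 0:
        # No match: replace the lowest-ranked component with a new one.
        j = int(np.argmin(weights / sigma))
        weights[j] = init_weight
        means[j] = x
        variances[j] = init_sigma ** 2
        weights /= weights.sum()  # renormalisation is an assumption
        return j
    # If several components match, keep the highest-ranked one.
    i = int(matched[np.argmax(weights[matched] / sigma[matched])])
    weights *= (1.0 - alpha)   # Eq. (7), applied to every component
    weights[i] += alpha        # extra term of Eq. (6) for the matched one
    means[i] = (1.0 - rho) * means[i] + rho * x
    diff = x - means[i]
    variances[i] = (1.0 - rho) * variances[i] + rho * float(diff @ diff)
    return i


def background_components(weights, variances, T=0.6):
    """Indices of the first B components whose cumulative weight exceeds T."""
    order = np.argsort(-(weights / np.sqrt(variances)))
    cumulative = np.cumsum(weights[order])
    # Position of the first cumulative weight strictly greater than T.
    b = int(np.searchsorted(cumulative, T, side="right")) + 1
    return order[:b]
```

Because every matched frame adds α to the weight of the matched component, the value of α directly sets how fast a newly created component, such as one created by a stopped car, climbs the ranking used in (4).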

In our improved method, the learning rate is adaptively tuned in accordance with external physical world events. Whenever a new input image arrives, the system queries the traffic light signal to perceive environment changes and adjusts the learning rate immediately. The learning rate is changed as follows:

\begin{matrix} α = \begin{cases} α_{red}, & \text{when the light is red} \\ α_{normal}, & \text{when the light is green} . \end{cases} \end{matrix}

(8)

When the light is green, α is set to $α_{normal}$, a constant used by the original methods. Once the light turns red, α is correspondingly adjusted to $α_{red}$, a small constant that depends on the duration of the red light. From Figure 2, we can see that only the steps of receiving the signal and adjusting the learning rate are added. So this idea of improving models by integrating the traffic light signal can be generalized to many other methods.
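Rule (8), combined with the red light region described in the Introduction, amounts to building a per-pixel map of α. The sketch below is an illustration only: the function name, the string encoding of the signal, and the boolean region mask are hypothetical, while the two default rates are the values reported in Section 4.1.

```python
import numpy as np

ALPHA_NORMAL = 0.005  # normal learning rate reported in Section 4.1
ALPHA_RED = 0.0001    # learning rate reported for the red light


def learning_rate_map(light, red_region, alpha_normal=ALPHA_NORMAL,
                      alpha_red=ALPHA_RED):
    """Return a per-pixel learning rate following Eq. (8).

    light: "red" or "green" (hypothetical encoding of the signal).
    red_region: boolean mask of the pixels controlled by the red light,
    for example lanes delimited by the detected road lines (assumption).
    """
    if light not in ("red", "green"):
        raise ValueError("light must be 'red' or 'green'")
    alpha = np.full(red_region.shape, alpha_normal, dtype=float)
    if light == "red":
        # Only pixels in the red light region are slowed down.
        alpha[red_region] = alpha_red
    return alpha
```

Pixels outside the region keep the normal rate in both states, so ordinary background changes elsewhere in the image are still learned at the usual speed.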

3.3. Improved GMMX Modelling

The original GMM has many limitations, such as the need to predetermine the number of Gaussians, the need for good initialization, the dependence of the results on the true distribution law, which can be non-Gaussian, and slow recovery from failures. To alleviate these disadvantages, numerous improvements have been proposed in recent years. In this paper, we choose one of them for comparison, abbreviated as GMMX [23], which performs well at detecting temporarily stopped objects by using an adaptive learning rate.

The main contributions of [23] are a method that adapts the number of models to reduce the amount of computation and an updating method with an adaptive learning rate that accurately segments objects that move slowly or stop for a while (here we are only interested in the second). The authors argue that a fixed learning rate causes moving objects that stop for a short time to be rapidly absorbed into the background model by the GMM. Thus, different learning rates should be assigned to different distributions. When a new match is found at time t, the learning rate $α_{t}$ is changed as follows:

\begin{matrix} α_{t} = \max (α_{0}, ω_{M} \cdot α), \end{matrix}

(9)

where $ω_{M}$ is the weight of the matched distribution. The value of α should be higher than $α_{0}$; both are constants. The reason for evaluating $α_{t}$ as the maximum of $α_{0}$ and $ω_{M} \cdot α$, rather than as $ω_{M} \cdot α$ alone, is that when $ω_{M}$ has a very low value, $α_{t}$ would be almost zero if evaluated as $ω_{M} \cdot α$. This would make it difficult for an object that stays for a long time to be updated into the background model.

The traffic light signal can also be combined with GMMX by changing $α_{t}$ as follows:

\begin{matrix} α_{t} = \begin{cases} \min (α_{red}, \max (α_{0}, ω_{M} \cdot α)), & \text{when the light is red} \\ \max (α_{0}, ω_{M} \cdot α), & \text{when the light is green}, \end{cases} \end{matrix}

(10)

where $α_{red}$ is a small constant that prevents objects from fusing into the background in accordance with external signals. Because the red light may last for dozens of seconds, the value of $α_{t}$ should not exceed $α_{red}$. Otherwise, GMMX also lets stopped objects disappear, especially at a later stage when the matched component has a large $ω_{M}$. Thus, external signals are also needed to make the output more accurate.
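Rules (9) and (10) reduce to a few comparisons. The sketch below is illustrative only: the function name and the string encoding of the signal are hypothetical, and the default value of $α_{0}$ is a placeholder, since the paper does not report it; the defaults for α and $α_{red}$ are the values given in Section 4.1.

```python
def gmmx_learning_rate(w_matched, light, alpha=0.005, alpha_0=0.001,
                       alpha_red=0.0001):
    """Learning rate of the matched component under Eqs. (9) and (10).

    w_matched: weight of the matched distribution.
    light: "red" or "green" (hypothetical encoding of the signal).
    alpha_0 = 0.001 is a placeholder value, not taken from the paper.
    """
    # Eq. (9): never let the rate fall below the floor alpha_0.
    base = max(alpha_0, w_matched * alpha)
    if light == "red":
        # Eq. (10): cap the rate at alpha_red while the light is red.
        return min(alpha_red, base)
    return base
```

Note that whenever $α_{red}$ is chosen smaller than $α_{0}$, the inner maximum is always at least $α_{0}$, so the red branch of (10) evaluates to $α_{red}$ regardless of $ω_{M}$.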

3.4. Improved FGD Modelling

Li et al. proposed to classify background and foreground pixels under Bayes decision theory [12]. Let $v_{t}$ be a discrete-valued feature vector extracted from an image sequence at the pixel $s = (x, y)$ and time instant t. According to the Bayes rule, the posterior probability of $v_{t}$ coming from the background b or the foreground f is

\begin{matrix} P (C | v_{t}, s) = \frac{P (v_{t} | C, s) P (C | s)}{P (v_{t} | s)}, \quad C = b \text{ or } f . \end{matrix}

(11)

Using the Bayes decision rule, a pixel s is classified as background according to its feature vector $v_{t, s}$ observed at time t if

\begin{matrix} P (b | v_{t, s}) > P (f | v_{t, s}) . \end{matrix}

(12)

Note that the feature vectors associated with the pixel s are either from background or from foreground objects, and it follows that

\begin{matrix} P (v_{t} | s) = P (v_{t} | b, s) \cdot P (b | s) + P (v_{t} | f, s) \cdot P (f | s) . \end{matrix}

(13)

Substituting (11) and (13) into (12) gives

\begin{matrix} 2 P (v_{t} | b, s) \cdot P (b | s) > P (v_{t} | s) . \end{matrix}

(14)
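The step from (12) to (14) can be spelled out as follows. This is a short sketch that assumes only that b and f are the two mutually exclusive classes, so that the two posteriors sum to one, and that $P (v_{t} | s) > 0$:

\begin{matrix} P (b | v_{t}, s) + P (f | v_{t}, s) = 1 \Rightarrow (P (b | v_{t}, s) > P (f | v_{t}, s) \Leftrightarrow 2 P (b | v_{t}, s) > 1), \\ 2 \frac{P (v_{t} | b, s) P (b | s)}{P (v_{t} | s)} > 1 \Leftrightarrow 2 P (v_{t} | b, s) \cdot P (b | s) > P (v_{t} | s) . \end{matrix}

The first line uses the normalisation implied by (13); the second line replaces the posterior by its expression in (11) with $C = b$ and multiplies both sides by $P (v_{t} | s)$.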

In this method, the colours of a pixel are chosen as the feature for stationary background, while the colour cooccurrences of interframe changes at the pixel are chosen as the feature for moving background. A table of statistics for the possible principal features is established for each feature type at s, which is denoted as

\begin{matrix} S_{v_{t}}^{s, t, i} = \begin{cases} p_{v}^{t, i} = P (v_{t}^{i} | s) \\ p_{v, b}^{t, i} = P (v_{t}^{i} | b, s) \\ v_{t}^{i} = {[a_{1}^{i}, \dots, a_{n}^{i}]}^{T} . \end{cases} \end{matrix}

(15)

For each feature vector $v_{t}$ that is used to classify a pixel as foreground or background, the statistics of the corresponding features (colour or colour cooccurrence) are updated by

\begin{matrix} p_{b}^{s, t + 1} = (1 - α_{2}) p_{b}^{s, t} + α_{2} M_{b}^{s, t}, \\ p_{v}^{s, t + 1, i} = (1 - α_{2}) p_{v}^{s, t, i} + α_{2} M_{v}^{s, t, i}, \\ p_{v, b}^{s, t + 1, i} = (1 - α_{2}) p_{v, b}^{s, t, i} + α_{2} (M_{b}^{s, t} \land M_{v}^{s, t, i}), \end{matrix}

(16)

where $α_{2}$ is the learning rate that controls the speed of feature learning. $M_{b}^{s, t} = 1$ when s is labelled as background at time t; otherwise, $M_{b}^{s, t} = 0$. $M_{v}^{s, t, i} = 1$ when $v_{t}^{i}$ of $S_{v_{t}}^{s, t, i}$ in (15) matches $v_{t}$ best, and $M_{v}^{s, t, i} = 0$ for the remainder. A reference background image that represents the most recent appearance of the background is maintained at each time step to make the background difference accurate. If s is detected as a point of insignificant change in change detection, the reference background image is updated as

\begin{matrix} B (s, t + 1) = (1 - α_{1}) B (s, t) + α_{1} I (s, t) . \end{matrix}

(17)

The traffic light signal is also added to FGD, just as for GMM in Figure 2. The external signal is mainly used to adjust the feature learning rate $α_{2}$:

\begin{matrix} α_{2} = \begin{cases} α_{red}, & \text{when the light is red} \\ α_{normal}, & \text{when the light is green} . \end{cases} \end{matrix}

(18)

When the light is red, a small constant is used to keep stopped cars from vanishing quickly. Once the signal turns green, $α_{2}$ is set back to its normal value. This method not only preserves the robustness of foreground detection but also avoids missing slow-moving or stopped vehicles in the results.
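The statistics update (16) with the rate selection (18) can be sketched for one pixel and one feature type as follows. The function name, the argument layout, and the string encoding of the signal are hypothetical; the two default rates are the values reported in Section 4.1. This is an illustration of the update rule only, not the full FGD method of [12].

```python
import numpy as np


def fgd_update(p_b, p_v, p_vb, is_background, best_match, light,
               alpha_normal=0.01, alpha_red=0.0001):
    """Apply Eq. (16) once, with alpha_2 chosen by Eq. (18).

    p_b: prior of the background at the pixel.
    p_v, p_vb: arrays of table entries p_v^i and p_{v,b}^i.
    is_background: label of the pixel at time t (M_b).
    best_match: index i of the best-matching table entry, or None.
    light: "red" or "green" (hypothetical encoding of the signal).
    """
    alpha_2 = alpha_red if light == "red" else alpha_normal  # Eq. (18)
    p_v = np.asarray(p_v, dtype=float)
    p_vb = np.asarray(p_vb, dtype=float)
    m_b = 1.0 if is_background else 0.0
    m_v = np.zeros(len(p_v))
    if best_match is not None:
        m_v[best_match] = 1.0
    new_p_b = (1.0 - alpha_2) * p_b + alpha_2 * m_b
    new_p_v = (1.0 - alpha_2) * p_v + alpha_2 * m_v
    # M_b AND M_v is the product of the two 0/1 indicators.
    new_p_vb = (1.0 - alpha_2) * p_vb + alpha_2 * (m_b * m_v)
    return new_p_b, new_p_v, new_p_vb
```

With the smaller rate, the statistics of a pixel covered by a stopped vehicle drift far more slowly per frame, so the learned background description changes little over the duration of a red light.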

4. Experiment Results

In order to make the experiment more convincing, we recorded the video at a real traffic junction with a traffic light. We then ran GMM, GMMX, FGD, and their enhanced versions and collected and analysed the outputs.

4.1. Data and Qualitative Results

The test video, which consists of 2752 frames of $640 \times 480$ pixels acquired at 25 fps (frames per second), is taken from a busy intersection with a traffic light. Strong shadows cast by moving vehicles can be observed throughout the sequence, but shadow removal is beyond the scope of this paper, and we do not perform it in any of the experiments.

The important parameters are selected as follows. In GMM, we choose $K = 5$, $T = 0.6$, and $k = 2.5$. The learning rate is set to 0.005 and σ is initialized to 30. The initial weight associated with each Gaussian is 0.05. To make the experiments comparable, GMMX uses the same parameter values as GMM. In their enhanced versions, the learning rate is changed to 0.0001 when the light is red. In FGD, there are 64 and 32 bins in the joint histograms for colour and colour cooccurrence vectors, respectively. To make computation and storage efficient, we set $N_{1} = 15$ and $N_{2} = 30$ for colour features and $N_{1} = 25$ and $N_{2} = 40$ for colour cooccurrence features. The learning rate $α_{2}$ is set to 0.01. In accordance with the original work, we initialize the prior and conditional probabilities as $p_{b}^{s, 0} = 0$, $p_{v}^{s, 0, i} = 0$, and $p_{v, b}^{s, 0, i} = 0$ for $i = 1, \dots, N_{2}$ and $v_{t} = \{c_{t}, {cc}_{t}\}$. Likewise, improved FGD uses the same parameter values as FGD, except that $α_{2}$ is set to 0.0001 when the light is red.
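A rough calculation shows why the two GMM rates behave so differently during a red light. Under the weight update (6), a component that is matched in every frame satisfies $1 - ω_{n} = (1 - ω_{0}) {(1 - α)}^{n}$ after n frames. The sketch below solves this for n. It rests on simplifying assumptions that are not taken from the experiments: the weights sum to one, the stopped vehicle matches the same new component in every frame, and the weight level $1 - T = 0.4$ is used as a conservative level at which the component must belong to the background set of (4), since the other components can then no longer exceed T on their own. The component may join the background earlier, depending on its rank, so the result is an upper bound on the absorption time. The function name is hypothetical.

```python
import math


def frames_until_weight(target, alpha, w0=0.05):
    """Frames needed for a continuously matched component to reach `target`.

    Solves 1 - target = (1 - w0) * (1 - alpha)**n for n and rounds up.
    w0 is the initial weight of a newly created component.
    """
    return math.ceil(math.log((1.0 - target) / (1.0 - w0)) / math.log(1.0 - alpha))


FPS = 25  # frame rate of the test video
for alpha in (0.005, 0.0001):
    n = frames_until_weight(0.4, alpha)
    print(f"alpha={alpha}: {n} frames, about {n / FPS:.1f} s at {FPS} fps")
```

Under these assumptions, the normal rate reaches the level within a few seconds, whereas the red light rate needs a few minutes, well beyond a red phase of a few dozen seconds.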

In Figure 3, the first row shows four original frames selected randomly from the test video sequence: frames 2122, 2256, 2381, and 2417. These pictures show five cars that slow down and stop one after another. The images below them are the detection results of the three original approaches and their enhanced versions. The enhanced methods succeed in keeping the stopped vehicles in the detection results, while the outputs of the previous methods show that the cars at the front of each image have fused into the background. Our approach clearly improves the detection results.

Figure 3

(1)–(4) are four original frames selected randomly from the test video when the light is red. These images show us five cars that slow down and stop successively. The other images are the foreground detection results of three original methods and corresponding improved methods. (a)–(d) are the results of GMM, while (e)–(h) are the results of improved GMM. (i)–(l) are the results of GMMX, while (m)–(p) are the results of improved GMMX. (q)–(t) are the results of FGD, and the images of the last row are the results of improved FGD.

4.2. Quantitative Evaluation

In the evaluation, we choose 12 frames from the segment of the test video during which the light is red, which lasts about 25 seconds. In other words, one frame is selected every 2 seconds. We use these images to analyse the performance of our method.

Three terms are used in the quantitative evaluation: false positive (FP) is the number of background pixels that are wrongly marked as foreground; false negative (FN) is the number of foreground pixels that are wrongly marked as background; total error (TE) is the sum of FP and FN. We calculate these terms for each image according to the corresponding hand-segmented ground truth. The FN, FP, and TE of each approach are the sums of the per-frame FN, FP, and TE over the selected frames.
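The three terms can be computed directly from a detection mask and its ground truth. The following is a minimal sketch, assuming both are available as boolean arrays of the same shape in which True marks foreground; the function names are hypothetical.

```python
import numpy as np


def pixel_errors(detected, truth):
    """Return (FP, FN, TE) for one frame.

    detected, truth: boolean arrays of equal shape, True = foreground.
    """
    detected = np.asarray(detected, dtype=bool)
    truth = np.asarray(truth, dtype=bool)
    fp = int(np.count_nonzero(detected & ~truth))  # background marked foreground
    fn = int(np.count_nonzero(~detected & truth))  # foreground marked background
    return fp, fn, fp + fn


def summed_errors(pairs):
    """Sum (FP, FN, TE) over an iterable of (detected, truth) pairs."""
    totals = np.zeros(3, dtype=int)
    for detected, truth in pairs:
        totals += pixel_errors(detected, truth)
    return tuple(int(v) for v in totals)
```

A stopped vehicle that has been absorbed into the background shows up as a block of FN pixels, which is why FN is the term most sensitive to the learning rate.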

Figure 4 illustrates the overall performance on the selected twelve frames for the three previous methods and their improvements. The total error of the improved versions is lower than that of the previous ones. In particular, the new approach greatly reduces FN, which means that stopped cars do not disappear before the light turns green. The tracking of foreground objects is therefore not interrupted, which provides a solid foundation for much high-level image processing. However, FP increases slightly; reducing it is the next problem we plan to address.

Figure 4

Overall performance of the selected twelve frames for the three previous methods and their improvements.

In Figure 5, the FN of the different methods is plotted against the frame number. The horizontal axis is the frame number, while the vertical axis is the FN value (the number of foreground pixels that are wrongly marked as background). Different colours represent different methods. Figure 5 shows that the FN of the previous methods increases rapidly, which means that foreground objects disappear from the detection results, while the FN of the improved methods stays low. This effect is more obvious in the later part of the red light period.

Figure 5

The horizontal axis is the frame number, while the vertical axis is the FN value (the number of foreground pixels that are wrongly marked as background). Different colours represent different methods.

In summary, we conclude that our method can effectively improve the accuracy and reliability of foreground segmentation. Meanwhile, various background modelling methods are able to benefit from physical world signals.

5. Conclusions

We present a novel method that utilises the traffic light signal to enhance the performance of background subtraction, whereas existing methods use only image information to model and update the reference background. The paper describes the experimental process and results in detail. Compared with the FN, FP, and TE of previous methods such as GMM, GMMX, and FGD, the corresponding enhanced versions achieve lower FN and TE, at the cost of a slight increase in FP. This suggests that background subtraction methods that use the traffic light signal are promising. Considering that different pixels have different characteristics of colour change, it would be better to set different, more reasonable learning rates for each pixel based on these signals rather than the same constants for all pixels; this is the focus of our future research. Meanwhile, in order to combine background modelling methods with physical world signals more closely, we will study further relations between model parameters and these signals to obtain better results.

Footnotes

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgments

This work was supported by the fund of the State Key Laboratory of Software Development Environment (Grant no. SKLSDE-2012ZX-01) and the Fundamental Research Funds for the Central Universities.

References

1. Trivedi, M. M., Mikic, Kogut. Distributed video networks for incident detection and management. Proceedings of the IEEE Intelligent Transportation Systems, October 2000, 155–160.

2. Trivedi, M. M., Gandhi, T. L., Huang, K. S. Distributed interactive video arrays for event capture and enhanced situational awareness. IEEE Intelligent Systems, 2005, 20(5), 58–65.

3. Bragatto, T. A. C., Ruas, G. I. S., Benso, V. A. P., Lamar, M. V., Aldigueri, Teixeira, G. L., Yamashita. A new approach to multiple vehicle tracking in intersections using harris corners and adaptive background subtraction. Proceedings of the IEEE Intelligent Vehicles Symposium (IV ′08), June 2008, 548–553. doi:10.1109/IVS.2008.4621293

4. Zhang, Chen, S. C., Shyu, M. L. Adaptive background learning for vehicle detection and spatio-temporal tracking. Proceedings of the 4th Pacific Rim Conference on Multimedia Information, Communications and Signal Processing, 2003, 797–801.

5. Ding, Liu, Cui, Wang. Intersection foreground detection based on the cyber-physical systems. Proceedings of the IET International Conference on Information Science and Control Engineering, 2012, Shenzhen, China, 1881–1886.

6. Hedayati, Zaki, W. M. D. W., Hussain. A qualitative and quantitative comparison of real-time background subtraction algorithms for video surveillance applications. Journal of Computational Information Systems, 2012, 8(2), 493–505.

7. Bouwmans, El Baf, Vachon. Background modeling using mixture of Gaussians for foreground detection-a survey. Recent Patents on Computer Science, 2008, 1(3), 219–237. doi:10.2174/2213275910801030219

8. Stauffer, Grimson, W. E. L. Adaptive background mixture models for real-time tracking. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR '99), June 1999, 246–252.

9. Hou, Y.-J. X., Gong, S.-R. Adaptive shadows detection algorithm based on Gaussian Mixture Model. Proceedings of the International Symposium on Information Science and Engineering (ISISE ′08), December 2008, 116–120. doi:10.1109/ISISE.2008.249

10. Bouwmans, Baf. Statistical background modeling for foreground detection: a survey. Handbook of Pattern Recognition and Computer, 2010, 4, 181–199.

11. Tavakkoli, Nicolescu, Bebis, Nicolescu. Non-parametric statistical background modeling for efficient foreground region detection. Machine Vision and Applications, 2009, 20(6), 395–409. doi:10.1007/s00138-008-0134-2

12. Huang, I. Y.-H., Tian. Statistical modeling of complex backgrounds for foreground object detection. IEEE Transactions on Image Processing, 2004, 13(11), 1459–1472. doi:10.1109/TIP.2004.836169

13. Y.-H., Tian, H.-F., Zhang. An improved Gaussian mixture background model with real-time adjustment of learning rate. Proceedings of the International Conference on Information, Networking and Automation (ICINA ′10), October 2010, V1512–V1515. doi:10.1109/ICINA.2010.5636758

14. Shah, Deng, Woodford, B. J. Localized adaptive learning of Mixture of Gaussians models for background extraction. Proceedings of the 25th International Conference of Image and Vision Computing New Zealand (IVCNZ ′10), November 2010. doi:10.1109/IVCNZ.2010.6148870

15. Pnevmatikakis, Polymenakos. 2D person tracking using Kalman filtering and adaptive background learning in a feedback loop. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2007, 4122, 151–160.

16. Lin, H.-H., Chuang, J.-H., Liu, T.-L. Regularized background adaptation: a novel learning rate control scheme for gaussian mixture modeling. IEEE Transactions on Image Processing, 2011, 20(3), 822–836. doi:10.1109/TIP.2010.2075938

17. K. K., Delp, E. J. Background subtraction using a pixel-wise adaptive learning rate for object tracking initialization. Proceedings of the Visual Information Processing and Communication II, January 2011. doi:10.1117/12.872610

18. Bouttefroy, P. L. M., Bouzerdoum, Phung, S. L., Beghdadi. On the analysis of background subtraction techniques using Gaussian mixture models. Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP ′10), March 2010, 4042–4045. doi:10.1109/ICASSP.2010.5495760

19. Hwang, P.-S., Eom, K.-Y., Jung, J.-Y., Kim, M.-H. A statistical approach to robust background subtraction for urban traffic video. Proceedings of the 2nd International Workshop on Computer Science and Engineering (WCSE ′09), October 2009, 177–181. doi:10.1109/WCSE.2009.790

20. KaewTraKulPong, Bowden. An improved adaptive background mixture model for real-time tracking with shadow detection. Video-Based Surveillance Systems, 2002, 135–144. doi:10.1007/978-1-4615-0913-4_11

21. Ming. EGMM: an enhanced Gaussian mixture model for detecting moving objects with intermittent stops. Proceedings of the 12th IEEE International Conference on Multimedia and Expo (ICME ′11), July 2011. doi:10.1109/ICME.2011.6012011

22. Harville, Gordon, Woodfill. Adaptive video background modeling using color and depth. Proceedings of the IEEE International Conference on Image Processing (ICIP ′01), October 2001, 90–93.

23. Suo, Wang. An improved adaptive Background modeling algorithm Based on Gaussian Mixture model. Proceedings of the 9th International Conference on Signal Processing (ICSP ′08), October 2008, 1436–1439. doi:10.1109/ICOSP.2008.4697402

24. Tubaishat, Peng. Wireless sensor networks in intelligent transportation systems. Wireless Communications and Mobile Computing, 2009, 9(3), 287–302. doi:10.1002/wcm.616

25. Friedman, Russell. Image segmentation in video sequences: a probabilistic approach. Proceedings of the 13th conference on Uncertainty in artificial intelligence (UAI ′97), 1997, San Francisco, Calif, USA, 175–181.