Abstract
Image segmentation is an important preliminary process required in object tracking applications. This paper addresses the issue of unsupervised multi-colour thresholding design for colour-based multiple objects segmentation. Most of the current unsupervised colour thresholding techniques require adopting a supervised training algorithm or a cluster-number decision algorithm to obtain optimal threshold values of each colour channel for a colour-of-interest. In this paper, a novel unsupervised multi-threshold searching algorithm is proposed to automatically search the optimal threshold values for segmenting multiple colour objects. To achieve this, a novel ratio-map image computation method is proposed to efficiently enhance the contrast between colour and non-colour pixels. The Otsu's method is then applied to the ratio-map image to extract all colour objects from the image. Finally, a new histogram-based multi-threshold searching algorithm is developed to search the optimal upper-bound and lower-bound threshold values of hue, saturation and brightness components for each colour object. Experimental results show that the proposed method not only succeeds in separating all colour objects-of-interest in colour images, but also provides satisfactory colour thresholding results compared with an existing multilevel thresholding method.
Keywords
1. Introduction
Object tracking in video streams is essential to many intelligent vision systems. In such systems, image segmentation plays a necessary preliminary processing role in a variety of image processing applications, especially in object detection and recognition. Accurately segmenting foreground objects from a background image is a significant issue in many image analysis fields. Several sophisticated image segmentation techniques have been reported in the literature, however, nature image segmentation is still a challenging task due to an inherently ambiguous problem [1]. To handle such an inherent complexity, some researchers have tried to extend the traditional bilevel thresholding techniques to a multilevel process since histogram thresholding is one of the most widely used techniques in image segmentation. This is commonly known as multilevel thresholding.
Various multilevel thresholding techniques have been proposed for grey images. They roughly can be divided into parametric and nonparametric approaches [2]. In parametric approaches [3, 4], the probability density function (pdf) of each object's grey-level distribution is required. But, to estimate the pdf of the grey-level distribution is typically a nonlinear optimization problem. This usually leads to an inefficient algorithm with high computational complexity and poor performance. By contrast, nonparametric approaches directly determine the threshold values by optimizing some certain cost functions [5–8]. For instance, Otsu proposed a thresholding method that determines the optimal thresholds by maximizing a between-class variance criterion using an exhaustive search [5]. To speed up the maximization process in the conventional Otsu's method, Liao et al. proposed a fast multilevel global thresholding algorithm that maximizes a modified between-class variance with a look-up table acceleration approach [6]. However, this method still takes too much time for multilevel threshold selection since it requires evaluating all possible solutions. The authors in [7] proposed a new criterion for automatic multilevel thresholding with low computational complexity. Recently, Gao et al. used a quantum-behaved particle swarm optimization technique to improve the convergence rate of the Otsu's method [8]. These nonparametric multilevel thresholding approaches produce satisfactory segmentation results for grey images, however, the papers [5–8] considers only grey scale images (i.e., single channel) and cannot be directly applicable to multi channel images.
As to colour image thresholding, some researchers have employed neural-network techniques to achieve unsupervised colour segmentation. For instance, Wu et al. proposed an automatic self-organizing map (SOM) based multilevel thresholding algorithm for colour segmentation and human hand localization [9]. They developed a SOM transduction algorithm to learn the non-stationary colour distribution in HSI colour space that overcomes the issue of dynamic lighting condition. Haghighatdoost et al. proposed an automatic multilevel thresholding algorithm for colour image quantization [10]. They first used a growing time adaptive SOM network to find the 3-dimensional RGB histogram. Then a peak finding algorithm was employed to find the thresholds automatically. Deshmukh et al. proposed an adaptive colour image segmentation approach based on a multilayer neural-network, which detects multiple objects in an input image by independently segmenting saturation and intensity planes of the image [11]. Although the neural-network-based approaches usually work well in colour image segmentation, they have to undergo a supervised training process to obtain the best segmentation performance. This requirement restricts the applicability of the neural-network-based segmentation approaches.
Some researchers have attempted to use fuzzy set theory to deal with colour image segmentation [12–15]. The authors in [12] proposed a coarse-to-fine segmentation strategy based on the thresholding and fuzzy c-means (FCM) techniques. This method first coarsely segments the image using a thresholding technique. The fine segmentation process then assigns the unclassified pixels to the closest class using the FCM technique. Chaabane et al. also adopted the concept of the coarse-to-fine strategy to segment colour images, but utilized an automatic histogram thresholding method in the coarse segmentation stage [13]. Kurugollu et al. developed a colour image segmentation technique using fuzzy thresholding and Dempster-Shafer's fusion rule [14]. This method first uses a fuzzy thresholding method to fuzzify each colour band into mass functions of three classes (object class, background class and uncertain class). These mass functions are then merged using the Dempster-Shafer's orthogonal sum rule. Finally, the optimal thresholds are taken by comparing the mid point of belief intervals of the object and background classes. Recently, Yu et al. introduced an adaptive unsupervised clustering algorithm based on the FCM technique and ant colony optimization [15]. This method is able to adaptively detect the cluster centroid distribution and centroid number, which are two critical initialization problems in FCM approach. The fuzzy-logic segmentation approaches usually require the knowledge of cluster number, otherwise they will become quite sensitive to the initialization condition. This requirement increases the complexity of image segmentation, leading to an inefficient solution to the unsupervised colour image segmentation problem.
From the literature survey, how to develop a new unsupervised colour image segmentation method efficiently extracting multiple colours-of-interest from foreground pixels in the image remains a challenge. This problem motivates us to develop a computationally efficient unsupervised multilevel thresholding algorithm for colour image segmentation. The proposed algorithm consists of a colour-pixel extraction process and a histogram-based multi-threshold searching process. In the colour-pixel extraction process, a novel ratio-map image computation method is proposed to enhance the contrast between colour and non-colour pixels. This helps leading to an efficient colour-object extraction algorithm using a simple bilevel thresholding method, such as the Otsu's method. On the other hand, the histogram-based multi-threshold searching process automatically searches the optimal upper-bound and lower-bound threshold values of hue, saturation and brightness components for each colour object. Experimental results validate the performance of the proposed method by comparing with an existing multilevel thresholding method using nature colour images.
The rest of this paper is organized as follows. Section 2 defines the problem statement of this work. Section 3 introduces an existing method related to the development of the proposed algorithm. Section 4 presents the proposed multi-threshold searching algorithm in details. Experimental results are carried out in Section 5 to evaluate the performance of the proposed colour image segmentation approach. Section 6 concludes the contributions of this paper.
2. Problem statement
Image segmentation techniques have been widely used in various machine vision applications. To obtain the information of an object-of-interest from images, one of the most important tasks is to accurately extract all foreground objects from the background. Traditionally, a simple but efficient multilevel thresholding method that distinguishes pixels belonging to foreground and background based on colour distribution of a specific object-of-interest is used for this purpose. Let
where

An example of image threshold segmentation: (a) the original colour image and (b) the blue-colour segmentation result using expression (1).
Although the expression (1) provides a simple way to deal with colour image segmentation, the main challenge is how to determine the optimal threshold values of each colour channel for segmenting multiple colour objects. That is, for
where
3. The existing Otsu's image thresholding method
This section briefly describes the Otsu's method [5] that is a well-known image thresholding technique used in grey image segmentation. Suppose that a grey image consists of one or more foreground objects, each having different grey-level values. The purpose of image thresholding is to separate these objects from the background using one or more threshold values obtained from the histogram of the image. More specifically, if the image histogram contains
where
In the case of
where
The optimal threshold values are then obtained by maximizing the criteria (6) instead of (5) so that
The main advantage of using the modified criteria (6) is that the computation of maximization process (7) can be reduced by employing two look-up tables. Let
For index
From the expressions (8)-(10), it follows that
According to (11),
Substituting (12) and (13) into (6) yields
where
and the expression (14) then becomes
Since the value of
4. The proposed algorithm
This section presents the proposed multi-threshold searching algorithm, which consists of a colour-pixel extraction process and a histogram-based multi-threshold searching process. The former aims to distinguish colour and non-colour (i.e., white and black colours) regions of an input colour image, and the latter searches the optimal multiple thresholds for thresholding the colour regions of the image with minimum error.
4.1 Colour-pixel extraction process
Colour is an important cue for image segmentation. To extract the colour information from an image, the existing colour-based segmentation approaches usually require representing the image in the HSV colour space since the hue component of the HSV colour model measures the perceived colours of a visual sensing. Figure 2(a), for example, presents the hue component of the image shown in Fig. 1(a).

Colour information of the image shown in Fig. 1(a): (a) the hue component of the image and (b) the corresponding hue histogram. One can see that the hue histogram cannot efficiently distinguish the colour and non-colour object classes in the image.
The colour and non-colour regions of the image can then possibly be distinguished by finding the optimal multiple thresholds from the corresponding hue histogram. However, this method may lead to unreliable results because the hue histogram generally cannot distinguish the colour and non-colour object classes in the image (Fig. 2(b)). This problem motivates us to develop a new method to efficiently distinguish the colour and non-colour regions of the image.
Suppose that the input colour image is represented in HSV colour space. As inspired by the existing shadow detection methods [17, 18], the proposed colour extraction algorithm first highlights all non-colour pixels on the basis of the saturation and brightness components of the image such that
where
where


It is clear from Fig. 4(a) that the colour and non-colour object classes can be simply distinguished by a global threshold. Hence, the colour regions of the original image can be efficiently extracted by using the conventional Otsu's bilevel thresholding method to threshold the ratio-map image obtained from (17) and (18) (Fig. 4(b)). After extracting the colour regions of the image, one can obtain all colour pixels in the original image (Fig. 5(a)) and compute the corresponding hue histogram of the colour information (Fig. 5(b)). Visually comparing Fig. 2(b) with Fig. 5(b) finds that Fig. 5(b) contains only hue information of the colour pixels, which is benefit for the following multi-threshold searching process to determine the optimal colour thresholds effectively and efficiently.

Hue histogram of the colour regions of the image: (a) colour pixels in the extracted colour regions and (b) the corresponding hue histogram that will be used in the following multi-threshold searching process.
4.2 Histogram-based multi-threshold searching process
Figure 6 shows the flowchart of the proposed histogram-based multi-threshold searching algorithm, which consists of a peak-point detection process and a multi-threshold searching process. Suppose that the hue histogram of the colour regions of the image is obtained from the colour-pixel extraction process described in the previous section. The peak-point detection process aims to detect peak points in the input hue histogram depending on the mean value of the histogram. Since searching the peak points directly on the original hue histogram is very sensitive to image noise, the process first performs a smoothing operation on the input hue histogram to reduce the noise effect during peak-point detection (Fig. 7(a)).

Flowchart of the proposed histogram-based multi-threshold searching algorithm.

Processing steps of the proposed multi-threshold searching algorithm: (a) smoothing the hue histogram, (b) detecting peak points of the hue histogram, (c) setting initial thresholds related to the detected peak points and (d) searching final thresholds via the proposed dynamic threshold searching process.
Let
where
To search the optimal thresholds, a valid assumption is that the peak-point position is located within the range of lower-bound to upper-bound thresholds. Based on this assumption, the multi-threshold searching process can be divided into two unidirectional searching processes, one for searching lower-bound threshold and the other one for upper-bound threshold. Therefore, the first step in the proposed dynamic multi-threshold searching process is setting two initial thresholds around the peak point such that
where the operators max{Ω} and min{Ω} denote respectively the maximum and minimum element of the set Ω. The parameter λ in (20) is a non-zero scale factor satisfying 0 < λ < 1. Here, the default value of λ is set as 0.3 empirically. Figure 7(a) presents an example to explain the initial-threshold detection via expression (20). Note that if the peak-point position
where

Dynamic threshold searching process: flowchart of dynamic searching the (a) lower-bound threshold and (b) upper-bound threshold associated with the peak point
After the dynamic threshold searching process, a potential problem is that the range bounded by the obtained lower-bound and upper-bound threshold values may overlap or repeat each other. Figure 9(a) shows an example to explain this problem. In Fig. 9(a), there are four threshold pairs obtained from the dynamic threshold searching process: (

An example of merging the redundant threshold pairs: (a) the threshold pairs obtained from the dynamic threshold searching process and (b) the merged threshold pairs via the proposed threshold pair merging process.
It is clear from this case that the first and second threshold pairs are redundant since their thresholding ranges are within the range of the third threshold pair. This problem deteriorates the computational performance of the final colour thresholding process due to the redundant thresholding operations. To avoid generating the redundant threshold pairs, a threshold pair merging process is proposed to minimize the number of threshold pairs with the same thresholding ranges (Fig. 9(b)). Figure 10 shows the flowchart of the proposed threshold pair merging process, which aims to detect and reduce all redundancy in the input multiple threshold pairs according to three conditions. The first and second conditions detect the redundant threshold pairs that will be removed from the input threshold pairs. By contrast, the third condition detects the overlapped threshold pairs that will be merged together. Taking Fig. 9(a) as an example, the first two threshold pairs, (

Flowchart of the proposed threshold pair merging process.
In the process of searching saturation and brightness thresholds, we only find the lower-bound thresholds
5. Experimental results
This section contains two parts. The first part presents the experimental results of the proposed method compared with an existing method and the second part presents the extension of the proposed method to skin colour segmentation.
5.1 Performance evaluation of the proposed method
In the first experiments, two test images, as shown in Figs. 11(a) and 12(a), were employed to evaluate the performance of the proposed method.

Experimental results of multilevel thresholding process: (a) the original image, (b) thresholding by a manually supervised method, (c) multilevel thresholding by the fast Otsu's method, (d) the multiple thresholds of hue channel obtained from the fast Otsu's method, (e) multilevel thresholding by the propose method and (f) the multiple thresholds of hue channel obtained from the proposed method.

Experimental results of multilevel thresholding process: (a) the original image, (b) thresholding by a manually supervised method, (c) multilevel thresholding by the fast Otsu's method, (d) the multiple thresholds of hue channel obtained from the fast Otsu's method, (e) multilevel thresholding by the propose method and (f) the multiple thresholds of hue channel obtained from the proposed method.
Parameter settings of the competing and proposed methods.
where
Figures 11(c) and 11(d) present respectively the multilevel thresholding result and the multiple thresholds of hue channel obtained from the fast Otsu's method. It is clear from these two figures that the fast Otsu's method cannot provide satisfactory segmentation results since the multiple threshold values are not able to threshold the two colour classes in the hue histogram properly. This observation also can be verified in Table 2, which records the multiple threshold values obtained by each compared method and the corresponding PSNR measures. From Table 2, one can see that the PSNR metric between Figs. 11(b) and 11(c) is 17.1552 dB, which means a low-similarity between the results produced by the fast Otsu's and manually supervised methods.
Multiple threshold values obtained by each compared method and the corresponding PSNR measures.
On the other hand, Fig. 11(e) shows the multilevel thresholding result obtained from the proposed method. Visually comparing Figs. 11(b) and 11(e) finds that the proposed method produces similar segmentation results to the manually supervised method. Figure 11(f) illustrates the multiple thresholds of hue channel obtained from the proposed method. It is clear from Fig. 11(f) that the multiple threshold values succeed to threshold the two colour classes in the hue histogram, producing a high PSNR of 38.0751 dB as shown in Table 2. These results explain why the proposed method produces a similar result to the manually supervised method. Similar observations can also be found in the case shown in Figs. 12(a)–12(f). Therefore, the above experimental studies validate the performance of the proposed method.
5.2 Extension to skin colour segmentation
The proposed method can be extended to other colour image segmentation applications, such as skin colour segmentation. Figure 13(a) illustrates a skin colour image, which challenges many existing skin colour segmentation methods since it contains a variety of ethnic skin tones. Figure 13(b) shows the colour thresholding result obtained from the proposed method. It can be observed from Fig. 13(b) that the proposed method efficiently extracts most skin pixels in the original image, except some white skin pixels. Table 3 tabulates the multiple threshold values found by the proposed method in this experiment. From Table 3, one can see that the skin colour distribution ranges from 1 to 41 in hue channel. This observation is consistent with the result of a previous work [19] which shows that the skin colour distribution in the Hue-Saturation plane typically ranges from 0 to 50 in hue channel for Asian and Caucasian ethnicities. On the other hand, the reported result of skin colour in saturation channel is characterized by values between 0.23 and 0.68, but our finding is between 0.1843 and 1.0. This is caused by that the test image used here additionally containing the Ethiopian ethnicity, leading to a smaller lower-bound threshold value on saturation. Furthermore, as mentioned in the end of Section 4.2, the upper-bound threshold of saturation is fixed as 1.0 in our design. Figure 13(c) shows the colour thresholding result obtained by the previous work [19]. Visually comparing Figs. 13(b) and 13(c) we can observe that the skin colour thresholding results obtained by the proposed method and in the previous work are very similar to each other. This result validates the performance of the proposed method extended to skin colour segmentation as we expected.
Multiple threshold values obtained by the proposed method and the previous work [19] for skin colour thresholding.

Experimental results of skin colour segmentation: (a) the original image, skin colour thresholding by (b) the propose method and (c) the previous work [19].
6. Conclusions and future work
In this study, a novel histogram-based multi-threshold searching algorithm has been developed to achieve automatic multilevel colour image thresholding for segmenting multiple objects in colour images. The proposed method comprises a new colour-pixel extraction algorithm, which computes a ratio-map image efficiently enhancing the contrast between colour and non-colour pixels. This property helps to extract all foreground colour objects from background image via a conventional single-level image thresholding method. Moreover, a new histogram-based multi-threshold searching algorithm is also proposed to search the optimal upper-bound and lower-bound thresholds of hue, saturation and brightness channels for each colour object. Experimental results validate the performance of the proposed method by comparing with the fast Otsu's multilevel thresholding method, both visually and quantitatively. It is worth noting that the proposed algorithm can be extended to other colour image segmentation applications, such as nature colour image segmentation or skin colour segmentation. This advantage greatly increases the applicability of the proposed method. The extension to multilevel video thresholding is left to our future study.
Footnotes
7. Acknowledgments
This work was supported by the National Science Council of Taiwan, ROC under grant NSC 101–2221-E-032-022.
