Professional Documents
Culture Documents
www.elsevier.com/locate/patrec
a
Graduate School of Engineering, Hiroshima University, 1-4-1 Kagamiyama, Higashi Hiroshima 739-8527, Japan
b
Division of Mathematical and Information Sciences, Faculty of Integrated Arts and Sciences, Hiroshima University,
1-7-1 Kagamiyama, Higashi Hiroshima 739-8521, Japan
Communicated by G. Borgefors
Abstract
This paper proposes a new method of image thresholding by using cluster organization from the histogram of an image. A new sim-
ilarity measure proposed is based on inter-class variance of the clusters to be merged and the intra-class variance of the new merged
cluster. Experiments on practical images illustrate the effectiveness of the new method.
© 2006 Elsevier B.V. All rights reserved.
0167-8655/$ - see front matter © 2006 Elsevier B.V. All rights reserved.
doi:10.1016/j.patrec.2006.02.022
1516 A.Z. Arifin, A. Asano / Pattern Recognition Letters 27 (2006) 1515–1521
separated. The similarity function used all pixels in two are discussed in Section 3, and finally, conclusions are pre-
clusters as denoted by their coordinates. sented in Section 4.
Criterion-based thresholding algorithms are simple and
effective for two-level thresholding. However, if a multi- 2. Histogram thresholding
level thresholding is needed, the computational complexity
will exponentially increase and the performance may If the histogram of an image includes some peaks, we
become unreliable (Chang and Wang, 1997). can separate it into a number of modes. Each mode is
This paper proposes a novel and more effective method expected to correspond to a region, and there exists a
of image thresholding by taking hierarchical cluster organi- threshold at the valley between any two adjacent modes
zation into account. The proposed method attempts to (Cheng et al., 2002). We propose a novel algorithm for
develop a dendrogram of gray levels in the histogram of estimating the optimal threshold using cluster analysis.
an image, based on the similarity measure which involves Initially, we regard that every non-empty gray level is a
the inter-class variance of the clusters to be merged and separate mode contained in a cluster. Then the similarities
the intra-class variance of the new merged cluster. The bot- between adjacent clusters are computed, and the most sim-
tom–up generation of clusters employing a dendrogram by ilar pair is merged. The estimated threshold for the usual
the proposed method yields a good separation of the clus- two-level thresholding is obtained by iterating this opera-
ters and obtains a robust estimate of the threshold. Such tion until two groups of gray levels are obtained.
cluster organization will yield a clear separation between The hierarchical tree of this unification process is best
object and background even for the cases of nearly unimo- viewed graphically as a dendrogram. Let us consider an
dal or multimodal histogram. Since the proposed method example gray scale image, which contains 11 · 11 pixels
performs an iterative merging operation, the extension into and consists of 43 non-empty gray levels ranging from 98
multi-level thresholding problem is a straightforward task to 198 with the histogram shown in Fig. 1(a). Our method
by just terminating the grouping when the expected num- generates the dendrogram shown in Fig. 1(b). The numbers
ber of clusters of pixel values are obtained. along the horizontal axis of dendrogram represent the indi-
The concept of developing a dendrogram from a histo- ces of the gray levels numbered from 1 to 43. The height of
gram was already employed in our previous work pre- the links between clusters represents the order of merging
sented in a conference paper (Arifin and Asano, 2004). operations. The estimated thresholds for the t-level thres-
This paper improves its merging criteria by involving holding are obtained by separating the dendrogram into t
inter-class variance and intra-class variance in the similar- groups by cutting the branch. It indicates that the multi-
ity measurement, so as to maximize the distance of cluster level thresholding is achieved quite straightforwardly by
means, as well as the variance of the new merged cluster. our method. For the usual two-level thresholding, i.e.,
This paper also provides comparisons of the quality of the usual binarization, the threshold is obtained by cutting
binarizations among the proposed method, Otsu’s method, the dendrogram at the highest branch.
and Kwon’s method, and shows the superiority of our
method to the other methods, based on three kinds of 2.1. Cluster merging strategy
quantitative comparison. The method has been developed
not only from the theoretical interest, but also a practical Let Ck be the kth cluster of gray levels in the ascending
applicability has been already presented in (Arifin et al., order, and Tk be the highest gray level in the cluster Ck.
2005). Consequently, the cluster Ck contains gray levels in the
The rest of this paper is organized as follows: The pro- range [Tk—1 + 1,Tk], if we define T0 = —1.
posed algorithm is developed in Section 2 its experimental The proposed algorithm of cluster merging is summa-
results and quantitative comparison with other methods rized as follows:
Fig. 1. (a) Histogram of the sample image and (b) the obtained dendrogram.
A.Z. Arifin, A. Asano / Pattern Recognition Letters 27 (2006) 1515–1521 1517
Fig. 2. Original images: (a) image 1, (b) image 2, (c) image 3, (d) image 4 and (e) image 5.
Fig. 3. Histogram of the experimental images: (a) image 1, (b) image 2, (c) image 3, (d) image 4 and (e) image 5.
Fig. 4. Thresholded image obtained by the proposed method: (a) image 1, (b) image 2, (c) image 3, (d) image 4 and (e) image 5.
Fig. 5. Thresholded image obtained by Otsu’s method: (a) image 1, (b) image 2, (c) image 3, (d) image 4 and (e) image 5.
A.Z. Arifin, A. Asano / Pattern Recognition Letters 27 (2006) 1515–1521 1519
Fig. 6. Thresholded image obtained by KI’s method: (a) image 1, (b) image 2, (c) image 3, (d) image 4 and (e) image 5.
Fig. 7. Thresholded image obtained by Kwon’s method: (a) image 1, (b) image 2, (c) image 3, (d) image 4 and (e) image 5.
Table 1 the method against the two other methods, using the mis-
Threshold values determined using three threshold selection algorithms classification error (ME), (Sezgin and Sankur, 2004), rela-
Sample images Threshold selection methods tive foreground area error (RAE) (Zhang, 1996; Sezgin
Otsu’s Kwon’s KI’s The proposed and Sankur, 2004), and modified Hausdorff distance
method method method method (MHD) (Dubuisson and Jain, 1994; Sezgin and Sankur,
Image 1 77 30 44 59 2004).
Image 2 125 96 132 105 ME is defined in terms of the correlation of the images
Image 3 122 174 92 161 with human observation. It corresponds to the ratio of
Image 4 110 146 56 121
Image 5 96 127 0 102
background pixels wrongly assigned to foreground, and
vice versa. ME can be simply expressed as
B \ B T jþ jF O \ F Tj
j O
ME ¼ 1 — ; ð7Þ
jB O jþ jF Oj
pixels are misclassified into background in Figs. 5–7 than
that in Fig. 4. The border line around the image 5 is clas- where background and foreground are denoted by BO and
sified as the only object in Fig. 6, due to extreme difference FO for the original image, and by BT and FT for the test
of frequency from the other peaks in the histogram. image, respectively. RAE measures the number of discrep-
In order to compare the quality of the thresholded ancy of thresholded image with respect to the reference
images, we quantitatively evaluated the performance of image. It is defined as
Fig. 8. The ground-truth of the original images: (a) image 1, (b) image 2, (c) image 3, (d) image 4 and (e) image 5.
1520 A.Z. Arifin, A. Asano / Pattern Recognition Letters 27 (2006) 1515–1521
Fig. 9. Test images obtained by adding noise to image 2: (a) noised image with SNR of 42.7 dB, (b) histogram of (a), (c) thresholded image of (a),
(d) noised image with SNR of 4.5 dB, (e) histogram of (d) and (f) thresholded image of (d).
A.Z. Arifin, A. Asano / Pattern Recognition Letters 27 (2006) 1515–1521 1521