ABSTRACT
This paper presents an image retrieval engine based on an advanced clustering strategy that improves the performance
of image retrieval in terms of precision, recall, accuracy and retrieval time. The paper also gives an overview of
recently developed image clustering techniques and their real-time applications, such as clustering-based image binarization
in palm leaf manuscripts, blood vessel extraction for retinal images, synthetic aperture radar images, and tumor detection in
MRI images. Clustering is widely used in image processing, data mining, image retrieval, pattern recognition and image
segmentation. Clustering supports effective analysis and proper validation of images for retrieval. The proposed
system employs an advanced clustering strategy that combines N-cut clustering, hierarchical clustering and fuzzy k-means
clustering. A cluster-based image retrieval system searches for similar images using clusters computed from color, texture
and shape, and compares them with the clustered features of the query image. Cluster-based image retrieval is fast and accurate.
Here, k-means clustering forms clusters of similar image features, and N-cut then arranges these clusters to improve retrieval
speed. Hierarchical clustering is applied to the retrieved set of images during relevance feedback to improve accuracy. The proposed
content-based image retrieval system, built on this novel clustering strategy, significantly improves the performance of image
retrieval. Experimental results reveal a clear improvement in the accuracy, retrieval time, precision and recall
of the image retrieval system.
KEYWORDS: Clustering Strategy, Hierarchical Clustering, Image Retrieval Engine, K-Mean Clustering, N-Cut
Clustering, Precision
INTRODUCTION
In general, cluster analysis is the task of grouping a set of objects in such a way that objects in the same group
are more similar to each other than to objects in other groups. Such a group is called a cluster. Cluster analysis is central to
data mining and statistical data analysis, and it is an integral part of many fields such as
machine learning, pattern recognition, image analysis, information retrieval, and bioinformatics. Cluster analysis is a
general task that can be carried out by one or more algorithms in combination. Some popular families of clustering algorithms are
connectivity-based (hierarchical) clustering, centroid-based clustering, distribution-based clustering, and
density-based clustering. Clustering techniques are applied in many real-life domains such as biology,
computational biology and bioinformatics, plant and animal ecology, sequence analysis, high-throughput genotyping
platforms, human genetic clustering, medicine, medical imaging, IMRT segmentation, social network analysis, and search
result grouping.
www.tjprc.org
editor@tjprc.org
Figure 1: Proposed Clustering Strategy
chosen. Fuzzy k-means clustering extends k-means to a fuzzy partitioning of the data points. As mentioned earlier, the proposed
content-based image retrieval system utilises the color, texture and shape content of an image. This module covers K-Means
processing and the final image grouping for retrieval. The extracted features are placed in k clusters by the K-Means
method, and all k clusters are then arranged by the N-cut method. The average linkage method computes the distance between
a new cluster and the remaining clusters. Objects are extracted from segments and clustered using N-cut, so that similar objects
are grouped in the same cluster. Object clustering increases the speed of the image retrieval system. Each object has six features,
which are the average features of all its member pixels. K-means processing takes the parameter k and partitions a set of n
objects into k clusters so that the resulting intra-cluster similarity is high. Each object is assigned to the most similar cluster,
that is, the cluster whose center is closest to the object among the k clusters.
The algorithm then recomputes the centroids, so that each center becomes the centroid of its own points. The N-cut method
provides a globally optimized clustering that allows low-dimensional image clustering with a significant reduction in
complexity.
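The K-Means step described above (assign each object to the closest center, then recompute each centroid from its own points) can be sketched as follows. This is a minimal illustration rather than the system's actual implementation: the function names, the optional `init_centers` argument, the random initialisation and the stopping rule are all assumptions, and the N-cut arrangement of the resulting clusters is not shown.

```python
import math
import random

def l2(a, b):
    """Euclidean (L2) distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def mean_vector(points):
    """Component-wise mean of a non-empty list of feature vectors."""
    n = len(points)
    return [sum(col) / n for col in zip(*points)]

def k_means(points, k, init_centers=None, max_iter=100, seed=0):
    """Plain k-means over feature vectors; returns (centers, labels).

    init_centers is a hypothetical convenience argument: when omitted,
    k data points are sampled at random as the starting centers.
    """
    if init_centers is None:
        init_centers = random.Random(seed).sample(points, k)
    centers = [list(c) for c in init_centers]
    labels = None
    for _ in range(max_iter):
        # Assignment step: each object joins the cluster with the closest center.
        new_labels = [min(range(k), key=lambda c: l2(p, centers[c])) for p in points]
        if new_labels == labels:
            break  # assignments are stable, so the centers no longer move
        labels = new_labels
        # Update step: each center becomes the centroid of its own points.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # an empty cluster keeps its previous center
                centers[c] = mean_vector(members)
    return centers, labels
```

For example, `k_means(features, k=2)` on a list of per-object feature vectors returns two centers and one cluster label per object; in the setting above each vector would hold the six per-object features.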
When the user submits a query in the form of an image, the query image goes through the same image segmentation
algorithm to obtain objects. These objects are compared to the cluster centers in the database, and similarity is calculated
using the L2 distance. The objects in the database that have the minimum distance are returned, and they are used to perform
the global image distance computation between the query image and the database images.
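A minimal sketch of this matching step is given below, assuming each query object and each database object is represented by a single feature vector. The function names, the `(image_id, feature_vector)` layout and the `top_n` parameter are illustrative assumptions, and the subsequent global image distance computation is not shown.

```python
import math

def l2(a, b):
    """Euclidean (L2) distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def nearest_cluster(query, centers):
    """Index of the cluster center closest to the query vector."""
    return min(range(len(centers)), key=lambda c: l2(query, centers[c]))

def retrieve(query, centers, clusters, top_n=5):
    """Return the ids of the database objects closest to the query.

    clusters[c] is a list of (image_id, feature_vector) pairs belonging
    to centers[c]. Only the nearest cluster is searched, which is where
    the speed-up over scanning the whole database comes from.
    """
    c = nearest_cluster(query, centers)
    ranked = sorted(clusters[c], key=lambda item: l2(query, item[1]))
    return [image_id for image_id, _ in ranked[:top_n]]
```

Comparing the query against k cluster centers and then only the members of one cluster avoids computing a distance to every object in the database.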
Relevance feedback based on a hierarchical clustering strategy is employed to improve system performance by
filtering the retrieved images. The images are hierarchically grouped by their similarity levels. Hierarchical clustering returns
its output in the form of a hierarchy, a structure that is more informative than an unstructured set of clusters. This results in
random groups of images, which are further organised by N-cut. The proposed method uses a hierarchical cluster representation
of the relevant and irrelevant images for a given query image. This strategy is applied to the initial set of retrieved images.
In this process, the image groups that are most similar are clustered together first. The images in each group are then
sorted in descending order of their similarity values to find those most similar to the query image.
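The grouping and sorting described above can be sketched with a small agglomerative procedure that uses average linkage. This is an illustration under stated assumptions, not the authors' implementation: the function names are hypothetical, the similarity value is taken to be 1 / (1 + L2 distance), and the later N-cut organisation of the groups is omitted.

```python
import math

def l2(a, b):
    """Euclidean (L2) distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def average_linkage(ca, cb, feats):
    """Mean pairwise distance between the members of two clusters."""
    total = sum(l2(feats[i], feats[j]) for i in ca for j in cb)
    return total / (len(ca) * len(cb))

def agglomerate(feats, n_groups):
    """Repeatedly merge the two closest clusters until n_groups remain.

    Clusters are lists of indices into feats.
    """
    clusters = [[i] for i in range(len(feats))]
    while len(clusters) > max(n_groups, 1):
        pairs = [(a, b) for a in range(len(clusters))
                 for b in range(a + 1, len(clusters))]
        a, b = min(pairs, key=lambda p: average_linkage(
            clusters[p[0]], clusters[p[1]], feats))
        clusters[a] = clusters[a] + clusters[b]  # most similar groups merge first
        del clusters[b]
    return clusters

def rank_group(query, group, feats):
    """Sort a group in descending order of similarity to the query.

    Similarity is assumed here to be 1 / (1 + L2 distance).
    """
    def similarity(i):
        return 1.0 / (1.0 + l2(query, feats[i]))
    return sorted(group, key=similarity, reverse=True)
```

Recording the order of the merges instead of stopping at `n_groups` would give the full hierarchy that the text refers to.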
EXPERIMENTAL RESULT
Empirical results show that image retrieval performance improves in terms of retrieval time, precision and accuracy.
We tested the proposed image retrieval system using standard databases. The retrieval system takes a query
image as input and gives a set of images as output. For initial testing, we used four images (white rose flower), with color,
texture and shape features for retrieval. The system also employed fuzzy k-means clustering and the N-cut algorithm for image retrieval.
To analyse the performance of the image retrieval system with the proposed clustering strategy, we set up a series of tests on
clustering performance.
In Figure 3, the query image is a white rose flower and the feature vector is composed of color, shape and texture.
The proposed clustering strategy can remove some images that are visually irrelevant to the query
image, which improves the accuracy of the retrieval result.
Figure 3: Proposed System Input as Query Image and Output as Set of Retrieved Images
Experimental results show the performance evaluation in terms of precision, recall and accuracy, which are given by
equations (1), (2) and (3), respectively.
(1)
(2)
(3)
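As a sketch of how these three measures are commonly computed, the Python below assumes the standard set-based definitions: precision is the fraction of retrieved images that are relevant, recall is the fraction of relevant images that are retrieved, and accuracy is the fraction of database images classified correctly as retrieved or not retrieved. These definitions, the function name and the parameters are assumptions for illustration and are not necessarily the exact forms of equations (1) to (3).

```python
def retrieval_metrics(retrieved, relevant, database_size):
    """Return (precision, recall, accuracy) for one query.

    retrieved: ids of the images returned by the system
    relevant: ids of the images that are truly relevant to the query
    database_size: total number of images in the database
    """
    retrieved, relevant = set(retrieved), set(relevant)
    tp = len(retrieved & relevant)       # relevant and retrieved
    fp = len(retrieved - relevant)       # retrieved but not relevant
    fn = len(relevant - retrieved)       # relevant but missed
    tn = database_size - tp - fp - fn    # correctly left out
    precision = tp / len(retrieved) if retrieved else 0.0
    recall = tp / len(relevant) if relevant else 0.0
    accuracy = (tp + tn) / database_size if database_size else 0.0
    return precision, recall, accuracy
```

Multiplying each value by 100 gives percentages.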
Table 1 (precision, recall and accuracy values, in extraction order): 89.45, 89.05, 79.23, 85.63, 82.30, 86.56, 89.60, 88.05, 89.55, 90.55, 90
Table 1 shows the performance evaluation in terms of three parameters: precision, recall and accuracy. The fourth
parameter, retrieval time, is reduced by the proposed clustering strategy, as demonstrated by experimental assessment
of the content-based image retrieval system with a set of four test query images. We performed image retrieval both without
and with the proposed clustering strategy. The empirical results show that retrieval time is reduced
significantly with the proposed hybrid clustering approach, as shown in Figure 4. Figures 5 and 6 show screenshots of the
cluster assignment and the clustering result for the proposed clustering strategy.
Figure 4: Time Required for Image Retrieval with and without Proposed Clustering Strategy
CONCLUSIONS
The proposed content-based image retrieval system is based on a novel clustering strategy composed of k-means clustering,
hierarchical clustering and N-cut clustering, which improves the performance of image retrieval.
Experimental evaluation was carried out on a standard image data set. The experimental results reveal a clear
improvement in the speed, accuracy, precision and recall of the image retrieval system.
The proposed advanced clustering techniques play a crucial role in improving the searching and retrieval of images in
many real-world applications, such as medical and space research applications.