Professional Documents
Culture Documents
55
Department of Accounting and Information Technology, National Chung Cheng University, Taiwan, 2Department of
Management Information Systems, Chung Yuan Christian University, Taiwan
Received: August 1, 2007; Accepted: November 7, 2007; Revised: November 20, 2007
Abstract: The explosive growth of image data leads to the research and development of Content-Based Image Retrieval
(CBIR) systems. CBIR systems extract and retrieve images by their low-level features, such as color, texture, and shape.
However, these visual contents do not allow users to query images by semantic meanings. Image annotation systems, a
solution to solve the inadequacy of CBIR systems, aim at automatically annotating image with some controlled keywords.
Machine learning techniques are used to develop the image annotation systems to map the low-level (visual) features to
high-level concepts or semantics. This paper reviews 50 image annotation systems using supervised machine learning
techniques to annotate images for image retrieval. Future research issues are provided.
output
1. INTRODUCTION
The advances of computing and multimedia technologies
allow the construction and archive of images with low cost.
As a result, the size of image collections is increasing.
Content-based image retrieval (CBIR) was proposed to allow
users retrieve relevant images in an effective and/or efficient
manner [1, 2].
Post-processing
Feature extraction
Annotation
Segmentation
input
Some representative
image annotation systems are also
described. Section 5 presents advanced techniques by
combining multiple
classifiers and hybrid classifiers for
3.2. Texture
(a) [11]
(b) [12]
(c) [13]
(f) [18]
57
P(C i X ) = P( x j C i )
(1)
j =1
R d denotes vectors in a ddimensional feature space and yi {-1, +1} is a class label.
y2),, (xm, ym) where xi
input
vectors
xi
and
xj
where
K ( xi , x j ) = ( xi ) ( x j ) . Figure 5 [34] illustrates this
polynomial) and
K Gaussian ( xi , x j ) = e
( is Gaussian
sigma) respectively.
59
non-linear
mapping
A
A<a
A>a
B
B>b
Class 1
C
B<b
Class 2
C>c
Class 3
C<c
Class 4
Huang et al. [43] construct a classification tree for hierarchical 11-cateogry classification. They report that using the
color correlogram performs better than the general histogram, and the classification tree outperforms the traditional
nearest neighbour classifier.
5.1.2. Bagging
output
Combina tion
rule
k +1
votes is taken. Figure 7 shows
2
y1(n)
Expert 1
y2(n)
input
x(n)
Expert 2
Combiner
output
y(n)
yk(n)
Expert k
Fig. (7).
Architecture of a classifier ensemble.
S VM_1
s _1
S VM_2
s _2
S VM_k
s _k
TeTesting
sTesting
ting s e t Tset
set
Training
setS
Tra ining s e tset
Training
Fig. (8). Architecture of the SVM ensemble.
n1
y1
n2
y2
nm
ym
61
Table 1.
Segmentation
Low-Level Features
Dimensions
C+T
19
14
32
C+T
83
C+T+E
30
C+T
144
294
C+T+S
117
C+T+S
56
C+T
N/A
N/A
N/A
Li et al. [79]
C+T
N/A
C+T
49
256
C+T+S
68
C+T
19
C+T
156
C + T + S + SR
N/A
C + SR
294
C+T
144
C+T
512
N/A
N/A
80
C+T+S
36
Li et al. [59]
C+T
144
C+T
N/A
Yanai [85]
75
C+T+S
N/A
Brank [39]
1024
Howe [63]
512
2006
2005
2004
2003
2002
(Table 1) Contd
Work
Segmentation
Low-Level Features
Dimensions
C+T
37
C+T
55
C+T
C+T
144
64
C+T
512
C+S
640
N/A
N/A
60
Bradshaw [92]
C+T
26
4096
C+T
30
166
512
C+T
512
C + SR
N/A
2001
2000
1999
1998
1997
(Note: Segmentation: G means global descriptors (i.e. non-segmentation); B for block-based segmentation; R for region-based segmentation;
Features: C means color; T texture; S shape; SR spatial relationship; E for edge)
Table 2.
Classifier
No. of classes
DataSet
Ensemble SVMs
60
Corel
SVMs
40
Others
Bayes
371
1~5
Corel
SVMs
19
N/A
Corel
375 (Corel);
24
Corel + TRECVID
Corel
2006
2005
75 (TRECVID)
Goh et al. [73]
Ensemble SVMs
116
63
(Table 2) Contd
Work
Classifier
No. of classes
DataSet
SVMs
1~3
Corel + others
SVMs
20
Corel
SVMs
WWW
SVMs
32
N/A
Corel + WWW
Bayes
125
1~5
Corel
MLP
Corel
SVMs
Others
Li et al. [79]
Bayes
18
N/A
Others
MLP
30
WWW
SVMs
10
Corel
Ensemble CART
WWW
SVMs
23
N/A
Corel + others
k-NN
N/A
Others
N/A
2~4
Corel
SVMs
Corel + others
Ensemble Bayes
116
Corel + WWW
k-NN
20
Corel
Ensemble SVMs
N/A
N/A
Corel
SVMs
Corel
Bayes
153
1~5
Corel
Li et al. [59]
Ensemble SVMs
116
Corel + WWW
MLP
26
Kodak Photo
Yanai [85]
k-NN
50
Corel + WWW
SVMs
Corel
Brank [39]
SVMs
14
Others
Howe [63]
Ensemble k-NNs
Corel
k-NN
Others
MLP
15
Others
Ensemble MLP
Others
Ensemble SVMs
Others
600
3.6
Corel
2004
2003
2002
65
(Table 2) Contd
Work
Classifier
No. of classes
DataSet
SVMs
15
Corel
k-NN
Corel
Bayes
Kodak Photo
Bayes
Corel + others
SVMs
14
Corel
Bayes
18
Corel
Bradshaw [92]
Bayes
Corel
SVMs
Corel
MLP
11
Others
Template
10
N/A
Others
Decision Trees
11
Corel
Ensemble k-NNs
Others
Template
Corel
2001
2000
1999
1998
1997
[3]
[4]
[5]
[6]
[7]
[8]
[9]
[10]
[11]
[12]
[13]
[14]
[15]
[16]
[18]
[19]
[20]
[21]
[22]
[23]
[24]
[25]
[26]
[27]
[28]
[29]
[30]
[31]
[32]
[33]
[34]
[35]
[36]
[37]
[38]
[39]
[40]
[41]
[42]
[44]
[45]
[46]
[47]
[48]
[49]
[50]
[51]
[52]
[53]
[54]
[55]
[56]
[57]
[58]
[59]
[60]
[61]
[62]
[63]
[64]
[65]
[66]
[67]
[68]
67
[70]
[71]
[72]
[73]
[74]
[75]
[76]
[77]
[78]
[79]
[80]
[81]
[82]
[83]
[85]
[86]
[87]
[88]
[89]
[90]
[91]
[92]
[93]
[94]
[95]
[96]
[97]