Professional Documents
Culture Documents
Abstract
In recent times advances in data communication
technologies, in particular high speed Internet
connections and 3G mobile phones, have introduced
major concern about the relative ease of access to
unsuitable material. This academic and commercial
problem of real time detection of unsuitable images
communicated by phone and Internet has grown
steadily over the last number of years. This review
paper is presented in three parts. The first part
compares and contrasts the most significant skin
detection techniques, feature extraction techniques and
classification methods. The second gives an analysis of
the significant test results. This review paper examines
thirty-three of the most recent techniques along with
their specific conditions. Finally, this paper concludes
by identifying future challenges and briefly summarizes
the proposed features of an optimal system for future
implementation with a suggested solution to the affects
of lighting variations on the colour of skin pixels.
Skin
Detection
Feature
Extraction
Image
Classification
Benign
Image
2. Skin Detection
1. Introduction
Objectionable
Image
Input
Image
151
152
(1)
(20 I 90) (100 150)
where,
|V |.
(2)
= tan1
| U |
This method of skin detection can be used with a
single colour space [1][17] for simplicity or multiple
colour spaces [7][24] to increase accuracy.
Related to the explicit threshold is the skin
probability ratio (also known as skin likelihood). This
is where a pixel is classified as skin using various
probability theories to create a skin likelihood map. Ye
[9] uses Bayes theorem to reduce the effect of
variations in light while detecting skin.
p(c) = wi
exp (c i )T (c i ) , (3)
1
1
i
2
(2 ) 2 | i | 2
where c is the colour vector, is the mean vector and
is the diagonal covariance matrix. The number of
Gaussian functions used is critical and the choice of
colour space is also of great importance [31]. It is
widely regarded [5][37][38] that the Gaussian mixture
model gives inferior results to that of such systems like
the colour histogram, yet it has been extensively used
for skin colour segmentation in objectionable image
detection systems [27][28] showing surprisingly high
sensitivity (92.2%) and specificity (97.9%) [12].
2.2.3
Non-Parametric
Approaches.
Colour
histograms are a statistical method for representing the
distribution of colour in an image and are constructed
by counting the number of pixels of each colour.
Jayaram [35] shows that the number of bins used in the
histogram is a large factor in the performance of the
skin detection.
Another use of the colour histogram is the
likelihood histogram [6][22], created with the skin
colour likelihood algorithm which establishes the
probability of a pixel being a skin pixel. Jones and
Rehg [5] used a set of training images to create two
colour histograms of skin and non-skin pixels;
maximum entropy modelling was then used to train a
Bayes classifier with 88% accuracy. This model has
been repeatedly used as part of other objectionable
image detection systems [15][19].
A major issue with colour histograms is they only
measure colour density, this means that two images,
153
3. Feature Extraction
The classification of digital images is a memory
hungry and computationally complex process. The
solution for this is a process called feature extraction.
Feature extraction is a form of dimension reduction,
where resources used to describe large sets of data are
simplified with as little loss to accuracy as possible.
The colour and texture methods discussed previously
are forms of feature extraction, but they are used solely
in the classification of skin. This section discusses the
features used in the classification of the objectionable
image, predominately geometric and dimensional.
4. Classifiers
A classifier is a mathematical method of grouping
the images based on the results from the feature
extraction and skin detection. Most of the systems
class the images as benign or objectionable, but some
have various levels such as topless, nude or sex image
[25].
154
5. Results
The test results given are in the form of sensitivity
and specificity, where sensitivity is defined as the ratio
of the number of objectionable images identified to the
total number of objectionable images tested and
specificity is defined as the ratio of the number of
benign images passed to the total number of benign
images tested [2]. Due to space constraints Table 1
only shows the top 5 results of the reviewed
publication whose sensitivity and specificity are both
above 90%. As can be seen from this table the results
of the detection systems look to give extremely high
sensitivity and specificity.
Publication
Wang 1997 [2]
Sensitivity
91%
Specificity
96%
93.47%
91.61%
92.2%
97.9%
97.6%
91.5%
94.7%
95.1%
Publication
Fleck 1996
[1]
Jiao 2003
[4]
Sensitivity
Specificity
52.2%
96.6%
89.3%
90.6%
Duan 2002
[8]
80.7%
Cusano 2004
[10]
90.4%
Lee 2004
[29]
86.4%
90%
88.4%
94.8%
Dataset
Source: Internet, CDs, Magazines
Ethnicity: Caucasians
Illumination Conditions: Various
Source: Internet, Corel Library
Ethnicity: Caucasians, Asian
Illumination Conditions: Not Provided
Source: Internet, Corel Library
Ethnicity: Caucasians, Asian, European
Illumination Conditions: Various
Source: Not Provided
Ethnicity: Caucasians, African, Indian
Illumination Conditions: Various
Source: Not Provided
Ethnicity: Caucasians, African, Asian
Illumination Conditions: Controlled
155
6. Conclusion
To reduce false-positives some papers have added
various steps such as face detection and swimsuit
detection. Generally the techniques have implemented
a skin detection method, as large amounts of skin are
generally a sign of the presence of naked people,
followed by a feature extraction method, to identify the
features such as shape and location, and finally
classification from the results of the two previous
steps.
The right choice of method to perform colour
analysis in the skin identification process directly
stipulates the features that can be extracted from the
image. The use of colour histograms to find the colour
density of an image may identify if large skin areas are
present, but they do not allow for features such as
shape and location to be found. However, using colour
histograms to train a Beyes probability algorithm has
been proven to give good results [5]; note this is an old
method and newer adaptive methods of skin detection
have since been developed [12].
Much of the datasets used are described as being
gathered randomly from the Internet (Some papers
count logos as images from the Internet thus boosting
their results.), but do not state from what domain
(Asian, American...etc) or of what the images depict
(indoor, outdoor, professional, amateur, etc). Both of
these issues can affect the accuracy as the ethnicity of
the persons within the images changes with the domain
and the variations in quality and lighting could reduce
the skin identification performance. The need for an
academically available datasets is essential, but due to
the nature of the images needed this may be problem.
There are legal and ethnical issues surrounding the
distribution of such images which prevent the creation
of a dataset, as no academic institute wishes to be
perceived as a distributor of pornographic material.
After careful examination of the published papers it
was decided that an optimal system would consist of:
1. HSV/HSI or YCbCr should be the choice of
colour space for accuracy; RGB should be
References
[1] M. Fleck, D.A. Forsyth, C. Bregler. Finding naked
people, ECCV 1996, vol. 2, pp 593-602.
[2] J.Z. Wang, J. Li, G. Wiederhold, O. Firschein, System
for
screening
objectionable
images,
Computer
Communications 1998, Vol.21 (15), pg 1355-1360, Elsevier.
[3] Y. Chan, R. Harvey, D. Smith, Building systems to
block pornography, eWiC 1999, pp 34-40.
[4] F. Jiao, W. Gao, L. Duan, G. Cui, Detecting Adult
Image using Multiple Features, ICII 2001, vol.3, pg 378383.
[5] M. Jones, J. M. Rehg, Statistical colour models with
application to skin detection, IJCV 2002, 46(1), pp 81-96.
156
[6] A. Bosson, G.C. Cawley, Y. Chan, R. Harvey, NonRetrieval: blocking pornographic images CIVR02, pp 50-59.
[7] L.L. Cao, X.L. Li, N.H. Yu, Z.K. Liu, Naked People
Retrieval Based on Adaboost Learning, ICMLC 2002, Vol.
2, pp 1133 - 1138.
[11] S-J. Yoo, M-H. Jung, H-B. Kang, C-S. Won, S-M. Choi,
"Composition of MPEG-7 Visual Descriptors for Detecting
Adult Images on the Internet", LNCS 2713, Springer-Verlag,
pp 682-687, 2003.
[29] J.-S. Lee, Y.-M. Kuo, P.-C. Chung, E.-L. Chen, Naked
image detection based on adaptive and extensible skin colour
model, PR(40),No. 8, August 2007, pp. 2261-2270.
[32] M.C. Shin, K.I. Chang, L.V. Tsap, Does Colour space
Transformation Make Any Difference on Skin Detection?
IEEE WACV, Dec 2002, pp 275-279.
157
158