You are on page 1of 4

International Journal of Emerging Trends & Technology in Computer Science (IJETTCS)

Web Site: www.ijettcs.org Email: editor@ijettcs.org


Volume 3, Issue 6, November-December 2014

ISSN 2278-6856

Review on Label Refinement of Web Facial


Images
Mayuri M. Ejgar1, Sonali P. Dhawan2 , Jalindar N. Ekatpure3, Pooja R. Kadam4
S.B.Patil College of Engineering Indapur, Savitribai Phule Pune University,Pune

Abstract
Face annotation means giving the name to the image which is
in the photograph. Auto face annotation is playing urgent role
in many real world applications such as real world
management systems and multimedia information. In real
word applications like Facebook/Google plus/Orcut number of
pictures gets imparted. A few pictures are appropriately
marked however some of them are not properly labeled. So
there is a need of proper labeling of images having noisy
labels. If the images are properly labeled then it will help in
improving the results of web search engines also in
recognizing the person in news videos and giving the proper
names/labels to the images.
Auto face annotation can be gainful to a lot of people true
applications. Face recognition and distinguished as of late
research engages in mining weekly marked facial pictures on
the web to determination examination challenge in machine
vision and picture understanding. This paper gives different
strategies or techniques that are utilization to clarifying facial
pictures.

Keywords:- weakly labeled images, automatic labeling,


face recognition, content based image retrieval, machine
learning. Label refinement, search based face annotation.

1. INTRODUCTION
Annotations make it simple to discover necessary data
immediately when you think back and study a substance.
They help you adapt yourself with both the substance and
association of what you read. They give an approach to
start captivating thoughts and issues directly through
remarks, inquiries, associations, or other reactions that
occur to you as you read. In all these ways, clarifying a
content will make the reading process an dynamic one,
not simply background for composing assignments, yet a
necessary initial phase in the composition process. A
decently annotated content will achieve the majority of the
accompanying:
clearly distinguish where in the content critical
thoughts and data are placed
express the fundamental thoughts of a content
trace the improvement of thoughts/contentions all
through content
Now days the digital media devices are increasing so the
use of social media tools are also increasing for sharing
photos on internet. Now a days Label refinement
technique is useful in real word applications like
facebook/Google plus/Orcut, online photo collection
administration. Expansive number of pictures get
imparted on genuine word applications some of these
images are properly tagged but some of them are not
properly tagged, so there is a need of properly annotation

Volume 3, Issue 6, November-December 2014

to these weakly labeled images. We can use this label


refinement technique on World Wide Web (WWW) where
the large numbers of weakly labeled images are accessible.
It can be utilized as a part of feature space to recognize the
person in video. Label refinement is very important
technique which automatically labels the image which is
in photograph. It is broadly utilized as a part of genuine
applications .This technique annotates the images which
are uploaded by user on the internet. Here in the label
refinement we will investigate and develop a search based
annotation scheme. We have gone through techniques
such as Face recognition algorithm, Iterative framework
for face annotation, Pose adaptive matching method,
Graph based approach, Content based image retrieval, and
Search based face annotation.
2. RELATED WORK
S. Satho, Y. Nakamura, and T. Kanade[1] described a
method Name-It: Naming and Detecting faces in news
and Videos . They have created Name-It, a framework
that accomplice faces and names in news videos. To
complete this task,the framework incorporates name
extraction, face sequence extraction/identification and
video caption recognition. Every strategy incorporates a
few advanced image and natural language processing
strategies: face ID,face tracking, insightful name
extraction utilizing dictionary, thesaurus, and parser,text
region detection, image improvement, character
distinguishment, and the integration of these strategies.
Name-It incorporates these systems into a unified element:
co-occurrence. The successful test results show the
viability of a multi-modal approach in feature content
extraction. What's more, the execution of every technology
is assessed. In spite of the fact that the execution of every
engineering is not generally high, Name-It attains great
facename relationship as demonstrated in the trials.
Further research will be guided at upgrading every
method, and dissecting and enhancing the incorporation
technique. A.W.M. Smeulders, M. Worring, S. Santini, A.
Gupta, and R. Jain [2] proposed content based image
retrieval. Active learning has been demonstrated as a key
technique for enhancing content based image retrieval
performance. Among different techniques support vector
machine (SVM) active learning is prevalent for its
application to significance input in CBIR. However the
consistent SVM dynamic learning has two fundamental
disadvantages when utilized for significance feedback. In
the first place, SVM regularly suffers from learning with a
small number of marked case, which is the case in
significance feedback. Second, SVM active learning
Page 296

International Journal of Emerging Trends & Technology in Computer Science (IJETTCS)


Web Site: www.ijettcs.org Email: editor@ijettcs.org
Volume 3, Issue 6, November-December 2014
generally does not consider into account the redundancy
among examples, and in this way could select different
cases in significance feedback that are similar to each
other. J. Yang and A.G. Hauptmann[3] proposed a
Naming Every Individual in News Video Monologues.For
naming every individual person in the broadcast news
videos they presented a machine learning model. Naming
individual person appearing in the broadcast news videos
is one kind of challenge. They approach this challenging
problem with a statistical learning method. The
contribution of constraints are not critical In their
examination they included the future work for the
requirements with the classifier in a more justified way.
Berg et al. [4] presents the mix of a probability modelwith
a grouping calculation. This mix is to present the
relationshipbetween the facial pictures and the namesin
their inscriptions for the facial pictures and the discovered
namesin the same record. The essentially grouping
technique is usedto inscribed new pictures and
consequently connection name.
For enhancing the
execution grouping procedure are consolidatewith
probability model .consolidating this two techniques
thereare precise marked situated of appearances. The
aftereffect of this work demonstrates that by examining
dialect painstakingly can deliver much better grouping
furthermore take in a characteristicdialect classifier to
figure out who is imagined from messagealone. This
strategy deals with specific information set they
furthermore enhance for nothing content on website page
utilizing straightforward picturerepresentation and setting
model. Ozkan and Duygulu [5] proposed a Graph based
model for discovering the densest sub-graph as the most
related result. Proposed a strategy to partner names and
appearances for questioning individuals in vast news
photograph accumulation By and large the quantity of
same countenances of questioned individual will be
substantial so the countenances are more like one another.
They proposed the chart based technique to discover the
comparable subset with conceivable setof appearances
with question individual name. Comparability are speak to
by SIFT describers. At that point apply an eager graph
algorithm. Jianping Fan, YuliGao, HangzaiLuo[6]
proposed a hierarchical classifier framework for bridging
the semantic gap adequately and accomplishing multilevel
image annotation automatically. In this paper they
proposed a novel algorithm for automatic multilevel
image annotation through hierarchical structure.They
propose a new multi model algorithm to accomplish more
reliable interpretation of the context oriented relationships
between the atomic image concept and co-appearance of
salient articles. Interoperating concept ontology and multi
task learning to boost hierarchical image classifier and
training proposed novel hierarchical boosting algorithm to
avoid the inter-level error transmission problem.
XiaoguangRui, Mingjing Li, Zhiwei Li, Wei-Ying Ma,
NenghaiYu[7] proposed a bipartite graph reinforcement
model (BGRM) for annotation of web images. They
sufficiently utilized both visual features and textual
information of web images. Automatic image annotation

Volume 3, Issue 6, November-December 2014

ISSN 2278-6856

by words is an powerful way for managing and retrieving


a large and quickly developing number of Web images.
They developed and evaluated a web image annotation
methodology based on a bipartite graph reinforcement
model. It uses the accessible text based data associated
with web images and powers an extensive scale picture
database to perform web image annotation. By physically
looking at annotation results in excess of 5, 000 genuine
web images, they demonstrated that the proposed
methodology is viable. To further enhance the
Performance or ability of BGRM, the other poisoning
techniques are disapproved by them for the amplified
words, research better characteristic representation of
images. YuandongTian, Wei Liu[8] proposed a Face
Annotation Framework with Partial Clustering and
Interactive Labeling technique. Face annotation technique
is important in a photo management system. They
discover a new interactive face annotation framework
combining unsuper- vised and interactive learning.There
are two contributions, first contribution was the
formulation of the partial clustering algorithm, which
aims to reduce user labor rather than improve overall
accuracy and the second contribution was the interactive
labeling strategy, whichmaximizes the information gain of
each user interaction. The technique proposed in this
paper can significantly reduce the face annotation
workload by using the unsupervised clustering algorithm
and can improvethe performance of initial labeling with
help of the partial clustering can group most similar faces
into evident clusters. Jingyu Cuiy, Fang Wenz, Rong
Xiaoz, Yuandong Tianx, Xiaoou Tangz[9] proposed An
Interactive Photo Annotation System Based on Face
Clustering and Re-ranking. for semi-automatic photo
annotation they developed a several innovative interaction
techniques. They provide features like, ad hoc annotation,
contextual re-ranking, cluster annotation and with the
help of learning propagation improved the framework
execution. They created a framework for naming pictures
in family albums. The advantage of this system is that it is
beneficial at the time of labeling albums in large size. the
performance of the system is giving better result after the
addition of keyboard shortcuts and by using some previous
work about interactive photo segmentation.They were
planning to amplify their work on other media file
labeling and organization. Wei Dong, Zhe Wang, William
Josephson, Moses Charikar, Kai Li[10] proposed
Modeling LSH for Performance Tuning. Their study
demonstrates that it is conceivable to model Multi-probe
LSH and data dissemination precisely with small sample
datasets and utilize the models for programmed parameter
tuning in bonafide use. They have been represented an
execution model for multi-probe LSH and an in data
model to predict the circulations of K-NN separates in a
dataset. They derived an adaptive search method based on
the performance model to decrease performance
fluctuation between distinctive query points. The result of
their test demonstrate that the adaptive method can
decrease the standard deviation of reviews by around half,
while attaining the same review with lower inactivity.
Page 297

International Journal of Emerging Trends & Technology in Computer Science (IJETTCS)


Web Site: www.ijettcs.org Email: editor@ijettcs.org
Volume 3, Issue 6, November-December 2014
MatthieuGuillaumin,
JakobVerbeek
and
CordeliaSchmid[11]
described
Metric
Learning
Approaches for Face Identification. Face identification is a
problem because it is very difficult to determine whether
the same person is depicted in both images or not. Face
identification becomes difficult because of variations in
the scale, pose, background, lightning, hairstyle and
glasses. In this paper they presented two methods for
learning rebust distance measures (a) a logistic
discriminant approach which learns the metric from a set
of labeled image pairs (LDML) and (b) a nearest neighbor
approach which computes the probability for two images
to belong to the same class (MKNN). Zhimincao ,qi yin ,
xiaoou tang jian sun[12] proposed a new approach for face
recognition using learning based descriptor and pose
adaptive matching. They presents a novel methodology to
address the representation issue and the matching issue in
face recognition they utilize unsupervised learning
strategy to take in an encoder from training example
Which can naturally achieve very good tradeoff between
discriminative power and stay unaltered. They propose a
posture versatile matching strategy for managing the
changes occurred frequently in the image that uses pose
specific classifier to deal with different pose mixes. There
is the assignment is to comprehend that they need to
identify the face at the initial stage and further they did
face verification. They concern and work mainly on
activity of verification. Yu-Yin Sun Yin Zhang Zhi-Hua
Zhou[13] proposed Multi-Label Learning with Weak
Label. Multi-label learning manages the data related with
multiple labels simultaneously. They concentrated on the
weak label problem which is a new kind of multi-label
learning problem, where only a partial label set connected
with each one preparation sample is given. They described
the well technique which considers the inherent class
irregularity of the weak label problem and implemented
the classification limit for each label to go crosswise over
low density regions. They detail the target as a quadratic
programming issue which can be tackled efficiently. To
exploit label connection, they expected that there is a
gathering of low-rank base likenesses, and the suitable
similitudes between examples for different labels can be
inferred from these base similarities. Enhancing the
proficiency of their strategy and applying it to expansive
scale database are intriguing issues for future study. Yang
Zhou Rong Jin Steven C.H. Hoi[14] proposed Exclusive
Lasso for Multi-task Feature Selection. They proposed
novel group regularization which they call exclusive lasso.
They give point by point hypothetical investigation to
show that the proposed exclusive lasso regularizer has the
capacity rivalries among variables and thus generate
inadequate arrangements. This regularizer is associated
with a multi-task feature choice setting and an proficient
algorithm is given to handle the related enhancement
issue. Observational study demonstrates that their
proposed algorithm outperforms the benchmark algorithm
on benchmark datasets. Jinhui Tang, Richang Hong,
Shuichang Yan, Tat-Seng Chua[15] proposed Image
Annotation by kNN-Sparse Graph-based Label

Volume 3, Issue 6, November-December 2014

ISSN 2278-6856

Propagation over not properly tagged images on web.


They disapprove the subject of doing the label refinement
of large-scale image corpus by name propagation over
noisily-labeled web images. To annotate the images more
precisely, they described a novel kNN-sparse graph-based
semi-supervised learning methodology for outfitting the
labeled and unlabeled data simultaneously. kNN search is
used to ensure efficiency and in addition an effective
training label refinement methodology proposed into the
graph-based learning framework to decrease the impact of
noise in ACM Transactions on Intelligent Systems and
Technology. In future they wanted to focus on how to
build an effective training set from the Communitycontributed images and labels. MatthieuGuillaumin ,
Thomas Mensink , JakobVerbeek , CordeliaSchmid[16]
proposed a face recognition from caption. It is supervision
based. With the help of utilization a collection of images
with subtitles the system is performing the task of
knowing the face. Two things are taken in consideration
by them that are, retrieving all faces of a specific
individual from a data set, and establishing the correct
relationship between the names in the subtitles and the
faces in the images. They used LDML metric learning
technique to enhance the performance of text-based image
retrieval of people and names and faces relationship in
news photographs. In future, they want to utilize the
subtitle based supervision to reduce the need for manual
annotation for metric learning. This could be acquired by
utilizing the face naming procedure for consequently
clarifying the face images, or by throwing the issue in a
numerous instance learning framework. Dayong Wang,
Steven C.H. Hoi, Ying He[17] described. A Unified
Learning Framework for Auto Face Annotation by Mining
Web Facial Images. They propose a Weakly Label
Laplacian Support Vector Machines (WL-LapSVM)
algorithm to prepare classifiers focused on weakly labeled
data. They used the state-of-the-art technique WLRLCC
algorithm for the Tranductive learning" scheme. To
totally abuse the two ways of learning ideal models, they
evaluated different last-fusion algorithms on both
measurement level and rank level. They also proposed an
entropy-based rank level fusion algorithm, which
performs and additionally supervised regression-based
fusion algorithm without additional training efforts. In
future work they want to investigate the applications of
their strategies to solve other real-world problems.

3 IMPLICATIONS
We have gone through the techniques which are such as
Face Recognition Algorithm which is having functions
such as Automatic/semi automatic face recognition and
drawbacks such as the performance of face recognition
algorithms is unacceptable because of difficult
lighting/illumination conditions and large head posture
varieties. The following procedure is Iterative Framework
for face annotation which is having functions such as it
recognize the face of same person and proposed for user
confirmation and having drawbacks such as It requires a
lot of manual labeling and require user interaction for
each iteration. The next method is Pose Adaptive
Page 298

International Journal of Emerging Trends & Technology in Computer Science (IJETTCS)


Web Site: www.ijettcs.org Email: editor@ijettcs.org
Volume 3, Issue 6, November-December 2014
matching method which is having functions such as Uses
pose-specific classifiers to deal with different pose
combinations and having drawbacks such as Pattern
sampling is still manually designed. The further strategy
is Graph based Approach which is having functions such
as Associate names and faces for querying people in large
news photo collection and having drawbacks such as in
this the Multi person naming task Up till now it is
unsolved.. Further technique is Content based image
retrieval which is having functions such as it support
vector machine for improving content based image
retrieval and drawbacks such as it does not take into
account the redundancy. We propose a search based face
annotation for label refinement of web facial images.
There is one problem in search based face annotation is
that how effectively we collect similar images and how
effectively we perform annotation. To solve this problem
we will use unsupervised label refinement (ULR) for
refining the labels of web facial images using machine
learning strategies. They propose clustering based rough
guess algorithm for improving the results.

4 CONCLUSION
This paper exhibits a broad review on face annotation
methods for web facial pictures. Right now, a lot of people
new methodologies are proposed in the field of Auto Face
Annotation. Numerous exploration issues have been
highlighted and bearing for future work has been
recommended. Numerous open issues have been
highlighted by the analysts, for example, managing auto
face annotation on substantial scale databases by
distinctive system future work will be on copy individual
name and learn diverse learning method We propose
technique in that Weakly labeled images are collected
from internet and by doing the feature extraction and
applying ULR algorithm on it we can handle query and by
doing majority voting the refinement of image will be
done. Label refinement of weakly labeled images is now a
days useful in real world applications like facebook,
online photo album management. We propose ULR
algorithm for refining the web facial images and we also
propose a clustering based approximation algorithm to
improve the results.
ACKNLODGEMENT
We thank all faculties and colleagues of SBPCOE, Indapur, for
their constant feedback on paper. We also thank the anonymous
reviewers for their valuable comments.

References
[1] S. Satoh, Y. Nakamura, and T. Kanade, Name-It:
Naming and Detecting Faces in News Videos, IEEE
Multimedia, vol. 6, no.1,pp. 22-35, Jan.-Mar. 1999.
[2] A.W.M. Smeulders, M. Worring, S. Santini, A.
Gupta, and R. Jain,Content-Based Image Retrieval at
the End of the Early Years,IEEE Trans. Pattern
Analysis and Machine Intelligence, vol. 22, no. 12,
pp. 1349-1380, Dec. 2000
[3] J. Yang and A.G. Hauptmann, Naming Every
Individual in NewsVideo Monologues, Proc. 12th
Ann. ACM Intl Conf. Multimedia(Multimedia), pp.
580-587. 2004.

Volume 3, Issue 6, November-December 2014

ISSN 2278-6856

[4] T.L. Berg, A.C. Berg, J. Edwards, and D. Forsyth,


Whos in the Picture, Proc. Neural Information
Processing Systems Conf.(NIPS),2005.
[5] D. Ozkan and P. Duygulu, A Graph Based Approach
for Naming Faces in News Photos, Proc. IEEE CS
Conf. Computer Vision and Pattern Recognition
(CVPR), pp. 1477-1482, 2006.
[6] J. Fan, Y. Gao, and H. Luo, Hierarchical
Classification for Automatic Image Annotation,
Proc. 30th Ann. Intl ACM SIGIR Conf. Research
and Development in Information Retrieval (SIGIR),
pp. 111-118, 2007.
[7] X. Rui, M. Li, Z. Li, W.-Y. Ma, and N. Yu, Bipartite
Graph Reinforcement Model for Web Image
Annotation, Proc. 15th ACM Intl Conf. Multimedia,
pp. 585-594, 2007.
[8] Y. Tian, W. Liu, R. Xiao, F. Wen, and X. Tang, A
Face Annotation Framework with Partial Clustering
and Interactive Labeling, Proc. IEEE Conf.
Computer Vision and Pattern Recognition (CVPR),
2007.
[9] J. Cui, F. Wen, R. Xiao, Y. Tian, and X. Tang,
EasyAlbum: An Interactive Photo Annotation
System Based on Face Clustering and Re-Ranking,
Proc. SIGCHI Conf. Human Factors in Computing
Systems (CHI), pp. 367-376, 2007.
[10] W. Dong, Z. Wang, W. Josephson, M. Charikar, and
K. Li, Modeling LSH for Performance Tuning,
Proc. 17th ACM Conf. Information and Knowledge
Management (CIKM), pp. 669-678, 2008.
[11] M. Guillaumin, J. Verbeek, and C. Schmid, Is that
You? Metric Learning Approaches for Face
Identification, Proc. IEEE 12th Intl Conf. Computer
Vision (ICCV), 2009.
[12] Z. Cao, Q. Yin, X. Tang, and J. Sun, Face
Recognition with Learning-Based Descriptor, IEEE
Conf. Computer Vision and Pattern Recognition
(CVPR), pp. 2707-2714, 2010.
[13] Y.-Y. Sun, Y. Zhang, and Z.-H. Zhou, Multi-Label
Learning with Weak Label, Proc. 24th AAAI Conf.
Artificial Intelligence (AAAI), 2010.
[14] Y. Zhou, R. Jin, and S.C.-H. Hoi, Exclusive Lasso
for Multi-Task Feature Selection, Proc. AISTATS,
pp. 988-995, 2010.
[15] J. Tang, R. Hong, S. Yan, T.-S. Chua, G.-J. Qi, and
R. Jain, Image Annotation by KNN- Sparse GraphBased Label Propagation over Noisily Tagged Web
Images, ACM Trans. Intelligent Systems and
Technology, vol. 2, pp. 14:1-14:15, 2011.
[16] M. Guillaumin, T. Mensink, J. Verbeek, and C.
Schmid, Face Recognition from Caption-Based
Supervision, Intl J. Computer Vision, vol. 96, pp.
64-82, 2011.
[17] D. Wang, S.C.H. Hoi, and Y. He, A Unified
Learning Framework for Auto Face Annotation by
Mining Web Facial Images, Proc. 21st ACM Intl
Conf. Information and Knowledge Management
(CIKM), pp. 1392-1401, 2012.
Page 299

You might also like