
If you ask anyone who works in data mining or machine learning about the
difference between clustering and classification, they will explain it using the
terms supervised learning and unsupervised learning. So let me first explain the
key words supervised and unsupervised.
Supervised learning: suppose you have a basket full of fresh fruit, and your task is to
put fruits of the same type together. Suppose the fruits are apples, bananas, cherries and
grapes. You already know the shape of each fruit from your previous work, so it is easy to
arrange fruits of the same type in one place. In data mining, that previous work is called
the training data. You have already learned from the training data because it contains a
response variable, which tells you that a fruit with such-and-such features is a grape, and
likewise for every other fruit.
Learning from this kind of labeled data is called supervised learning, and this kind of
problem falls under classification. Because you have already learned what each fruit
looks like, you can do your job confidently.
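
To make the supervised case concrete, here is a minimal sketch in Python. Everything in it is an assumption made for illustration: the two numeric features (redness from 0 to 1, diameter in cm), the training values, and the choice of a 1-nearest-neighbour rule as the classifier. The point is only that every training example carries a known label (the response variable), and a new fruit is labeled by comparing it with those examples.

```python
import math

# Hypothetical training data: (redness 0..1, diameter in cm) -> known fruit label.
# The label is the "response variable" from the explanation above.
TRAIN = [
    ((0.90, 8.0), "apple"),
    ((0.80, 7.5), "apple"),
    ((0.90, 2.0), "cherry"),
    ((0.85, 2.2), "cherry"),
    ((0.10, 18.0), "banana"),
    ((0.15, 17.0), "banana"),
    ((0.10, 1.8), "grape"),
    ((0.20, 2.0), "grape"),
]

def classify(features, train=TRAIN):
    """Return the label of the closest training example (1-nearest neighbour)."""
    nearest = min(train, key=lambda item: math.dist(features, item[0]))
    return nearest[1]

print(classify((0.88, 7.8)))  # a big red fruit
print(classify((0.12, 1.9)))  # a small green fruit
```

With these made-up numbers the first fruit lands closest to the apples and the second closest to the grapes. A real classifier would normally scale the features first so that size does not dominate color.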
Unsupervised learning: suppose again that you have a basket full of fresh fruit and your
task is to put fruits of the same type together.
This time you know nothing about the fruits; you are seeing them for the first time. So
how will you arrange fruits of the same type?
First you pick up a fruit and choose one of its physical characteristics. Suppose you
choose color.
Then you arrange the fruits by color, and the groups look something like
this. RED COLOR GROUP: apples and cherries. GREEN COLOR GROUP: bananas and
grapes. Next you take another physical characteristic, size, and the groups become
something like this. RED COLOR AND BIG SIZE: apples. RED COLOR AND SMALL
SIZE: cherries. GREEN COLOR AND BIG SIZE: bananas. GREEN COLOR AND
SMALL SIZE: grapes. Job done, happy ending.
Here you did not learn anything beforehand, which means there is no training data and no
response variable. This type of learning is known as unsupervised learning. Clustering
falls under unsupervised learning.
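
The grouping steps above can be sketched in a few lines of Python. This is a simplified illustration, not a real clustering algorithm such as k-means: it just groups items that share the same observed characteristics. The `fruits` list and the `group_by` helper are hypothetical names made up for the example. Notice that no fruit names appear anywhere in the data.

```python
from collections import defaultdict

# Unlabeled observations: only physical characteristics, no fruit names.
fruits = [
    {"color": "red", "size": "big"},
    {"color": "red", "size": "small"},
    {"color": "green", "size": "big"},
    {"color": "green", "size": "small"},
    {"color": "red", "size": "small"},
    {"color": "green", "size": "small"},
]

def group_by(items, keys):
    """Group items that share the same values for the given characteristics."""
    groups = defaultdict(list)
    for item in items:
        groups[tuple(item[k] for k in keys)].append(item)
    return dict(groups)

# Step 1: group by color only. Step 2: refine the groups using size as well.
by_color = group_by(fruits, ["color"])
by_color_and_size = group_by(fruits, ["color", "size"])

for key, members in by_color_and_size.items():
    print(key, len(members))
```

Grouping by color alone gives two groups; adding size splits them into four, which is the same refinement described in the fruit story.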

Data mining is the extraction of patterns and knowledge from large databases. There
are two approaches: supervised learning and unsupervised learning. Supervised
learning learns from data using a target class: it predicts outcomes using class
information that is readily available. In unsupervised learning, by contrast, we
have no such information in advance, and any structure has to be discovered from
the data itself. Clustering and other grouping tasks are examples of unsupervised
learning. Classification is an example of supervised learning. One important and
popular technique for classification is the decision tree: a classifier in the
form of a tree structure in which each node is either a leaf node or a decision node.
Decision trees used this way are called classification trees. They predict the
membership of cases or objects in the classes of a categorical dependent variable
from their measurements on one or more predictor variables. The goal of a decision
tree is to predict or explain responses on a categorical dependent variable.
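
As a rough illustration of the tree structure, the sketch below writes a tiny classification tree by hand, reusing the fruit example. It is an assumption-laden toy: the `TREE` dictionary layout and the `predict` function are hypothetical, and the splits are hard-coded, whereas a real decision tree algorithm would learn them from labeled training data. Decision nodes test one predictor variable; leaf nodes hold a class of the categorical dependent variable.

```python
# A decision node has a "feature" to test and "branches" to follow.
# A leaf node has only a "label" (the predicted class).
TREE = {
    "feature": "color",
    "branches": {
        "red": {
            "feature": "size",
            "branches": {"big": {"label": "apple"}, "small": {"label": "cherry"}},
        },
        "green": {
            "feature": "size",
            "branches": {"big": {"label": "banana"}, "small": {"label": "grape"}},
        },
    },
}

def predict(node, case):
    """Walk from the root through decision nodes until a leaf is reached."""
    while "label" not in node:
        node = node["branches"][case[node["feature"]]]
    return node["label"]

print(predict(TREE, {"color": "red", "size": "small"}))
print(predict(TREE, {"color": "green", "size": "big"}))
```

Each prediction follows one path from the root to a leaf, so the path itself explains the response: a fruit is labeled cherry because it is red and small.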
