Professional Documents
Culture Documents
Clustering
Dress Attribute Sales Data Set
Introduction
Before I used
to have 14
attribute but
after
applying the
outlier, I had
two new
attributes
which are
outlier and
extreme
value.
It shows thats I have 121 instance having
outliner and 379 do not have outliner. The
extreme values does not have the outliner.
Thus it is good, since the less the better.
Remove the outliner
How to remove the outliner :-
Weka -> Filters-> unsupervised -> instance - > remove with values -
> click on filter field to adjust.
After adjusting the yes instance outliner is removed.
First I specify the index of the attribute of the outliner which is 15.
Then choose the nominal indices as last since the last value of the
outliner instances is yes.
No Extreme values
Noisy data
A3- Data preparation
Attribute construction
After Attribute construction
Adding new attribute
Normalization
A4- Data reduction
Resampling
SRSWithoutR
SRSwithR with
sample size percent = 50
Evaluate 3 different number of clusters by
investigating the errors(says, k = {3,4,5}).
Number of cluster = 3
Number of cluster = 4
Number of cluster =5
Visualize the several number of results
based on different number of clusters.
K=3
k=4
k=5