You are on page 1of 16

Clustering Technique & its Products

Presented By : Shikha Mishra-142 Sonal Pal-149 Vikram Singh-292

Clustering
It is the task of assigning a set of objects into groups (called clusters) so that the objects in the same cluster are more similar (in some sense or another) to each other than to those in other clusters.

Difference
O Freeware O Shareware O Commercial Software

WEKA
O Waikato Environment for Knowledge

Analysis is a popular suit for machine learning software written in Java.


O Weka is a free software available under

the GNU general public license.

Advantages of Weka
O Free availability under the GNU general

public license. O Portability, since it is fully implemented in the java programming language and thus runs on almost any modern computing platform. O A comprehensive collection of data processing and modeling techniques. O Ease of use due to its graphical user interfaces.

KEY FEATURES OF WEKA


Weka supports several standard

data mining taskData processing. Clustering Classification Regression

Visualization

Feature selection
Access to SQL databases using

JAVA database connectivity. It is not capable of multi-relational data mining but there is separate software for converting a collection of linked database tables into a single table that is suitable for processing using Weka.

Flexible facilities for scripting experiment

Different algorithms for data mining and machine learning

Weka product Open source and freely available Easily useable Platformindepende nt

WEKA INTERFACEO Explorer : An environment for

exploring data with WEKA .


O Experimenter : An environment for

performing experiments and conducting statistical tests between learning schemes.

O Knowledge Flow : This

environment supports essentially the same functions as the Explorer but with a drag-and-drop interface. One advantage is that it supports incremental learning. O Simple CLI : Provides a simple command-line interface that allows direct execution of WEKA

commands for operating systems that do not provide their own command line interface.

ARFF FILE
Attribute Relationship File Format

(ARFF) is the text format file used by weka to store data in data base. The ARFF file contains two sections: the header and the data section. The first line of the header tells us the relation name. Then there is the list of the attributes (@attribute...).

BMW CLUSTER DATA IN WEKA

BMW CLUSTER ALGORITHM

CLUSTER ATTRIBUTES

THANK YOU

You might also like