Creating a Data Mining Process for Hierarchical Clustering

Aim:
To create a data mining process for hierarchical clustering and agglomerative clustering.

Procedure:
1. Data Warehouse Creation
1.i) Open RapidMiner and create a new process for building the data warehouse.

2. Extraction Process
2.i) Operator: Read
2.i.a) Under the Read operators, select Read CSV and set the parameters: click Configuration
wizard -> Select file -> give the column separator value -> Finish.
2.i.b) Under the Read operators, select Read Excel and set the parameters: click Configuration wizard
-> Select file -> Finish.
2.i.c) Under the Read operators, select Read Database and set the parameters: click Configuration
wizard -> Select file -> Finish, then set Define connection to predefined.
2.i.d) Connection -> Name, Database system: mysql, Host: localhost, Port: 3306,
Database scheme: Student, User: root, Password: admin, URL:
jdbc:mysql://localhost:3306/student. Finally, test the connection; the test status should be OK.
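Outside RapidMiner, the Read CSV step comes down to parsing a file with an explicit column separator. The following is a minimal sketch using Python's standard csv module, not the RapidMiner operator itself. The function name read_csv_rows, the sample data, and the ";" separator are assumptions made for illustration only.

```python
import csv
import io


def read_csv_rows(text, separator=";"):
    """Parse CSV text into a list of dicts keyed by the header row."""
    reader = csv.DictReader(io.StringIO(text), delimiter=separator)
    return [dict(row) for row in reader]


# Hypothetical sample standing in for the file chosen in the wizard.
sample = "id;name;marks\n1;Asha;82\n2;Ravi;74\n"

rows = read_csv_rows(sample, separator=";")
print(rows)
```

All values are read as strings here; converting columns to numeric types would be a separate step.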

3. Transformation Process
3.i) Operator: Names and Roles
3.i.a) Select the Rename operator under the Names and Roles operators.
3.i.b) Set the parameters: old name and new name. To rename more than one
attribute, go to -> Edit list.
3.ii) Operator: Join
3.ii.a) Select the Union operator under the Join operators.
3.ii.b) Union combines two different sources and gives a single output.
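The rename and union steps can be sketched in plain Python as follows. This is an illustration under assumptions, not RapidMiner's implementation: rows are modelled as dictionaries, the attribute names (sid, sname, id, name, marks) are hypothetical, and union is assumed to mean stacking the rows of both sources over the superset of their attributes, with missing values filled as None.

```python
def rename_attributes(rows, mapping):
    """Return rows with attribute names changed according to mapping (old -> new)."""
    return [{mapping.get(key, key): value for key, value in row.items()} for row in rows]


def union(first, second):
    """Stack two row sets; attributes missing from a row become None."""
    names = []
    for row in first + second:
        for key in row:
            if key not in names:
                names.append(key)
    return [{name: row.get(name) for name in names} for row in first + second]


# Hypothetical rows from two different sources.
csv_rows = [{"sid": 1, "sname": "Asha"}]
excel_rows = [{"id": 2, "name": "Ravi", "marks": 74}]

renamed = rename_attributes(csv_rows, {"sid": "id", "sname": "name"})
combined = union(renamed, excel_rows)
print(combined)
```

Renaming first matters: the two sources only line up in the combined output once their attribute names agree.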

4. Loading Process
4.i) Operator: Write
4.i.a) Select Write Database under the Write operators. Set the parameters: define connection:
predefined, connection name, Database system: mysql, Host: localhost, Port: 3306,
Database scheme: student, User: root, Password: admin, URL: jdbc:mysql://localhost:3306
4.i.b) Now check the result in the database.
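The loading step can be sketched as a short script that writes rows to a table and reads them back to check the result. To keep the example self-contained, it uses Python's built-in sqlite3 module with an in-memory database as a stand-in for the MySQL server described above; this substitution, the helper name write_rows, and the sample rows are all assumptions.

```python
import sqlite3


def write_rows(conn, table, rows):
    """Create the table if needed and insert all rows (dicts with identical keys).

    The table and column names are interpolated into the SQL text, so they
    must come from a trusted source; the values are passed as parameters.
    """
    columns = list(rows[0].keys())
    col_list = ", ".join(columns)
    placeholders = ", ".join("?" for _ in columns)
    conn.execute(f"CREATE TABLE IF NOT EXISTS {table} ({col_list})")
    conn.executemany(
        f"INSERT INTO {table} ({col_list}) VALUES ({placeholders})",
        [tuple(row[c] for c in columns) for row in rows],
    )
    conn.commit()


# In-memory SQLite database standing in for the MySQL connection.
conn = sqlite3.connect(":memory:")
write_rows(conn, "student", [{"id": 1, "name": "Asha"}, {"id": 2, "name": "Ravi"}])

# Equivalent of checking the result in the database.
stored = conn.execute("SELECT id, name FROM student ORDER BY id").fetchall()
print(stored)
```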

Data Mining Process for Hierarchical Clustering

1. Extraction Process:
1.i) Operator: Read Excel

1.i.a) Select the Read Excel operator from the Read operators. Set Configuration wizard -> Select the
file name -> Finish.

2. Transformation Process:
2.i) Operator: Select Attributes

2.i.a) Select the operator -> Select Attributes from the operators.

2.i.b) Set the parameters: attribute filter type: subset, attributes -> Select attributes: select the attributes
you want.
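Selecting a subset of attributes amounts to keeping only the chosen columns of each row. A minimal sketch, assuming rows are dictionaries; the function name select_attributes and the sample attributes are hypothetical.

```python
def select_attributes(rows, subset):
    """Keep only the attributes named in subset, in that order."""
    return [{name: row[name] for name in subset} for row in rows]


# Hypothetical rows; only the numeric attributes are kept for clustering.
data = [
    {"name": "Asha", "maths": 82, "physics": 75},
    {"name": "Ravi", "maths": 74, "physics": 68},
]

selected = select_attributes(data, ["maths", "physics"])
print(selected)
```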

3. Loading Process:
3.i) Select the operator -> Hierarchical Clustering from the clustering operators.

3.ii) Set the parameters: mode: SingleLink, measure types: MixedMeasures.
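To show what the single-link mode does, here is a small sketch of agglomerative clustering in plain Python. It is not the RapidMiner operator: it assumes purely numeric attributes and Euclidean distance rather than mixed measures, and the function name single_link and the sample points are hypothetical. Each point starts as its own cluster, and the two clusters whose closest members are nearest to each other are merged repeatedly until one cluster remains.

```python
import math


def single_link(points):
    """Agglomerative clustering with single linkage.

    Returns the merge history as (cluster_a, cluster_b, distance) tuples,
    where each cluster is a tuple of point indices.
    """
    clusters = [(i,) for i in range(len(points))]
    merges = []

    def linkage(a, b):
        # Single link: distance between the closest pair of members.
        return min(math.dist(points[i], points[j]) for i in a for j in b)

    while len(clusters) > 1:
        # Find the pair of clusters with the smallest single-link distance.
        dist, x, y = min(
            (linkage(a, b), x, y)
            for x, a in enumerate(clusters)
            for y, b in enumerate(clusters)
            if x < y
        )
        merged = tuple(sorted(clusters[x] + clusters[y]))
        merges.append((clusters[x], clusters[y], dist))
        clusters = [c for k, c in enumerate(clusters) if k not in (x, y)] + [merged]
    return merges


# Hypothetical two-attribute examples forming two obvious groups.
points = [(0.0, 0.0), (0.0, 1.0), (5.0, 5.0), (5.0, 7.0)]

merges = single_link(points)
for a, b, d in merges:
    print(a, b, round(d, 3))
```

The merge history is the information a dendrogram displays: which clusters were joined, in what order, and at what distance.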
