Hierarchical Clustering
Aim:
To create a data mining process for hierarchical clustering and agglomerative clustering.
Procedure:
1. Data Warehouse Creation
1.i) Open RapidMiner and create a new process for building the data warehouse.
2. Extraction Process
2.i) Operator: Read
2.i.a) Under the Read operators, select Read CSV and set the parameters: click Configuration
wizard -> Select file -> give the column separator value -> Finish.
2.i.b) Under the Read operators, select Read Excel and set the parameters: click Configuration
wizard -> Select file -> Finish.
2.i.c) Under the Read operators, select Read Database and set the parameters: click Configuration
wizard -> Select file -> Finish, then set Define connection to predefined.
2.i.d) Connection -> Name; Database system: MySQL; Host: localhost; Port: 3306;
Database scheme: Student; User: root; Password: admin;
URL: jdbc:mysql://localhost:3306/student. Finally, test the connection; the test status should be OK.
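The extraction steps above can be sketched outside RapidMiner in plain Python. This is only an illustration, not what the Read operators do internally: the function names, the sample rows, and the semicolon separator are assumptions made for the example. It shows how a column separator drives CSV parsing and how the connection fields combine into the JDBC URL from step 2.i.d.

```python
import csv
import io


def read_csv_rows(text, separator=","):
    """Parse CSV text into a list of dict rows, one per data line."""
    reader = csv.DictReader(io.StringIO(text), delimiter=separator)
    return [dict(row) for row in reader]


def jdbc_url(system, host, port, database):
    """Compose a JDBC URL of the form jdbc:<system>://<host>:<port>/<database>."""
    return f"jdbc:{system}://{host}:{port}/{database}"


# Hypothetical sample data; the semicolon separator is an assumption.
sample = "id;name;marks\n1;Asha;82\n2;Ravi;75\n"
rows = read_csv_rows(sample, separator=";")

# Connection fields taken from step 2.i.d.
url = jdbc_url("mysql", "localhost", 3306, "student")

print(rows)
print(url)
```

Note that every value read this way is a string; converting columns such as marks to numbers would be a separate step.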
3. Transformation Process
3.i) Operator: Names and Roles
3.i.a) Select the Rename operator under the Names and Roles operators.
3.i.b) Set the parameters: old name and new name. To rename more than one
attribute, go to Edit list.
3.ii) Operator: Join
3.ii.a) Select the Union operator under the Join operators.
3.ii.b) Union combines two different data sources and gives a single combined output.
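As a rough illustration of the two transformation steps, the sketch below renames attributes and then unions two row sets in plain Python. The helper names and sample rows are hypothetical, and filling attributes that exist in only one source with None is an assumption made for this example rather than a statement about RapidMiner's exact behaviour.

```python
def rename(rows, mapping):
    """Return rows with attribute names replaced according to mapping (old -> new)."""
    return [{mapping.get(key, key): value for key, value in row.items()} for row in rows]


def union(first, second):
    """Append second to first; attributes missing from a row become None."""
    names = []
    for row in first + second:
        for key in row:
            if key not in names:
                names.append(key)
    return [{name: row.get(name) for name in names} for row in first + second]


# Hypothetical rows from two sources with different attribute names.
csv_rows = [{"sid": 1, "sname": "Asha"}]
excel_rows = [{"id": 2, "name": "Ravi", "marks": 75}]

# Rename first so both sources share attribute names, then combine them.
renamed = rename(csv_rows, {"sid": "id", "sname": "name"})
combined = union(renamed, excel_rows)
print(combined)
```

Renaming before the union matters: without it, sid and id would be treated as two separate attributes in the combined output.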
4. Loading Process
4.i) Operator: Write
4.i.a) Select Write Database under the Write operators. Set the parameters: define connection:
predefined; connection name; Database system: MySQL; Host: localhost; Port: 3306;
Database scheme: student; User: root; Password: admin;
URL: jdbc:mysql://localhost:3306
4.i.b) Now check the result in the database.
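The loading step can be mimicked with a short script. To keep the example self-contained, it uses Python's built-in sqlite3 module with an in-memory database as a stand-in for the MySQL server; that substitution, the helper name write_rows, and the sample rows are all assumptions for illustration only.

```python
import sqlite3


def write_rows(conn, table, rows):
    """Create the table if needed and insert each row dict into it."""
    columns = list(rows[0].keys())
    col_list = ", ".join(f'"{c}"' for c in columns)
    placeholders = ", ".join("?" for _ in columns)
    conn.execute(f'CREATE TABLE IF NOT EXISTS "{table}" ({col_list})')
    conn.executemany(
        f'INSERT INTO "{table}" ({col_list}) VALUES ({placeholders})',
        [tuple(row[c] for c in columns) for row in rows],
    )
    conn.commit()


# In-memory SQLite database standing in for the MySQL server (assumption).
conn = sqlite3.connect(":memory:")
write_rows(conn, "student", [{"id": 1, "name": "Asha"}, {"id": 2, "name": "Ravi"}])

# Equivalent of step 4.i.b: check the result in the database.
stored = conn.execute('SELECT id, name FROM "student" ORDER BY id').fetchall()
print(stored)
```

Against a real MySQL server the same pattern would need a MySQL driver and the host, port, user, and password from step 4.i.a.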
1. Extraction Process:
1.i) Operator: Read Excel
1.i.a) Select the Read Excel operator from the Read operators. Set the parameters: Configuration wizard ->
Select the file name -> Finish.
2. Transformation Process:
2.i) Operator: Select Attributes
2.i.a) Select the Select Attributes operator from the operators list.
2.i.b) Set the parameters: attribute filter type: subset; attributes: click Select attributes and choose
the attributes you want.
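Selecting a subset of attributes amounts to keeping only the chosen columns of every row. The sketch below shows the idea in plain Python; the function name and the sample rows are hypothetical.

```python
def select_attributes(rows, subset):
    """Keep only the named attributes in each row, in the order given."""
    missing = [a for a in subset if rows and a not in rows[0]]
    if missing:
        raise KeyError(f"unknown attributes: {missing}")
    return [{a: row[a] for a in subset} for row in rows]


# Hypothetical rows as they might come from the Excel sheet.
rows = [
    {"id": 1, "name": "Asha", "marks": 82, "grade": "A"},
    {"id": 2, "name": "Ravi", "marks": 75, "grade": "B"},
]

# Keep only the attributes that should take part in clustering.
selected = select_attributes(rows, ["marks", "grade"])
print(selected)
```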
3. Loading Process:
3.i) Select the Hierarchical Clustering operator from the Clustering operators.
3.ii) Set the parameters: mode: single link; measure types: mixed measures.
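To make the single link setting concrete, here is a minimal pure-Python sketch of agglomerative clustering. It is not RapidMiner's implementation. The mixed distance used here (squared difference for numeric values, 0 or 1 mismatch for nominal values, then a square root) is an assumed stand-in for the mixed measures option, and the sample points are hypothetical.

```python
import math


def mixed_distance(a, b):
    """Euclidean-style distance: squared difference for numbers, 0/1 mismatch for nominal values."""
    total = 0.0
    for x, y in zip(a, b):
        if isinstance(x, (int, float)) and isinstance(y, (int, float)):
            total += (x - y) ** 2
        else:
            total += 0.0 if x == y else 1.0
    return math.sqrt(total)


def single_link(points):
    """Agglomerative clustering; returns the merge history as (cluster_a, cluster_b, distance)."""
    clusters = [(i,) for i in range(len(points))]
    merges = []
    while len(clusters) > 1:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                # Single link: distance between the closest pair of members.
                d = min(
                    mixed_distance(points[p], points[q])
                    for p in clusters[i]
                    for q in clusters[j]
                )
                if best is None or d < best[0]:
                    best = (d, i, j)
        d, i, j = best
        merges.append((clusters[i], clusters[j], d))
        merged = clusters[i] + clusters[j]
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)] + [merged]
    return merges


# Hypothetical examples: one numeric and one nominal attribute each.
points = [(1.0, "A"), (1.5, "A"), (5.0, "B"), (5.2, "B")]
merges = single_link(points)
for left, right, dist in merges:
    print(left, right, round(dist, 3))
```

Each entry in the merge history corresponds to one level of the dendrogram: the two closest clusters are merged first, and the process repeats until a single cluster remains.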