Welcome to Scribd!

Skip carousel

Chameleon: A Hierarchical Clustering Algorithm Using Dynamic Modeling

Uploaded by

Tanooj Parekh

0% found this document useful (0 votes)

183 views18 pages

Data Clustering Algorithm

Original Title

Chameleon Clustering

Copyright

Available Formats

PPT, PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Data Clustering Algorithm

Copyright:

Attribution Non-Commercial (BY-NC)

Available Formats

Download as PPT, PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

183 views18 pages

Chameleon: A Hierarchical Clustering Algorithm Using Dynamic Modeling

Uploaded by

Tanooj Parekh

Data Clustering Algorithm

Copyright:

Attribution Non-Commercial (BY-NC)

Available Formats

Download as PPT, PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 18

Search inside document

Chameleon: A hierarchical Clustering Algorithm Using Dynamic Modeling

By George Karypis, Eui-Hong Han,Vipin Kumar

and not by Prashant Thiruvengadachari

Existing Algorithms

K-means and PAM

Algorithm assigns K-representational points to the clusters and tries to form clusters based on the distance measure.

More algorithms

Other algorithm include CURE, ROCK, CLARANS, etc. CURE takes into account distance between representatives
ROCK takes into account inter-cluster aggregate connectivity.

Chameleon

Two-phase approach
Phase -I

Uses a graph partitioning algorithm to divide the data set into a set of individual clusters.

Phase -II

uses an agglomerative hierarchical mining algorithm to merge the clusters.

So, basically..

Why not stop with Phase-I? We've got the clusters, haven't we ?

Chameleon(Phase-II) takes into account

Inter Connetivity Relative closeness

Hence, chameleon takes into account features intrinsic to a cluster.

Constructing a sparse graph

Using KNN

Data points that are far away are completely avoided by the algorithm (reducing the noise in the dataset) captures the concept of neighbourhood dynamically by taking into account the density of the region.

What do you do with the graph ?

Partition the KNN graph such that the edge cut is minimized.

Reason: Since edge cut represents similarity between the points, less edge cut => less similarity.

Multi-level graph partitioning algorithms to partition the graph hMeTiS library.

Example:

Cluster Similarity

Models cluster similarity based on the relative inter-connectivity and relative closeness of the clusters.

Relative Inter-Connectivity

Ci and Cj

RIC= AbsoluteIC(Ci,Cj) internal IC(Ci)+internal IC(Cj) / 2 where AbsoluteIC(Ci,Cj)= sum of weights of edges that connect Ci with Cj. internalIC(Ci) = weighted sum of edges that partition the cluster into roughly equal parts.

Relative Closeness

Absolute closeness normalized with respect to the internal closeness of the two clusters.
Absolute closeness got by average similarity between the points in Ci that are connected to the points in Cj. average weight of the edges from C(i)->C(j).

Internal Closeness.

Internal closeness of the cluster got by average of the weights of the edges in the cluster.

So, which clusters do we merge?

So far, we have got

Relative Inter-Connectivity measure. Relative Closeness measure.

Using them,

Merging the clusters..

If the relative inter-connectivity measure relative closeness measure are same, choose inter-connectivity.
You can also use,

RI(Ci,Cj)T(RI)andRC(Ci,Cj)T(RC)

Allows multiple clusters to merge at each level.

Good points about the paper :

Nice description of the working of the system.

Gives a note of existing algorithms and as to why chameleon is better. Not specific to a particular domain.

yucky and reasonably yucky parts..

Not much information given about the PhaseI part of the paper graph properties ?
Finding the complexity of the algorithm
O(nm + n log n + m^2log m)

Different domains require different measures for connectivity and closeness, ...................

Questions ?

Segmentation of Range and Intensity Image Sequences by Clustering
Document5 pages
Segmentation of Range and Intensity Image Sequences by Clustering
Seetha Chinna
No ratings yet
Birch
Document6 pages
Birch
hehehenotyours
No ratings yet
Fuzzypaper May No K
Document20 pages
Fuzzypaper May No K
Nguyễn Phong
No ratings yet
Cal 99
Document7 pages
Cal 99
sivakumars
No ratings yet
Image Segmentation Adaptive Clustering
Document9 pages
Image Segmentation Adaptive Clustering
Eliezer Beczi
No ratings yet
Kmeans and Adaptive K Means
Document6 pages
Kmeans and Adaptive K Means
Sathish Kumar
No ratings yet
Jaipur National University: Project Design With Seminar
Document26 pages
Jaipur National University: Project Design With Seminar
Faizan Shaikh
100% (1)
Journal of Computer Applications - WWW - Jcaksrce.org - Volume 4 Issue 2
Document5 pages
Journal of Computer Applications - WWW - Jcaksrce.org - Volume 4 Issue 2
Journal of Computer Applications
No ratings yet
"These Are Just Rough Notes For References" What Is K-Means Clustering
Document9 pages
"These Are Just Rough Notes For References" What Is K-Means Clustering
Nikhil Jojen
No ratings yet
Using Fuzzy C-Means and K-Means Algorithm For Optimized Image Segmentation in MRI
Document2 pages
Using Fuzzy C-Means and K-Means Algorithm For Optimized Image Segmentation in MRI
erpublication
No ratings yet
Machine Learning:: Session 1: Session 2
Document15 pages
Machine Learning:: Session 1: Session 2
aakash verma
No ratings yet
DM BS Lec8 Clustering
Document48 pages
DM BS Lec8 Clustering
ruba
No ratings yet
Image Proc - Term Paper Report
Document13 pages
Image Proc - Term Paper Report
Arun Dixit
No ratings yet
Real-Time Image Processing Algorithms For Object and Distances Identification in Mobile Robot Trajectory Planning
Document6 pages
Real-Time Image Processing Algorithms For Object and Distances Identification in Mobile Robot Trajectory Planning
amitsbhati
No ratings yet
Efficient Clustering Algorithm For Large Database
Document25 pages
Efficient Clustering Algorithm For Large Database
Phani Kumar
No ratings yet
1 getPDF - JSP
Document4 pages
1 getPDF - JSP
Luis Felipe Rau Andrade
No ratings yet
18 A Comparison of Various Distance Functions On K - Mean Clustering Algorithm
Document9 pages
18 A Comparison of Various Distance Functions On K - Mean Clustering Algorithm
Irtefaa A.
No ratings yet
Matas Bmvc02
Document10 pages
Matas Bmvc02
kkarthiks
No ratings yet
Data Clustering..
Document10 pages
Data Clustering..
ArjunSahoo
No ratings yet
Clustering
Document23 pages
Clustering
Aditya Mohite
No ratings yet
3D Reconstruction Based On Microsoft Kinect Sensor Using Iterative Closest Point Algorithm
Document6 pages
3D Reconstruction Based On Microsoft Kinect Sensor Using Iterative Closest Point Algorithm
Muthu Kumaran
No ratings yet
BIRCH Cluster For Time Series
Document5 pages
BIRCH Cluster For Time Series
yurimazur
No ratings yet
M6
Document23 pages
M6
foodiegurl2508
No ratings yet
K-Means Clustering Algorithm: - V - ' Is The Euclidean Distance Between X ' Is The Number of Data Points in I
Document3 pages
K-Means Clustering Algorithm: - V - ' Is The Euclidean Distance Between X ' Is The Number of Data Points in I
Carlos Perez S
No ratings yet
Spatial Pyramid Matching For Scene Category Recognition
Document6 pages
Spatial Pyramid Matching For Scene Category Recognition
Vaibhav Jain
No ratings yet
4 Clustering
Document9 pages
4 Clustering
Bibek Neupane
No ratings yet
A Barycentric Coordinate Based Distributed Localization Algorithm For Sensor Networks
Document13 pages
A Barycentric Coordinate Based Distributed Localization Algorithm For Sensor Networks
margarita price
No ratings yet
Data Set Property Based K' in VDBSCAN Clustering Algorithm
Document5 pages
Data Set Property Based K' in VDBSCAN Clustering Algorithm
World of Computer Science and Information Technology Journal
No ratings yet
Matas MSER PDF
Document10 pages
Matas MSER PDF
Tomislav Livnjak
No ratings yet
Clustering Techniques - Hierarchical, K-Means Clustering
Document22 pages
Clustering Techniques - Hierarchical, K-Means Clustering
Tanya Sharma
No ratings yet
KMEANS
Document9 pages
KMEANS
johnzenbano120
No ratings yet
Work Carried Out While As An Intern in Imaging Technilogy Lab, GRC, India
Document5 pages
Work Carried Out While As An Intern in Imaging Technilogy Lab, GRC, India
Lokesh Kancharla
No ratings yet
Optimization of Fuzzy C Means With Darwinian Particle Swarm Optimization On MRI Image
Document4 pages
Optimization of Fuzzy C Means With Darwinian Particle Swarm Optimization On MRI Image
erpublication
No ratings yet
10 Marks Questions
Document19 pages
10 Marks Questions
Anupriya Veerasamy
No ratings yet
Implementing Tumor Detection and Area Calculation in Mri Image of Human Brain Using Image Processing Techniques
Document6 pages
Implementing Tumor Detection and Area Calculation in Mri Image of Human Brain Using Image Processing Techniques
manjunathrv
No ratings yet
CV Lecture 7
Document119 pages
CV Lecture 7
Lovely doll
No ratings yet
Region Covariance: A Fast Descriptor For Detection and Classification
Document12 pages
Region Covariance: A Fast Descriptor For Detection and Classification
Rashad Noureddine
No ratings yet
Fuzzy MODEL IDENTIFICATION BASED ON CLUSTER ESTIMATION
Document12 pages
Fuzzy MODEL IDENTIFICATION BASED ON CLUSTER ESTIMATION
Mamad Vigilante
No ratings yet
Seminar Report
Document21 pages
Seminar Report
Rahul Paul
No ratings yet
Exact Distance Computation For Deformabl
Document8 pages
Exact Distance Computation For Deformabl
Sachin G
No ratings yet
A Generalized Method To Solve Text-Based Captchas: 1 4 Segmentation
Document6 pages
A Generalized Method To Solve Text-Based Captchas: 1 4 Segmentation
18CSE146 Gopi Prasad Gudaru
No ratings yet
Grouped Outlier Removal For Robust Ellipse Fitting
Document4 pages
Grouped Outlier Removal For Robust Ellipse Fitting
Abhra Basak
No ratings yet
Ambo University: Inistitute of Technology
Document15 pages
Ambo University: Inistitute of Technology
abay
No ratings yet
Lecture Notes - Clustering
Document13 pages
Lecture Notes - Clustering
gunjan Bhardwaj
No ratings yet
Lecture+Notes+ +clustering
Document13 pages
Lecture+Notes+ +clustering
Pankaj Pandey
No ratings yet
Hierarchical Clustering
Document10 pages
Hierarchical Clustering
HigorScbd
No ratings yet
Fuzzy Clustering: Presented by CH - Srikanth (07991A1268)
Document11 pages
Fuzzy Clustering: Presented by CH - Srikanth (07991A1268)
Srikanth Surya Chikkala
No ratings yet
Clustering: CMPUT 466/551 Nilanjan Ray
Document34 pages
Clustering: CMPUT 466/551 Nilanjan Ray
Richa Jain
No ratings yet
N Bodyreport
Document9 pages
N Bodyreport
Charles AP
No ratings yet
Brain Tumer Extraction From Mri Image Using K-Means Clustring Tecnique
Document7 pages
Brain Tumer Extraction From Mri Image Using K-Means Clustring Tecnique
Rohit Arya
No ratings yet
Automatic Road Extraction From Satellite Image: B.Sowmya Aashik Hameed
Document6 pages
Automatic Road Extraction From Satellite Image: B.Sowmya Aashik Hameed
تريليون
No ratings yet
2 - A Method For Reassembling Fragments in Image Reconstruction
Document4 pages
2 - A Method For Reassembling Fragments in Image Reconstruction
huma
No ratings yet
IV Distance and Rule Based Models 4.1 Distance Based Models
Document45 pages
IV Distance and Rule Based Models 4.1 Distance Based Models
Ram
No ratings yet
Cluster Analysis: Talha Farooq Faizan Ali Muhammad Abdul Basit
Document16 pages
Cluster Analysis: Talha Farooq Faizan Ali Muhammad Abdul Basit
Talha Farooq
No ratings yet
DM Clustering UNIT4
Document36 pages
DM Clustering UNIT4
sriramr2508
No ratings yet
A Genetic K-Means Clustering Algorithm Based On The Optimized Initial Centers
Document7 pages
A Genetic K-Means Clustering Algorithm Based On The Optimized Initial Centers
Arief Yuliansyah
No ratings yet
Contour Detection and Hierarchical Image Segmentation
Document19 pages
Contour Detection and Hierarchical Image Segmentation
Quynhtrang Nguyen
No ratings yet
Automatic Detection of Community Structures in Networks: LINMA1691 Théorie Et Algorithmique Des Graphes
Document13 pages
Automatic Detection of Community Structures in Networks: LINMA1691 Théorie Et Algorithmique Des Graphes
Star Star
No ratings yet
2006 Gernez and Al INRIA
Document4 pages
2006 Gernez and Al INRIA
nizzaza
No ratings yet
K Nearest Neighbor Algorithm: Fundamentals and Applications
From Everand
K Nearest Neighbor Algorithm: Fundamentals and Applications
Fouad Sabry
No ratings yet
PID-Fuzzy Controller For Grate Cooler in Cement Plant
Document5 pages
PID-Fuzzy Controller For Grate Cooler in Cement Plant
amir
No ratings yet
Dfs Practical List - 2019
Document3 pages
Dfs Practical List - 2019
poojadhanrajani
No ratings yet
Phase Stability Analysis of Liquid Liquid Equilibrium
Document15 pages
Phase Stability Analysis of Liquid Liquid Equilibrium
JosemarPereiradaSilva
No ratings yet
Lecture Notes Ling2019 1
Document169 pages
Lecture Notes Ling2019 1
Phúc Nguyễn
No ratings yet
Good Books On Quantum Computing
Document3 pages
Good Books On Quantum Computing
Deon
100% (1)
Selection Sort Algorithm in Programming
Document6 pages
Selection Sort Algorithm in Programming
Jeffer Mwangi
No ratings yet
FFT For ADC
Document8 pages
FFT For ADC
Brian Williamson
No ratings yet
Control Lab 8
Document8 pages
Control Lab 8
saikat ghosh
No ratings yet
Public Key Management by PGP: Network Security
Document22 pages
Public Key Management by PGP: Network Security
Abdullah Shahid
No ratings yet
An Improved Image Steganography Method Based On LSB Technique With Random
Document6 pages
An Improved Image Steganography Method Based On LSB Technique With Random
susuela
No ratings yet
ELG3155 - Control Systems
Document5 pages
ELG3155 - Control Systems
Turab Haider
No ratings yet
Haerdle W., Kleinow T., Stahl G. - Applied Quantitative Finance-Springer (2002)
Document423 pages
Haerdle W., Kleinow T., Stahl G. - Applied Quantitative Finance-Springer (2002)
G Heavy
No ratings yet
Operation Research Question Bank-1
Document25 pages
Operation Research Question Bank-1
22107059
No ratings yet
Tutorial Excel Solver
Document5 pages
Tutorial Excel Solver
wilsongouveia
No ratings yet
Searching
Document21 pages
Searching
umer farooq
No ratings yet
Design of An Optimal PID Controller Based On Lyapunov Approach
Document5 pages
Design of An Optimal PID Controller Based On Lyapunov Approach
Guilherme Henrique
No ratings yet
12.2 Examples: 12.2.1 Teenagers and The Game of Chicken
Document3 pages
12.2 Examples: 12.2.1 Teenagers and The Game of Chicken
Camila Sandoval
No ratings yet
ISYE 6420 Syllabus
Document1 page
ISYE 6420 Syllabus
Yasin Çagatay Gültekin
No ratings yet
Applied Linear Algebra - Boyd Slides Part 7
Document6 pages
Applied Linear Algebra - Boyd Slides Part 7
usmanali17
No ratings yet
A Machine Learning Based Framework For A Stage-Wise Classification of Date Palm White Scale Disease
Document10 pages
A Machine Learning Based Framework For A Stage-Wise Classification of Date Palm White Scale Disease
safae af
No ratings yet
Answer Sheet Group Number: - 6 - Names of Team Members: - Hongru Bi, Yangxi Gan, Tara Kelley, Jie Xu Data Set - Arlhomes1
Document6 pages
Answer Sheet Group Number: - 6 - Names of Team Members: - Hongru Bi, Yangxi Gan, Tara Kelley, Jie Xu Data Set - Arlhomes1
Jie Xu
No ratings yet
Solved Exam 2021 PDF
Document4 pages
Solved Exam 2021 PDF
Rahima Akhter
No ratings yet
Compare and Contrast An Algorithmic and Convolution Reverb. Demonstrate The Difference and The Important Features in Both Types of Reverb.
Document5 pages
Compare and Contrast An Algorithmic and Convolution Reverb. Demonstrate The Difference and The Important Features in Both Types of Reverb.
Miguel Peña
No ratings yet
LINFO2262: Machine Learning: Classification and Evaluation
Document39 pages
LINFO2262: Machine Learning: Classification and Evaluation
Quentin Lambotte
No ratings yet
Inverse Kinematics Solution of PUMA 560 Robot Arm Using ANFIS
Document5 pages
Inverse Kinematics Solution of PUMA 560 Robot Arm Using ANFIS
anon_731929628
No ratings yet
Quasi-Cyclic LDPC Codes For Fast Encoding
Document8 pages
Quasi-Cyclic LDPC Codes For Fast Encoding
Mamour Ba
No ratings yet
Hrvoje Jasak PH D
Document394 pages
Hrvoje Jasak PH D
Jemene
No ratings yet
Recurrent Neural Networks
Document20 pages
Recurrent Neural Networks
Sharvari Gundawar
No ratings yet
Building Diversified Portfolios That Outperform Out-Of-Sample
Document33 pages
Building Diversified Portfolios That Outperform Out-Of-Sample
Quant Bots
No ratings yet
X X X X X X
Document2 pages
X X X X X X
Raj Shetty
No ratings yet