DBSCAN: A Density-Based Spatial Clustering Algorithm
Notions
Algorithm
Steps:
Complexity
A key advantage of DBSCAN is that you do not specify how many clusters (class labels) the
dataset contains. Instead, it takes inputs such as:
1- Dataset
2- Size of a Region
It then derives the number of clusters from the density of the data. It groups points using three
basic structures: Core, Density (Neighbor), and Outlier (Noise).
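The three structures can be illustrated with a minimal Python sketch. It assumes details not given in the text: the names `eps` (the size of a region) and `min_pts` (the density threshold), Euclidean distance, and the convention that a point counts itself toward `min_pts`. The function `classify_points` is hypothetical and only labels points; it does not form clusters.

```python
from math import dist  # Euclidean distance, Python 3.8+

def classify_points(points, eps, min_pts):
    """Label each point 'core', 'neighbor', or 'noise' (illustrative sketch)."""
    # Region query: indices of all points within eps of each point (itself included).
    regions = [
        [j for j, q in enumerate(points) if dist(p, q) <= eps]
        for p in points
    ]
    # Assumption: a point is core when its region holds at least min_pts points.
    core = {i for i, region in enumerate(regions) if len(region) >= min_pts}

    labels = []
    for i, region in enumerate(regions):
        if i in core:
            labels.append("core")
        elif any(j in core for j in region):
            # Not dense itself, but inside the region of a core point.
            labels.append("neighbor")
        else:
            labels.append("noise")
    return labels

points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (1.0, 1.0), (2.0, 1.0), (8.0, 8.0)]
labels = classify_points(points, eps=1.0, min_pts=3)
print(labels)
# ['core', 'core', 'core', 'core', 'neighbor', 'noise']
```

Here the four corners of the unit square are dense enough to be core points, `(2.0, 1.0)` sits in the region of one core point without being dense itself, and `(8.0, 8.0)` is isolated.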
This matters because in some domains you cannot know in advance how many clusters to look for,
so supplying an expected number of clusters, as the k-Means algorithm requires, is not suitable.
On the other hand, you can decide on a minimum density for the given dataset and set a threshold
value for it. DBSCAN has particularly good use cases in video processing domains.
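To make the contrast with k-Means concrete, the following Python sketch clusters a small dataset from a region size and a density threshold alone, so the number of clusters is an output rather than an input. The names `eps`, `min_pts`, `NOISE`, and `dbscan` are assumptions chosen for illustration, as are the Euclidean distance and the choice to count a point itself toward `min_pts`. The region query is a brute-force scan, which keeps the sketch short but is not efficient.

```python
from math import dist  # Euclidean distance, Python 3.8+

NOISE = -1  # assumed marker for outliers

def dbscan(points, eps, min_pts):
    """Return one cluster id per point; NOISE marks outliers (illustrative sketch)."""
    def region(i):
        # Brute-force region query: indices within eps of point i (itself included).
        return [j for j, q in enumerate(points) if dist(points[i], q) <= eps]

    labels = [None] * len(points)  # None means "not visited yet"
    cluster = -1
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        seeds = region(i)
        if len(seeds) < min_pts:
            labels[i] = NOISE  # may be claimed by a cluster later
            continue
        cluster += 1  # point i is dense enough: start a new cluster
        labels[i] = cluster
        queue = list(seeds)
        while queue:
            j = queue.pop()
            if labels[j] == NOISE:
                labels[j] = cluster  # former noise becomes a neighbor of this cluster
            if labels[j] is not None:
                continue
            labels[j] = cluster
            expansion = region(j)
            if len(expansion) >= min_pts:
                queue.extend(expansion)  # j is dense too, so keep growing
    return labels

points = [
    (0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (1.0, 1.0),          # first dense group
    (10.0, 10.0), (10.0, 11.0), (11.0, 10.0), (11.0, 11.0),  # second dense group
    (5.0, 5.0),                                              # isolated point
]
labels = dbscan(points, eps=1.0, min_pts=3)
n_clusters = len(set(labels) - {NOISE})
print(labels, n_clusters)
# [0, 0, 0, 0, 1, 1, 1, 1, -1] 2
```

The call never mentions the value 2. Both clusters and the outlier follow from the two thresholds.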
Pros:
Implementation
References
[1] Ester, Martin; Kriegel, Hans-Peter; Sander, Jörg; Xu, Xiaowei (1996). Simoudis, Evangelos;
Han, Jiawei; Fayyad, Usama M., eds. A density-based algorithm for discovering clusters in large
spatial databases with noise. Proceedings of the Second International Conference on Knowledge
Discovery and Data Mining (KDD-96). AAAI Press. pp. 226–231.