Abstract
The argument k is a mandatory user-specified input argument for the number of clusters which is required to start all of the partitioning clustering algorithms. In unsupervised learning applications, an optimal value of this argument is generally determined by using any of the internal validity indexes. However, the determination of k with aid of these indexes are computationally very expensive because they compute a k value using the results after several runs of a clustering algorithm. On the contrary, the package 'kpeaks' enables to estimate k before starting a clustering session. It is based on a simple novel technique using the descriptive statistics of peak counts of the features in datasets. In this paper, we introduce and illustrate the details of R package 'kpeaks' as an implementation for quick selection of the number of clusters for starting cluster algorithms.
Original language | English |
---|---|
Title of host publication | 2018 International Conference on Artificial Intelligence and Data Processing - Proceedings |
Place of Publication | Piscataway, NJ |
Publisher | IEEE |
Number of pages | 7 |
ISBN (Print) | 9781538668788 |
DOIs | |
Publication status | Published - 24 Jan 2019 |
Event | 2018 International Conference on Artificial Intelligence and Data Processing, IDAP 2018 - Malatya, Turkey Duration: 28 Sept 2018 → 30 Sept 2018 |
Conference
Conference | 2018 International Conference on Artificial Intelligence and Data Processing, IDAP 2018 |
---|---|
Country/Territory | Turkey |
City | Malatya |
Period | 28/09/18 → 30/09/18 |
Keywords
- cluster analysis
- number of clusters
- partitioning clustering
- unsupervised learning