Extending the K-Means Clustering Algorithm to Improve the Compactness of the Clusters

Search Browse

Advanced Search

Information

Most-Cited Articles

A Genetic Algorithm for Constructing Compact Binary Decision Trees Sung-Hyuk Cha, Charles C Tappert Cited by 105

Full Text

Bangla Basic Character Recognition Using Digital Curvelet Transform Majumdar Angshul Cited by 89

Full Text

An Overview of Color Constancy Algorithms Vivek Agarwal, B. Abidi, A. Koschan, M. A. Abidi Cited by 89

Full Text

Image Fusion and Enhancement via Empirical Mode Decomposition H. Hariharan, Andrei Gribok, M. A. Abidi, A. Koschan Cited by 84

Full Text

Biometric Authentication and Identification Using Keystroke Dynamics: A Survey Salil Partha Banerjee, Damon Woodard Cited by 58

Full Text

Eye Detection in Facial Images With Unconstrained Background Qiong Wang, Jingyu Yang Cited by 52

Full Text

A Brief Survey of Color Image Preprocessing and Segmentation Techniques Siddhartha Bhattacharyya Cited by 51

Full Text

Enhancing Binary Feature Vector Similarity Measures Sung-Hyuk Cha, Charles Tappert, Sungsoo Yoon Cited by 50

Full Text

Development and Application of Fault Detectability Performance Metrics for Instrument Calibration Verification and Anomaly Detection J. Wesley Hines, Dustin R. Garvey Cited by 45

Full Text

Dynamic Time Warping Based Static Hand Printed Signature Verification Jayadevan R, Satish R Kolhe, Pradeep M Patil Cited by 40

Full Text

Submit a manuscript

Submit online

Join the Editorial Board

To volunteer as an Associate Editor, please contact us at

Enclose a summary of your research interests and relevant expertise. We look forward to hearing from you.

Open Access Archive

2010 vol. 5 no. 1

A Probabilistic Tri-Class Support Vector Machine Luis Gonzalez-Abril, Cecilio Angulo, Francisco Velasco, Juan Antonio Ortega Cited by 2

Full Text

In-Place Algorithm for Connected Components Labeling Tetsuo Asano, Hiroshi Tanaka Cited by 17

Full Text

A Least Square Kernel Machine With Box Constraints Jayanta Basak Cited by 11

Full Text

Optimal Parameter Selection Technique for a Neural Network Based Local Thresholding Method Mohammed Jahirul Islam, Majid Ahmadi, Maher A Sid-Ahmed, Yasser M Alginahi Cited by 1

Full Text

Example Based Single-Frame Image Super-Resolution by Support Vector Regression Dalong Li, Steven Simske Cited by 18

Full Text

Recognition of on-Line Arabic Handwritten Characters Using Structural Features Ahmad Tawfiq Al-Taani, Saeed Al-Haj Cited by 34

Full Text

Shape Feature and Fuzzy Logic Based Offline Devnagari Handwritten Optical Character Recognition Prachi Mukherji, Priti P. Rege Cited by 31

Full Text

Face Verification in Videos: Set Estimation and Class Specific Thresholds Madhura Datta, C.A Murthy Cited by 1

Full Text

Segmentation of Remotely Sensed Images Using Resampling Based Bayesian Learning Abhishek Singh, Padmini Jaikumar, Suman Mitra Cited by 2

Full Text

Rotation and Scale-Invariant Texture Classification Using Log-Polar and Ridgelet Transform Selvaraj Arivazhagan, Kumar Gowri, Lakshmanan Ganesan Cited by 11

Full Text

A Novel LBP Based Methods for Pavement Crack Detection Yong Hu, Chun-xia Zhao Cited by 3

Full Text

Extending the K-Means Clustering Algorithm to Improve the Compactness of the Clusters

Full Text

Extending the K-Means Clustering Algorithm to Improve the Compactness of the Clusters

Antonia Nasiakou, Miltiadis Alamaniotis, Lefteri H. Tsoukalas

JPRR Vol 11, No 1 (2016); doi:10.13176/11.745

Antonia Nasiakou, Miltiadis Alamaniotis, Lefteri H. Tsoukalas

Abstract

Clustering is a popular method essentially applied to data analysis, data mining, vector quantization and data compression. The most widely used clustering algorithm, which belongs to the group of partitioning algorithms, is the k-means. In this paper, we propose an extended version of k-means where the initial cluster centers are selected based on a heuristic data based formula, in contrast to random selection adopted by the traditional k-means algorithm. In particular, a new formula for selecting the initial cluster centers, before applying the k-means algorithm for clustering of a data set, is introduced. The new extended k-means algorithm is tested on clustering a set of 2-D data points. The obtained results exhibit superiority with respect to clustering compactness of the proposed algorithm as compared to traditional k-means. The validity of the extended algorithm is assessed through a set of clustering measures (Silhouette, Davies-Bouldin), with the most prominent being the Davies-Bouldin measure, that identify how compactness and well-separated the clusters are.

JPRR Vol 11, No 1 (2016); doi:10.13176/11.745 | Full Text | Share this paper:

Journal of Pattern Recognition Research