TY - JOUR
T1 - Locality Sensitive K-means Clustering
AU - Liu, Chien-Liang
AU - Hsaio, Wen Hoar
AU - Chang, Tao Hsing
PY - 2018/1/1
Y1 - 2018/1/1
N2 - This study considers clustering and dimensionality reduction simultaneously to devise an unsupervised clustering algorithm called locality sensitive K-means (LS-Kmeans). The goal is to find a linear transformation to project data points into a lower dimensional space, so that clustering can perform well in the new space. We design a novel objective function for LS-Kmeans to achieve the goal, and further show that the proposed method can be reformulated as a matrix trace minimization with constraints problem. The original optimization problem becomes a generalized eigenvalue problem when relaxing the optimization problem of LS-Kmeans by allowing the indicator entries to take arbitrary values in . This paper also shows that the continuous solutions for the transformed cluster membership indicator vectors of LS-Kmeans are located in the subspace spanned by the first K1 eigenvectors. In the experiments, we use two synthetic datasets to show that the proposed method can cluster non-linearly separable data points. Besides, the experimental results of eight real datasets indicate that the proposed algorithm can generally outperform other alternatives.
AB - This study considers clustering and dimensionality reduction simultaneously to devise an unsupervised clustering algorithm called locality sensitive K-means (LS-Kmeans). The goal is to find a linear transformation to project data points into a lower dimensional space, so that clustering can perform well in the new space. We design a novel objective function for LS-Kmeans to achieve the goal, and further show that the proposed method can be reformulated as a matrix trace minimization with constraints problem. The original optimization problem becomes a generalized eigenvalue problem when relaxing the optimization problem of LS-Kmeans by allowing the indicator entries to take arbitrary values in . This paper also shows that the continuous solutions for the transformed cluster membership indicator vectors of LS-Kmeans are located in the subspace spanned by the first K1 eigenvectors. In the experiments, we use two synthetic datasets to show that the proposed method can cluster non-linearly separable data points. Besides, the experimental results of eight real datasets indicate that the proposed algorithm can generally outperform other alternatives.
KW - Clustering
KW - Dimensionality reduction
KW - Linear transformation
KW - Locality sensitive clustering
KW - Unsupervised learning
UR - http://www.scopus.com/inward/record.url?scp=85049874601&partnerID=8YFLogxK
U2 - 10.6688/JISE.2018.34.1.17
DO - 10.6688/JISE.2018.34.1.17
M3 - Article
AN - SCOPUS:85049874601
SN - 1016-2364
VL - 34
SP - 289
EP - 305
JO - Journal of Information Science and Engineering
JF - Journal of Information Science and Engineering
IS - 1
ER -