Clustering documents with labeled and unlabeled documents using fuzzy semi-Kmeans

Chien-Liang Liu*, Tao Hsing Chang, Hsuan Hsun Li

*此作品的通信作者

研究成果: Article同行評審

27 引文 斯高帕斯(Scopus)

摘要

While focusing on document clustering, this work presents a fuzzy semi-supervised clustering algorithm called fuzzy semi-Kmeans. The fuzzy semi-Kmeans is an extension of K-means clustering model, and it is inspired by an EM algorithm and a Gaussian mixture model. Additionally, the fuzzy semi-Kmeans provides the flexibility to employ different fuzzy membership functions to measure the distance between data. This work employs Gaussian weighting function to conduct experiments, but cosine similarity function can be used as well. This work conducts experiments on three data sets and compares fuzzy semi-Kmeans with several methods. The experimental results indicate that fuzzy semi-Kmeans can generally outperform the other methods.

原文English
頁(從 - 到)48-64
頁數17
期刊Fuzzy Sets and Systems
221
DOIs
出版狀態Published - 5 2月 2013

指紋

深入研究「Clustering documents with labeled and unlabeled documents using fuzzy semi-Kmeans」主題。共同形成了獨特的指紋。

引用此