Logo Goletty

A New Sub-topics Clustering Method Based on Semi-supervised Learing
Journal Title Journal of Computers
Journal Abbreviation jcp
Publisher Group Academy Publisher
Website http://ojs.academypublisher.com
PDF (615 kb)
   
Title A New Sub-topics Clustering Method Based on Semi-supervised Learing
Authors Xu, Xiaodan
Abstract Sub-topic clustering is a crucial step in multi-document summarization. The traditional k-means clustering method is not effective for topic clustering because the number of clusters k must be given in advance. This paper describes a new method for sub-topic clustering based on semi-supervised learning: the method firstly partition the set of sentences into disjoint subsets, each of which contained sentences covering exactly one topic, and labels the sentences which have high scores in the topic, then use the method of constrained-k-means to decide the number of topics, and finally get the sub-topic sets by k-Means clustering. This algorithm can dynamically generate the number of k-means clustering, and the experiment result indicates that the accuracy of clustering is improved
Publisher ACADEMY PUBLISHER
Date 2012-10-01
Source Journal of Computers Vol 7, No 10 (2012): Special Issue: Advances in Information and Computers
Rights Copyright © ACADEMY PUBLISHER - All Rights Reserved.To request permission, please check out URL: http://www.academypublisher.com/copyrightpermission.html.

 

See other article in the same Issue


Goletty © 2024