As a guest user you are not logged in or recognized by your IP address. You have
access to the Front Matter, Abstracts, Author Index, Subject Index and the full
text of Open Access publications.
Clustering of short-text collections is a very relevant research area, given the current and future mode for people to use “small-language” (e.g. blogs, snippets, news and text-message generation such as email or chat). In recent years, a few approaches based on Particle Swarm Optimization (PSO) have been proposed to solve document clustering problems. However, the particularities that arise when this kind of approaches are used for clustering corpora containing very short documents have not received too much attention by the computational linguistic community, maybe due to the high challenge that this problem implies. In this work, we propose some variants of PSO methods to deal with this kind of corpora. Our proposal includes two very different approaches to the clustering problem, which essentially differ in the representations used for maintaining the information about the clusterings under consideration. In our approach, we used two unsupervised measures of cluster validity to be optimized: the Expected Density Measure and the Global Silhouette coefficient. In recent works on short-text clustering, these measures have shown an interesting correlation level with the “true” categorizations provided by a human expert. The experimental results show that PSO-based approaches can be highly competitive alternatives for clustering short-text corpora and can, in some cases, outperform the performance of the most effective clustering algorithms used in this area.
This website uses cookies
We use cookies to provide you with the best possible experience. They also allow us to analyze user behavior in order to constantly improve the website for you. Info about the privacy policy of IOS Press.
This website uses cookies
We use cookies to provide you with the best possible experience. They also allow us to analyze user behavior in order to constantly improve the website for you. Info about the privacy policy of IOS Press.