Clusterization is a promising group of methods in the context of patient similarity. However, results of clustering are not often clear for physicians as well as different clustering methods can produce different results. We have examined a well-known dataset and implemented 3 clustering methods (k-means, Agglomerative and Spectral). We have compared and evaluated clusters and their correlation with data attributes. In contrast to original dataset's target value, the clusters correlated with only a few attributes. Finally, we train 2 predictive models based on k-nearest neighbors (KNN) algorithm and Artificial Neural Network (ANN). Models evaluation demonstrates that using the results of clustering algorithms as predictive attribute give a higher F-score than the original target attribute.
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
Tel.: +1 703 830 6300
Fax: +1 703 830 2300 firstname.lastname@example.org
(Corporate matters and books only) IOS Press c/o Accucoms US, Inc.
For North America Sales and Customer Service
West Point Commons
Lansdale PA 19446
Tel.: +1 866 855 8967
Fax: +1 215 660 5042 email@example.com