TY - GEN
T1 - Stability-based cluster analysis applied to microarray data
AU - Giurcăneanu, Ciprian Doru
AU - Tabus, Ioan
AU - Shmulevich, Ilya
AU - Zhang, Wei
N1 - Copyright 2014 Elsevier B.V., All rights reserved.
PY - 2003
Y1 - 2003
AB - This paper studies the estimation of the number of clusters using the so-called stability-based approach, where clusters obtained for two subsets of the dataset are compared via a similarity index and the decision regarding the number of clusters is taken based on the statistics of the index over randomly selected subsets. We introduce a new similarity index s(·,·) and analyze the consistency of the estimator of the number of classes when the k-means algorithm is used in conjunction with s(·,·). Various similarity indices are experimentally evaluated when comparing the "true" data partition with the partition obtained at each level of a hierarchical clustering tree. Finally, experimental results with real data are reported for a glioma microarray dataset.
UR - http://www.scopus.com/inward/record.url?scp=27644574208&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=27644574208&partnerID=8YFLogxK
U2 - 10.1109/ISSPA.2003.1224814
DO - 10.1109/ISSPA.2003.1224814
M3 - Conference contribution
AN - SCOPUS:27644574208
SN - 0780379462
SN - 9780780379466
T3 - Proceedings - 7th International Symposium on Signal Processing and Its Applications, ISSPA 2003
SP - 57
EP - 60
BT - Proceedings - 7th International Symposium on Signal Processing and Its Applications, ISSPA 2003
PB - IEEE Computer Society
T2 - 7th International Symposium on Signal Processing and Its Applications, ISSPA 2003
Y2 - 1 July 2003 through 4 July 2003
ER -