Physiol. Genomics (April 2, 2003). doi:10.1152/physiolgenomics.00138.2002
Submitted on October 17, 2002
Accepted on March 28, 2003

Clustering gene expression data using adaptive double self-organizing map

Habtom Ressom1*, Dali Wang1, and Padma Natarajan1

1 Department of Electrical and Computer Engineering, University of Maine, Orono, Maine, USA

* To whom correspondence should be addressed. E-mail: ressom{at}eece.maine.edu.

This paper presents a novel clustering technique known as adaptive double self-organizing map (ADSOM). ADSOM has a flexible topology and performs clustering and cluster visualization simultaneously, thereby requiring no a priori knowledge of the number of clusters. ADSOM builds on a recently introduced technique known as double self-organizing map (DSOM). DSOM combines features of the popular self-organizing map (SOM) with two-dimensional position vectors, which serve as a visualization tool for deciding how many clusters are needed. Although DSOM addresses the problem of identifying an unknown number of clusters, its free parameters are difficult to control in a way that guarantees correct results and convergence. ADSOM updates its free parameters during training, and it allows its position vectors to converge to a fairly consistent number of clusters, provided that its initial number of nodes is greater than the expected number of clusters. The number of clusters can be identified by visually counting the clusters formed by the position vectors after training. A novel index is introduced, based on hierarchical clustering of the final locations of the position vectors. The index allows automated detection of the number of clusters, thereby reducing the human error that could be incurred by counting clusters visually. The reliability of ADSOM in identifying the number of clusters is demonstrated by applying it to publicly available gene expression data from multiple biological systems such as yeast, human, and mouse. ADSOM's performance in detecting the number of clusters is compared with that of a model-based clustering method.
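
The abstract does not define the index, so the following is only a minimal sketch of the general idea of counting clusters by hierarchically clustering final position vectors. It assumes single-linkage agglomeration and a "largest jump in merge distance" rule, both of which are stand-ins chosen for illustration and not the authors' index. The function names and the `final_positions` data are hypothetical.

```python
import math
from itertools import combinations


def single_linkage_merge_heights(points):
    """Agglomerate 2-D points with single linkage; return each merge distance in order."""
    clusters = [[p] for p in points]  # start with one cluster per point
    heights = []
    while len(clusters) > 1:
        best = None
        # Single linkage: cluster distance is the smallest pairwise point distance.
        for i, j in combinations(range(len(clusters)), 2):
            d = min(math.dist(a, b) for a in clusters[i] for b in clusters[j])
            if best is None or d < best[0]:
                best = (d, i, j)
        d, i, j = best
        clusters[i] = clusters[i] + clusters[j]  # merge the closest pair
        del clusters[j]
        heights.append(d)
    return heights


def estimate_num_clusters(points):
    """Assumed heuristic: stop merging just before the largest jump in merge distance."""
    if len(points) < 3:
        raise ValueError("need at least three position vectors")
    heights = single_linkage_merge_heights(points)
    gaps = [heights[k] - heights[k - 1] for k in range(1, len(heights))]
    # merges_before_jump merges happen before the biggest jump; each merge removes one cluster.
    merges_before_jump = max(range(len(gaps)), key=gaps.__getitem__) + 1
    return len(points) - merges_before_jump


# Hypothetical final position vectors: nine nodes that settled into three tight groups.
final_positions = [
    (0.0, 0.0), (0.1, 0.0), (0.0, 0.1),
    (5.0, 5.0), (5.1, 5.0), (5.0, 5.1),
    (10.0, 0.0), (10.1, 0.0), (10.0, 0.1),
]
print(estimate_num_clusters(final_positions))
```

On this made-up input the heuristic reports three groups, matching what a visual count of the positions would give. Real position vectors are unlikely to separate this cleanly, so any such rule would need checking against the visual map.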








Copyright © 2003 by the American Physiological Society.