Physiol. Genomics Fuel your research with LabChart
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Physiol. Genomics 5: 99-111, 2001;
1094-8341/01 $5.00
This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via ISI Web of Science (40)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Chow, M. L.
Right arrow Articles by Mian, I. S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Chow, M. L.
Right arrow Articles by Mian, I. S.
Received 25 September 2000; accepted in final form 30 January 2001.
Physiological Genomics 5:99-111 (2001)
1094-8341/01 $5.00 © 2001 American Physiological Society

Identifying marker genes in transcription profiling data using a mixture of feature relevance experts

M. L. Chow1,3, E. J. Moler1,2 and I. S. Mian1

1 Radiation Biology and Environmental Toxicology Group, Department of Cell and Molecular Biology, Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720
2 Chiron Corporation, Emeryville, California 94608
3 Gene Logic Incorporated, Berkeley, California 94704

Transcription profiling experiments permit the expression levels of many genes to be measured simultaneously. Given profiling data from two types of samples, genes that most distinguish the samples (marker genes) are good candidates for subsequent in-depth experimental studies and developing decision support systems for diagnosis, prognosis, and monitoring. This work proposes a mixture of feature relevance experts as a method for identifying marker genes and illustrates the idea using published data from samples labeled as acute lymphoblastic and myeloid leukemia (ALL, AML). A feature relevance expert implements an algorithm that calculates how well a gene distinguishes samples, reorders genes according to this relevance measure, and uses a supervised learning method [here, support vector machines (SVMs)] to determine the generalization performances of different nested gene subsets. The mixture of three feature relevance experts examined implement two existing and one novel feature relevance measures. For each expert, a gene subset consisting of the top 50 genes distinguished ALL from AML samples as completely as all 7,070 genes. The 125 genes at the union of the top 50s are plausible markers for a prototype decision support system. Chromosomal aberration and other data support the prediction that the three genes at the intersection of the top 50s, cystatin C, azurocidin, and adipsin, are good targets for investigating the basic biology of ALL/AML. The same data were employed to identify markers that distinguish samples based on their labels of T cell/B cell, peripheral blood/bone marrow, and male/female. Selenoprotein W may discriminate T cells from B cells. Results from analysis of transcription profiling data from tumor/nontumor colon adenocarcinoma samples support the general utility of the aforementioned approach. Theoretical issues such as choosing SVM kernels and their parameters, training and evaluating feature relevance experts, and the impact of potentially mislabeled samples on marker identification (feature selection) are discussed.

marker genes; mixture of experts; support vector machines; adipsin; cystatin C; azurocidin




This article has been cited by other articles:


Home page
BioinformaticsHome page
S. Yuan and K.-C. Li
Context-dependent clustering for dynamic cellular state modeling of microarray gene expression
Bioinformatics, November 15, 2007; 23(22): 3039 - 3047.
[Abstract] [Full Text] [PDF]


Home page
Transactions of the Institute of Measurement and ControlHome page
H.-Q. Wang and K. Li
A New Algorithm Based on Support Vectors and Penalty Strategy for Identifying Key Genes Related with Cancer
Transactions of the Institute of Measurement and Control, August 1, 2006; 28(3): 263 - 273.
[Abstract] [PDF]


Home page
Nucleic Acids ResHome page
X. Li, S. Rao, Y. Wang, and B. Gong
Gene mining: a novel and powerful ensemble decision approach to hunting for disease genes using microarray expression profiling
Nucleic Acids Res., May 17, 2004; 32(9): 2685 - 2694.
[Abstract] [Full Text] [PDF]


Home page
Clin. Cancer Res.Home page
W. Wang, J. Hayashi, W. E. Kim, and G. Serrero
PC Cell-derived Growth Factor (Granulin Precursor) Expression and Action in Human Multiple Myeloma
Clin. Cancer Res., June 1, 2003; 9(6): 2221 - 2228.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
R. Nagarajan, N. Le, H. Mahoney, T. Araki, and J. Milbrandt
Deciphering peripheral nerve myelination by using Schwann cell expression profiling
PNAS, June 25, 2002; 99(13): 8998 - 9003.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
C. Ambroise and G. J. McLachlan
Selection bias in gene extraction on the basis of microarray gene-expression data
PNAS, May 14, 2002; 99(10): 6562 - 6566.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
M. Xiong, X. Fang, and J. Zhao
Biomarker Identification by Feature Wrappers
Genome Res., November 1, 2001; 11(11): 1878 - 1887.
[Abstract] [Full Text] [PDF]




HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Visit Other APS Journals Online