|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 TI Food and Nutrition, Wageningen, Netherlands; , RIKILT Institute of Food Safety, Wageningen, Netherlands; NIZO Food Research, Ede, Netherlands
2 National Institute for Public Health and the Environment (RIVM), Bilthoven, Netherlands; Department of Human Biology, Maastricht University, Maastricht, Netherlands; Division of Human Nutrition, Wageningen University and Research Centre, Wageningen, Netherlands
3 National Institute for Public Health and the Environment (RIVM), Bilthoven, Netherlands
4 TI Food and Nutrition, Wageningen, Netherlands; NIZO Food Research, Ede, Netherlands
5 Division of Human Nutrition, Wageningen University and Research Centre, Wageningen, Netherlands
6 Human Biology, Maastricht University, Maastricht, Netherlands
7 TI Food and Nutrition, Wageningen, Netherlands; , RIKILT Institute of Food Safety, Wageningen, Netherlands
* To whom correspondence should be addressed. E-mail: jaap.keijer{at}wur.nl.
In whole genome microarray studies major gene expression changes are easily identified, but it is a challenge to capture small, but biologically important, changes. Pathway based programs can capture small effects, but may have the disadvantage to be restricted to functionally annotated genes. A structured approach towards the identification of major and small changes for interpretation of biological effects is needed. We present a structured approach, a framework, that addresses different considerations in 1) the identification of informative genes in microarray datasets and 2) the interpretation of their biological relevance. The steps of this framework include gene ranking, gene selection, gene grouping and biological interpretation. Random forests (RF), which takes gene-gene interaction into account, is examined to rank and select genes. For human, mouse and rat whole genome arrays, less than half of the probes on the array is annotated. Consequently, pathway analysis tools ignore half of the information present in the microarray dataset. The framework described takes all genes into account. RF is a useful tool to rank genes by taking interactions into account. Applying a permutation approach, we were able to define an objective threshold for gene selection. RF combined with Self-organizing maps identified genes with coordinated but small gene expression responses that were not fully annotated, but corresponded to the same biological process. The presented approach provides a flexible framework for biological interpretation of microarray datasets. It includes all genes in the dataset, takes gene-gene interactions into account and provides an objective threshold for gene selection.
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH |
| Visit Other APS Journals Online |