Title

Mining patterns in disease classification forests

Authors

Authors

H. Y. Hu

Comments

Authors: contact us about adding a copy of your work at STARS@ucf.edu

Abbreviated Journal Title

J. Biomed. Inform.

Keywords

Biological pathway; Pattern mining; Random forests; Disease phenotype; LUNG-CANCER CELLS; C PKC ISOFORMS; BREAST-CANCER; EXPRESSION PROFILES; MAMMARY-GLAND; PATHWAY; ALPHA; ACTIVATION; REGRESSION; SURVIVAL; Computer Science, Interdisciplinary Applications; Medical Informatics

Abstract

Multiple biological pathways often work together to determine a given disease phenotype. Understanding what these pathways are and how they cooperate in disease-relevant biological processes is critical to our understanding of diseases. Using microarray gene expression data, researchers have developed several methods to rank pathways by their disease relevance. However, the exact set of pathways involved and how they are involved under given disease conditions remain unclear. In this paper, we propose a novel method to first select a robust set of pathways that together best classify a given disease, and then investigate how genes in these pathways interact to determine the phenotype. By applying our method to several disease related microarray gene expression datasets, we detected many disease-relevant interaction patterns supported by evidence from the literature. Our algorithm also achieves higher accuracy in terms of identification of a robust set of disease-relevant pathways when compared with alternative strategies. Published by Elsevier Inc.

Journal Title

Journal of Biomedical Informatics

Volume

43

Issue/Number

5

Publication Date

1-1-2010

Document Type

Article

Language

English

First Page

820

Last Page

827

WOS Identifier

WOS:000281927200019

ISSN

1532-0464

Share

COinS