Clustering time-course microarray data using functional Bayesian infinite mixture model

Authors

    Authors

    C. Angelini; D. De Canditiis;M. Pensky

    Comments

    Authors: contact us about adding a copy of your work at STARS@ucf.edu

    Abbreviated Journal Title

    J. Appl. Stat.

    Keywords

    mixture models; Dirichlet processes; MCMC; time-course microarray; DIRICHLET PROCESS MIXTURE; GENE-EXPRESSION PROFILES; VARIABLE SELECTION; PATTERNS; Statistics & Probability

    Abstract

    This paper presents a new Bayesian, infinite mixture model based, clustering approach, specifically designed for time-course microarray data. The problem is to group together genes which have "similar" expression profiles, given the set of noisy measurements of their expression levels over a specific time interval. In order to capture temporal variations of each curve, a non-parametric regression approach is used. Each expression profile is expanded over a set of basis functions and the sets of coefficients of each curve are subsequently modeled through a Bayesian infinite mixture of Gaussian distributions. Therefore, the task of finding clusters of genes with similar expression profiles is then reduced to the problem of grouping together genes whose coefficients are sampled from the same distribution in the mixture. Dirichlet processes prior is naturally employed in such kinds of models, since it allows one to deal automatically with the uncertainty about the number of clusters. The posterior inference is carried out by a split and merge MCMC sampling scheme which integrates out parameters of the component distributions and updates only the latent vector of the cluster membership. The final configuration is obtained via the maximum a posteriori estimator. The performance of the method is studied using synthetic and real microarray data and is compared with the performances of competitive techniques.

    Journal Title

    Journal of Applied Statistics

    Volume

    39

    Issue/Number

    1

    Publication Date

    1-1-2012

    Document Type

    Article

    Language

    English

    First Page

    129

    Last Page

    149

    WOS Identifier

    WOS:000298925200010

    ISSN

    0266-4763

    Share

    COinS