Gene expression studies generate large quantities of data with the defining characteristic that the number of genes (whose expression profiles are to be determined) exceed the number of available replicates by several orders of magnitude. Standard spot-by-spot analysis still seeks to extract useful information for each gene on the basis of the number of available replicates, and thus plays to the weakness of microarrays. On the other hand, because of the data volume, treating the entire data set as an ensemble, and developing theoretical distributions for these ensembles provides a framework that plays instead to the strength of microarrays. We present theoretical results that under reasonable assumptions, the distribution of microarray intensities follows the Gamma model, with the biological interpretations of the model parameters emerging naturally. We subsequently establish that for each microarray data set, the fractional intensities can be represented as a mixture of Beta densities, and develop a procedure for using these results to draw statistical inference regarding differential gene expression. We illustrate the results with experimental data from gene expression studies on Deinococcus radiodurans following DNA damage using cDNA microarrays. (C) 2010 Elsevier Ltd. All rights reserved.
Registro Sencillo
Registro Completo
Autor | Ogunnaike, Babatunde A. Gelmi, Claudio A. Edwards, Jeremy S. |
Título | A probabilistic framework for microarray data analysis: Fundamental probability models and statistical inference |
Revista | JOURNAL OF THEORETICAL BIOLOGY |
ISSN | 0022-5193 |
ISSN electrónico | 1095-8541 |
Volumen | 264 |
Número de publicación | 2 |
Página inicio | 211 |
Página final | 222 |
Fecha de publicación | 2010 |
Resumen | Gene expression studies generate large quantities of data with the defining characteristic that the number of genes (whose expression profiles are to be determined) exceed the number of available replicates by several orders of magnitude. Standard spot-by-spot analysis still seeks to extract useful information for each gene on the basis of the number of available replicates, and thus plays to the weakness of microarrays. On the other hand, because of the data volume, treating the entire data set as an ensemble, and developing theoretical distributions for these ensembles provides a framework that plays instead to the strength of microarrays. We present theoretical results that under reasonable assumptions, the distribution of microarray intensities follows the Gamma model, with the biological interpretations of the model parameters emerging naturally. We subsequently establish that for each microarray data set, the fractional intensities can be represented as a mixture of Beta densities, and develop a procedure for using these results to draw statistical inference regarding differential gene expression. We illustrate the results with experimental data from gene expression studies on Deinococcus radiodurans following DNA damage using cDNA microarrays. (C) 2010 Elsevier Ltd. All rights reserved. |
Derechos | acceso restringido |
Agencia financiadora | Delaware Biotechnology Institute US Department of Energy Genomatica |
DOI | 10.1016/j.jtbi.2010.02.021 |
Editorial | ACADEMIC PRESS LTD- ELSEVIER SCIENCE LTD |
Enlace | |
Id de publicación en Pubmed | MEDLINE:20170665 |
Id de publicación en WoS | WOS:000277055500006 |
Paginación | 12 páginas |
Palabra clave | Mixture models Poisson distributions Gamma distributions Beta distributions Gene expression DIFFERENTIALLY EXPRESSED GENES IONIZING-RADIATION MIXTURE-MODELS ISSUES NOISE |
Tema ODS | 03 Good Health and Well-being |
Tema ODS español | 03 Salud y bienestar |
Tipo de documento | artículo |