Computational approach reveals hidden cell subpopulations
Mathematics improve single-cell analysis
Cell populations have a high heterogeneity, even when they consist of the same type of cells. To determine various types of cells, scientists analyze the respective active transcriptome – in the form of RNA molecules – of the individual cells. Recent technical developments of this so-called single-cell analysis have enabled the transcriptomes of hundreds of cells to be assayed, thus providing an exact picture of the individual cell types.
However, the observed differences between the gene expression patterns of individual cells result from numerous sources, including confounding factors, such as short-term changes in gene expression due to the cell cycle as well as biological processes of interest such as stem cell differentiation.
The scientists have now developed a statistical approach, which models the sources of the observed cell-cell differences. This facilitates an accurate dissection of the observed heterogeneity into a variety of factors, which include measurement noise, confounding factors such as cell cycle effects as well as the biological processes of interest.
“In our current study we show how such factors can be taken into account, thus enabling a more accurate picture of the different cell types. Through the combination of single-cell analyses with statistical methods, cell types can be identified that otherwise would remain undetected,” said first author Florian Büttner of the Institute of Computational Biology (ICB) at Helmholtz Zentrum München.
Single-cell profiles: towards a better understanding of health and disease
Using their single-cell latent variable model (scLVM), the team around Florian Büttner from Helmholtz Zentrum München and Fabian Theis, Professor for Mathematics in Systems Biology at TU München, as well as John Marioni and Oliver Stegle from the European Bioinformatics Institute (EMBL-EBI, Cambridge, UK) have succeeded in detecting and characterizing the maturation stages of T-helper cells.
T-cells are immune cells that differentiate into different shapes, such as Th2-cells (T-helper cell type 2) to exert various immune functions. The analysis of single cell types is essential for medical research. Cancer cells, differentiation processes, the pathogenesis of various diseases and much more can be better explored and understood based only on known, detailed cell profiles.”
"Modern single cell analysis has shown that there are significant differences within an apparently homogeneous cell population. Our long-term goal is to understand the biological and technical causes of these heterogeneities," says Professor Fabian Theis. "This requires modern multivariant statistical and computational methods that we develop with our colleagues at the EBI."
The research has been funded by the European Research Council, the Marie Curie-Program of the European Union and the European Molecular Biology Organization.
Florian Buettner, Kedar N. Natarajan, F. Paolo Casale, Valentina Proserpio, Antonio Scialdone, Fabian J. Theis, Sarah A. Teichmann, John C. Marioni and Oliver Stegle.
Computational analysis of cell-to-cell heterogeneity in single-cell RNA-Sequencing data reveals hidden subpopulation of cells, Nature Biotechnology, DOI: 10.1038/nbt.3102