PROTECT YOUR DNA WITH QUANTUM TECHNOLOGY
Orgo-Life the new way to the future Advertising by AdpathwayCitation: Shay T, Benoist CO, Grieshaber-Bouyer R (2026) The cell cloud: Adopting systems biology concepts in the era of single-cell immunology. PLoS Biol 24(7): e3003853. https://doi.org/10.1371/journal.pbio.3003853
Published: July 2, 2026
Copyright: © 2026 Shay et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was supported by the following funding agencies: Deutsche Forschungsgemeinschaft (Award Number: GR 5979/2-1 and CRC 1755 (CASCAID) project #02, to R.G.-B.), Else Kröner-Fresenius-Stiftung (Award Number: 2022_EKEA, to R.G.-B.), Lupus Research Alliance (to R.G.-B.), Boehringer Ingelheim Stiftung (to R.G.-B.), IZKF Erlangen (Award Number: N10, to R.G.-B.). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Abbreviations: GRN, gene regulatory network; ILC, innate lymphoid cell; scRNA-seq, single-cell RNA sequencing
For over a century, the “cell type” has served as the atomic unit of multicellular biology. Beginning with histological classifications (i.e., hepatocyte) and later refined by flow cytometry gates (i.e., T cells instead of lymphocytes), immunologists have operated under the paradigm that cells belong to discrete, stable categories defined by distinct surface markers or transcriptional programs (Fig 1a). Within cell types, phenotype can be transiently modified by spatial context, cell cycle, metabolite availability, circadian changes, etc., collectively termed cell “states” [1]. The cell type, overlaid by cell state, has been enormously productive for decades. However, the advent of single-cell RNA sequencing (scRNA-seq) and related high-throughput “omics” methods, which allow profiling of thousands of features across millions of cells, has revealed a continuous, probabilistic landscape of cellular identities, in which some regions have high occupancy probability, representing traditional cell types, and some regions have low occupancy probability, representing, for example, transition states (Fig 1b). This continuity cannot be attributed to technical limitations alone, as it reflects a biological truth that systems biologists have modeled for decades, and immunologists now have to acknowledge and integrate.
Fig 1. The cell cloud model: Theory, examples, and therapeutic implications.
(a) Historically, discrete cell types were defined by histology or standard flow cytometry. (b) “Cell clouds” represent cells as the probability distribution of transcriptional states they occupy. Cell clouds serve as the empirical manifestation of attractor basins. Canalization produces tighter clouds, and cellular plasticity produces broader clouds. (c) Theoretical foundation. Waddington”s epigenetic landscape illustrates differentiation as cells settling into stable attractor basins. Stable states are defined by Random Boolean networks with recurrent, stable configurations of gene expression that are robust to perturbations. (d) Empirical observations of cell clouds. (e) Therapeutic targeting. Advanced immunotherapies must shift from targeting simplified discrete subsets to modulating the overarching probability distribution of the cell cloud or neutralizing specific metastable intermediates. The panels in this Figure were generated with the assistance of Google Gemini.
The probabilistic nature of cellular identity has a rich theoretical foundation. Waddington’s epigenetic landscape framed differentiation as a ball rolling down valleys, each representing a stable basin of attraction for a particular cell state. Each cell within this basin was considered as belonging to this cell type with bulk measurements, and the size and steepness of the basin reflect the effort needed to transition from this cell type to another one (Fig 1c) [2]. The mathematical framework underlying this landscape was formalized by Kauffman’s random Boolean networks, in which cell types correspond to attractor states: recurrent, stable configurations of gene expression that are robust to perturbations [3]. In this view, the gene regulatory network (GRN) of an immune lineage may anchor the cellular identity via quasi-potentials, exemplified by toggle switches within a GRN whose two stable states correspond to two cell types. Such a toggle switch is the GATA1/PU.1 circuit governing hematopoietic commitment, where high GATA1 represses PU.1 to drive erythropoiesis, while high PU.1 represses GATA1 thus driving myelopoiesis. Notably, in the “view from the nucleus” GRN, the GRN includes only the active interactions in the specific cell at the specific time (in contrast to the “view from the genome” GRN, which is static and identical in all cells of the same organism) [4]. Cells within a lineage share the same GRN that defines and stabilizes the basin of attraction those cells occupy in the gene expression space.
A cell cloud is the empirical manifestation of an attractor basin: the collection of transcriptional states a cell probabilistically occupies while fluctuating within its lineage (Fig 1b). The breadth of a cloud reflects the steepness of the Waddington valley. Deep, steep valleys produce tight, narrow clouds (canalization); shallow valleys allow broad, diffuse clouds (plasticity). Continuous “smears” between clouds correspond to cells traversing ridges between adjacent attractor basins, a phenomenon now central to trajectory inference methods in scRNA-seq [5].
Several recent immunological datasets illustrate this continuum (Fig 1d). Gene expression programs corresponding to “classical” effector T cell populations such as T helper 1 (TH1), TH2, and TH17 are continuously distributed within the lineage; any discrete partition is arbitrary [6]. Similarly, neutrophils in homeostasis are distributed along a single, chronologically ordered trajectory, termed “neutrotime”, that can be sliced into an arbitrary number of “subsets” [7]. In psoriatic skin, innate lymphoid cells (ILCs) do not partition into discrete ILC1–3 groups but form a continuous cloud, with ILC3-like transcriptomes arising from both ILC2-like cells and resting ILCs [8]. In each case, continuity within an attractor basin is the biological signal, not noise.
Importantly, the cloud model does not replace discrete cell types; it complements them. The discovery of the pulmonary ionocyte via scRNA-seq illustrates this clearly: a previously invisible, rare cell type defined by sharply peaked expression of the anion channel CFTR, accompanied by genes associated with ion transport and fluid regulation, that is now implicated in cystic fibrosis pathogenesis [9]. Though located on a continuous developmental trajectory with basal cells, the ionocyte occupies a deep, narrow attractor basin (Fig 1d). Ionocytes were identified precisely because scRNA-seq resolves the full landscape. The cell cloud model thus expands the toolkit for cell-type discovery: cell types are regions of high probability density within a continuous landscape, not exceptions to it.
A critical challenge is distinguishing the true biological structure of the cloud from variability that is due to measurement. The molecular basis of cloud variance in the phenotypic space is probably attributed, at least in part, to variance in transcriptional bursting. Transcription occurs in stochastic pulses rather than continuously, such that two biologically identical cells can display different transcript counts depending on when they were sampled in the burst cycle [10]. This intrinsic biological stochasticity must be distinguished from technical artifacts inherent to scRNA-seq. Dropout (such as via incomplete capture of mRNA, limited reverse transcription, or insufficient sequencing) introduces false zeros that artificially expand cell cloud boundaries and create spurious gradients between unrelated cells. Low sequencing depth increases dropout rate, but as sequencing depth and cell number represent a fundamental design trade-off, atlas-scale experiments prioritize broad sampling to map the full landscape. Thus, atlases are often less accurate for resolving the internal texture of individual clouds (e.g., subtle T cell state gradients), which demands deeper per-cell coverage [5].
The cell cloud model has direct implications for immunology and therapeutics. Defining a cell type by thresholding on levels of a handful of surface markers identifies only the dense region of an attractor. The shape and boundaries of a cloud, and its connectivity to neighboring attractors, require high-dimensional analysis. Co-expression of the transcription factors RORγt and FOXP3 in single cells, for instance, marks metastable intermediates between TH17 and regulatory T cell attractor basins: functionally distinct from either stable state, transient due to their inherent instability, and potentially targetable through modulation of the underlying quasi-potential landscape. Immunotherapies targeting specific cell populations need to account not only for the attractor center, but for the full probability distribution of phenotypic states that cells within that cloud can occupy (Fig 1e).
The transition from discrete cell types to probabilistic clouds is not a crisis for immunology but an opportunity. The theoretical and mathematical framework for this transition, developed in systems biology over the past half-century, is mature and now made empirically accessible by single-cell technologies. A cell type is a quasi-discrete attractor: stable, reproducible, and biologically meaningful, yet embedded in a continuous landscape of related states. By adopting this framework, immunologists can break their silo and move from cataloguing immune cell types to understanding the dynamic forces that generate and maintain immune cell identity.




















English (US) ·
French (CA) ·