Universal Cell Embeddings: A Foundation Model for Cell Biology DOI Creative Commons
Yanay Rosen, Yusuf Roohani, Ayush Agrawal

et al.

bioRxiv (Cold Spring Harbor Laboratory), Journal Year: 2023, Volume and Issue: unknown

Published: Nov. 29, 2023

Developing a universal representation of cells which encompasses the tremendous molecular diversity cell types within human body and more generally, across species, would be transformative for biology. Recent work using single-cell transcriptomic approaches to create definitions in form atlases has provided necessary data such an endeavor. Here, we present Universal Cell Embedding (UCE) foundation model. UCE was trained on corpus atlas from other species completely self-supervised way without any annotations. offers unified biological latent space that can represent cell, regardless tissue or species. This embedding captures important variation despite presence experimental noise diverse datasets. An aspect UCE's universality is new organism mapped this with no additional labeling, model training fine-tuning. We applied Integrated Mega-scale Atlas, 36 million cells, than 1,000 uniquely named types, hundreds experiments, dozens tissues eight uncovered insights about organization space, leveraged it infer function newly discovered types. exhibits emergent behavior, uncovering biology never explicitly for, as identifying developmental lineages novel not included set. Overall, by enabling every state type, provides valuable tool analysis, annotation hypothesis generation scale single datasets continues grow.

Language: Английский

Methods and applications for single-cell and spatial multi-omics DOI Open Access
Katy Vandereyken, Alejandro Sifrim, Bernard Thienpont

et al.

Nature Reviews Genetics, Journal Year: 2023, Volume and Issue: 24(8), P. 494 - 515

Published: March 2, 2023

Language: Английский

Citations

598

Best practices for single-cell analysis across modalities DOI Open Access
Lukas Heumos, Anna C. Schaar, Christopher Lance

et al.

Nature Reviews Genetics, Journal Year: 2023, Volume and Issue: 24(8), P. 550 - 572

Published: March 31, 2023

Language: Английский

Citations

506

Multi-omics single-cell data integration and regulatory inference with graph-linked embedding DOI Creative Commons
Zhi‐Jie Cao, Ge Gao

Nature Biotechnology, Journal Year: 2022, Volume and Issue: 40(10), P. 1458 - 1466

Published: May 2, 2022

Despite the emergence of experimental methods for simultaneous measurement multiple omics modalities in single cells, most single-cell datasets include only one modality. A major obstacle integrating data from is that different layers typically have distinct feature spaces. Here, we propose a computational framework called GLUE (graph-linked unified embedding), which bridges gap by modeling regulatory interactions across explicitly. Systematic benchmarking demonstrated more accurate, robust and scalable than state-of-the-art tools heterogeneous multi-omics data. We applied to various challenging tasks, including triple-omics integration, integrative inference human cell atlas construction over millions where was able correct previous annotations. features modular design can be flexibly extended enhanced new analysis tasks. The full package available online at https://github.com/gao-lab/GLUE .

Language: Английский

Citations

310

Spatial components of molecular tissue biology DOI
Giovanni Palla, David S. Fischer, Aviv Regev

et al.

Nature Biotechnology, Journal Year: 2022, Volume and Issue: 40(3), P. 308 - 318

Published: Feb. 7, 2022

Language: Английский

Citations

234

A harmonized atlas of mouse spinal cord cell types and their spatial organization DOI Creative Commons
D. Russ, Ryan B. Patterson Cross, Li Li

et al.

Nature Communications, Journal Year: 2021, Volume and Issue: 12(1)

Published: Sept. 29, 2021

Abstract Single-cell RNA sequencing data can unveil the molecular diversity of cell types. Cell type atlases mouse spinal cord have been published in recent years but not integrated together. Here, we generate an atlas types based on single-cell transcriptomic data, unifying available datasets into a common reference framework. We report hierarchical structure postnatal relationships, with location providing highest level organization, then neurotransmitter status, family, and finally, dozens refined populations. validate combinatorial marker code for each neuronal map their spatial distributions adult cord. also show complex lineage relationships among Additionally, develop open-source classifier, SeqSeek, to facilitate standardization identification. This work provides view types, gene expression signatures, organization.

Language: Английский

Citations

181

Gene regulatory network inference in the era of single-cell multi-omics DOI
Pau Badia-i-Mompel, Lorna Wessels, Sophia Müller‐Dott

et al.

Nature Reviews Genetics, Journal Year: 2023, Volume and Issue: 24(11), P. 739 - 754

Published: June 26, 2023

Language: Английский

Citations

179

Single-cell proteomics enabled by next-generation sequencing or mass spectrometry DOI
Hayley M. Bennett, William Stephenson, Christopher M. Rose

et al.

Nature Methods, Journal Year: 2023, Volume and Issue: 20(3), P. 363 - 374

Published: March 1, 2023

Language: Английский

Citations

176

Characterizing cis-regulatory elements using single-cell epigenomics DOI
Sebastian Preißl, Kyle J. Gaulton, Bing Ren

et al.

Nature Reviews Genetics, Journal Year: 2022, Volume and Issue: 24(1), P. 21 - 43

Published: July 15, 2022

Language: Английский

Citations

147

Single-cell atlases: shared and tissue-specific cell types across human organs DOI
Rasa Elmentaite, Cecilia Domínguez Conde, Lu Yang

et al.

Nature Reviews Genetics, Journal Year: 2022, Volume and Issue: 23(7), P. 395 - 410

Published: Feb. 25, 2022

Language: Английский

Citations

129

MultiVI: deep generative model for the integration of multimodal data DOI Creative Commons
Tal Ashuach, Mariano I. Gabitto,

Rohan V. Koodli

et al.

Nature Methods, Journal Year: 2023, Volume and Issue: 20(8), P. 1222 - 1231

Published: June 29, 2023

Jointly profiling the transcriptome, chromatin accessibility and other molecular properties of single cells offers a powerful way to study cellular diversity. Here we present MultiVI, probabilistic model analyze such multiomic data leverage it enhance single-modality datasets. MultiVI creates joint representation that allows an analysis all modalities included in input data, even for which one or more are missing. It is available at scvi-tools.org .

Language: Английский

Citations

122