Nature Communications,
Journal Year:
2023,
Volume and Issue:
14(1)
Published: Jan. 24, 2023
Single
cell
data
integration
methods
aim
to
integrate
cells
across
batches
and
modalities,
tasks
can
be
categorized
into
horizontal,
vertical,
diagonal,
mosaic
integration,
where
is
the
most
general
challenging
case
with
few
developed.
We
propose
scMoMaT,
a
method
that
able
single
multi-omics
under
scenario
using
matrix
tri-factorization.
During
scMoMaT
also
uncover
cluster
specific
bio-markers
modalities.
These
multi-modal
are
used
interpret
annotate
clusters
types.
Moreover,
unequal
type
compositions.
Applying
multiple
real
simulated
datasets
demonstrated
these
features
of
showed
has
superior
performance
compared
existing
methods.
Specifically,
we
show
integrated
embedding
combined
learned
lead
annotations
higher
quality
or
resolution
their
original
annotations.
Cells,
Journal Year:
2023,
Volume and Issue:
12(15), P. 1970 - 1970
Published: July 30, 2023
Single-cell
RNA
sequencing
(scRNA-seq)
has
emerged
as
a
powerful
tool
for
investigating
cellular
biology
at
an
unprecedented
resolution,
enabling
the
characterization
of
heterogeneity,
identification
rare
but
significant
cell
types,
and
exploration
cell-cell
communications
interactions.
Its
broad
applications
span
both
basic
clinical
research
domains.
In
this
comprehensive
review,
we
survey
current
landscape
scRNA-seq
analysis
methods
tools,
focusing
on
count
modeling,
cell-type
annotation,
data
integration,
including
spatial
transcriptomics,
inference
communication.
We
review
challenges
encountered
in
analysis,
issues
sparsity
or
low
expression,
reliability
assumptions
discuss
potential
impact
suboptimal
clustering
differential
expression
tools
downstream
analyses,
particularly
identifying
subpopulations.
Finally,
recent
advancements
future
directions
enhancing
analysis.
Specifically,
highlight
development
novel
annotating
single-cell
data,
integrating
interpreting
multimodal
datasets
covering
epigenomics,
proteomics,
inferring
communication
networks.
By
elucidating
latest
progress
innovation,
provide
overview
rapidly
advancing
field
Nature Neuroscience,
Journal Year:
2024,
Volume and Issue:
unknown
Published: Oct. 14, 2024
Alzheimer's
disease
(AD)
is
the
leading
cause
of
dementia
in
older
adults.
Although
AD
progression
characterized
by
stereotyped
accumulation
proteinopathies,
affected
cellular
populations
remain
understudied.
Here
we
use
multiomics,
spatial
genomics
and
reference
atlases
from
BRAIN
Initiative
to
study
middle
temporal
gyrus
cell
types
84
donors
with
varying
pathologies.
This
cohort
includes
33
male
51
female
donors,
an
average
age
at
time
death
88
years.
We
used
quantitative
neuropathology
place
along
a
pseudoprogression
score.
Pseudoprogression
analysis
revealed
two
phases:
early
phase
slow
increase
pathology,
presence
inflammatory
microglia,
reactive
astrocytes,
loss
somatostatin
Nature Methods,
Journal Year:
2024,
Volume and Issue:
21(7), P. 1196 - 1205
Published: June 13, 2024
Abstract
Single-cell
RNA
sequencing
allows
us
to
model
cellular
state
dynamics
and
fate
decisions
using
expression
similarity
or
velocity
reconstruct
state-change
trajectories;
however,
trajectory
inference
does
not
incorporate
valuable
time
point
information
utilize
additional
modalities,
whereas
methods
that
address
these
different
data
views
cannot
be
combined
do
scale.
Here
we
present
CellRank
2,
a
versatile
scalable
framework
study
multiview
single-cell
of
up
millions
cells
in
unified
fashion.
2
consistently
recovers
terminal
states
probabilities
across
modalities
human
hematopoiesis
endodermal
development.
Our
also
combining
transitions
within
experimental
points,
feature
use
recover
genes
promoting
medullary
thymic
epithelial
cell
formation
during
pharyngeal
endoderm
Moreover,
enable
estimating
cell-specific
transcription
degradation
rates
from
metabolic-labeling
data,
which
apply
an
intestinal
organoid
system
delineate
differentiation
trajectories
pinpoint
regulatory
strategies.
Cell,
Journal Year:
2024,
Volume and Issue:
187(10), P. 2343 - 2358
Published: May 1, 2024
As
the
number
of
single-cell
datasets
continues
to
grow
rapidly,
workflows
that
map
new
data
well-curated
reference
atlases
offer
enormous
promise
for
biological
community.
In
this
perspective,
we
discuss
key
computational
challenges
and
opportunities
reference-mapping
algorithms.
We
how
mapping
algorithms
will
enable
integration
diverse
across
disease
states,
molecular
modalities,
genetic
perturbations,
species
eventually
replace
manual
laborious
unsupervised
clustering
pipelines.
Nature Methods,
Journal Year:
2024,
Volume and Issue:
21(9), P. 1658 - 1667
Published: June 21, 2024
Advances
in
spatial
omics
technologies
now
allow
multiple
types
of
data
to
be
acquired
from
the
same
tissue
slice.
To
realize
full
potential
such
data,
we
need
spatially
informed
methods
for
integration.
Here,
introduce
SpatialGlue,
a
graph
neural
network
model
with
dual-attention
mechanism
that
deciphers
domains
by
intra-omics
integration
location
and
measurement
followed
cross-omics
We
demonstrated
SpatialGlue
on
different
using
technologies,
including
epigenome-transcriptome
transcriptome-proteome
modalities.
Compared
other
methods,
captured
more
anatomical
details
accurately
resolved
as
cortex
layers
brain.
Our
method
also
identified
cell
like
spleen
macrophage
subsets
located
at
three
zones
were
not
available
original
annotations.
scales
well
size
can
used
integrate
multi-omics
analysis
tool
combines
information
complementary
modalities
obtain
holistic
view
cellular
properties.
Nature Biotechnology,
Journal Year:
2024,
Volume and Issue:
42(10), P. 1594 - 1605
Published: Jan. 23, 2024
Abstract
Integrating
single-cell
datasets
produced
by
multiple
omics
technologies
is
essential
for
defining
cellular
heterogeneity.
Mosaic
integration,
in
which
different
share
only
some
of
the
measured
modalities,
poses
major
challenges,
particularly
regarding
modality
alignment
and
batch
effect
removal.
Here,
we
present
a
deep
probabilistic
framework
mosaic
integration
knowledge
transfer
(MIDAS)
multimodal
data.
MIDAS
simultaneously
achieves
dimensionality
reduction,
imputation
correction
data
using
self-supervised
information-theoretic
latent
disentanglement.
We
demonstrate
its
superiority
to
19
other
methods
reliability
evaluating
performance
trimodal
tasks.
also
constructed
atlas
human
peripheral
blood
mononuclear
cells
tailored
learning
reciprocal
reference
mapping
schemes
enable
flexible
accurate
from
new
Applications
pseudotime
analysis
cross-tissue
on
bone
marrow
versatility
MIDAS.
available
at
https://github.com/labomics/midas
.
Briefings in Bioinformatics,
Journal Year:
2024,
Volume and Issue:
25(2)
Published: Jan. 22, 2024
Abstract
Spatially
resolved
transcriptomics
(SRT)
is
a
pioneering
method
for
simultaneously
studying
morphological
contexts
and
gene
expression
at
single-cell
precision.
Data
emerging
from
SRT
are
multifaceted,
presenting
researchers
with
intricate
matrices,
precise
spatial
details
comprehensive
histology
visuals.
Such
rich
datasets,
unfortunately,
render
many
conventional
methods
like
traditional
machine
learning
statistical
models
ineffective.
The
unique
challenges
posed
by
the
specialized
nature
of
data
have
led
scientific
community
to
explore
more
sophisticated
analytical
avenues.
Recent
trends
indicate
an
increasing
reliance
on
deep
algorithms,
especially
in
areas
such
as
clustering,
identification
spatially
variable
genes
alignment
tasks.
In
this
manuscript,
we
provide
rigorous
critique
these
advanced
methodologies,
probing
into
their
merits,
limitations
avenues
further
refinement.
Our
in-depth
analysis
underscores
that
while
recent
innovations
tailored
been
promising,
there
remains
substantial
potential
enhancement.
A
crucial
area
demands
attention
development
can
incorporate
biological
nuances,
phylogeny-aware
processing
or
minuscule
image
segments.
Furthermore,
addressing
elimination
batch
effects,
perfecting
normalization
techniques
countering
overdispersion
zero
inflation
patterns
seen
pivotal.
To
support
broader
endeavors,
meticulously
assembled
directory
readily
accessible
databases,
hoping
serve
foundation
future
research
initiatives.
Computational and Structural Biotechnology Journal,
Journal Year:
2025,
Volume and Issue:
27, P. 265 - 277
Published: Jan. 1, 2025
Despite
the
wealth
of
single-cell
multi-omics
data,
it
remains
challenging
to
predict
consequences
novel
genetic
and
chemical
perturbations
in
human
body.
It
requires
knowledge
molecular
interactions
at
all
biological
levels,
encompassing
disease
models
humans.
Current
machine
learning
methods
primarily
establish
statistical
correlations
between
genotypes
phenotypes
but
struggle
identify
physiologically
significant
causal
factors,
limiting
their
predictive
power.
Key
challenges
modeling
include
scarcity
labeled
generalization
across
different
domains,
disentangling
causation
from
correlation.
In
light
recent
advances
data
integration,
we
propose
a
new
artificial
intelligence
(AI)-powered
biology-inspired
multi-scale
framework
tackle
these
issues.
This
will
integrate
organism
hierarchies,
species
genotype-environment-phenotype
relationships
under
various
conditions.
AI
inspired
by
biology
may
targets,
biomarkers,
pharmaceutical
agents,
personalized
medicines
for
presently
unmet
medical
needs.
Nature,
Journal Year:
2025,
Volume and Issue:
unknown
Published: Jan. 22, 2025
Abstract
Single-cell
genomic
technologies
enable
the
multimodal
profiling
of
millions
cells
across
temporal
and
spatial
dimensions.
However,
experimental
limitations
hinder
comprehensive
measurement
under
native
dynamics
in
their
tissue
niche.
Optimal
transport
has
emerged
as
a
powerful
tool
to
address
these
constraints
facilitated
recovery
original
cellular
context
1–4
.
Yet,
most
optimal
applications
are
unable
incorporate
information
or
scale
single-cell
atlases.
Here
we
introduce
multi-omics
(moscot),
scalable
framework
for
genomics
that
supports
multimodality
all
applications.
We
demonstrate
capability
moscot
efficiently
reconstruct
developmental
trajectories
1.7
million
from
mouse
embryos
20
time
points.
To
illustrate
space,
enrich
transcriptomic
datasets
by
mapping
profiles
liver
sample
align
multiple
coronal
sections
brain.
present
moscot.spatiotemporal,
an
approach
leverages
gene-expression
data
both
dimensions
uncover
spatiotemporal
embryogenesis.
also
resolve
endocrine-lineage
relationships
delta
epsilon
previously
unpublished
mouse,
time-resolved
pancreas
development
dataset
using
paired
measurements
gene
expression
chromatin
accessibility.
Our
findings
confirmed
through
validation
NEUROD2
regulator
progenitor
model
human
induced
pluripotent
stem
cell
islet
differentiation.
Moscot
is
available
open-source
software,
accompanied
extensive
documentation.