Genome biology,
Journal Year:
2024,
Volume and Issue:
25(1)
Published: March 19, 2024
DANCE
is
the
first
standard,
generic,
and
extensible
benchmark
platform
for
accessing
evaluating
computational
methods
across
spectrum
of
datasets
numerous
single-cell
analysis
tasks.
Currently,
supports
3
modules
8
popular
tasks
with
32
state-of-art
on
21
datasets.
People
can
easily
reproduce
results
supported
algorithms
major
via
minimal
efforts,
such
as
using
only
one
command
line.
In
addition,
provides
an
ecosystem
deep
learning
architectures
tools
researchers
to
facilitate
their
own
model
development.
open-source
Python
package
that
welcomes
all
kinds
contributions.
arXiv (Cornell University),
Journal Year:
2021,
Volume and Issue:
unknown
Published: Jan. 1, 2021
AI
is
undergoing
a
paradigm
shift
with
the
rise
of
models
(e.g.,
BERT,
DALL-E,
GPT-3)
that
are
trained
on
broad
data
at
scale
and
adaptable
to
wide
range
downstream
tasks.
We
call
these
foundation
underscore
their
critically
central
yet
incomplete
character.
This
report
provides
thorough
account
opportunities
risks
models,
ranging
from
capabilities
language,
vision,
robotics,
reasoning,
human
interaction)
technical
principles(e.g.,
model
architectures,
training
procedures,
data,
systems,
security,
evaluation,
theory)
applications
law,
healthcare,
education)
societal
impact
inequity,
misuse,
economic
environmental
impact,
legal
ethical
considerations).
Though
based
standard
deep
learning
transfer
learning,
results
in
new
emergent
capabilities,and
effectiveness
across
so
many
tasks
incentivizes
homogenization.
Homogenization
powerful
leverage
but
demands
caution,
as
defects
inherited
by
all
adapted
downstream.
Despite
impending
widespread
deployment
we
currently
lack
clear
understanding
how
they
work,
when
fail,
what
even
capable
due
properties.
To
tackle
questions,
believe
much
critical
research
will
require
interdisciplinary
collaboration
commensurate
fundamentally
sociotechnical
nature.
Nature Communications,
Journal Year:
2022,
Volume and Issue:
13(1)
Published: April 1, 2022
Deep
Learning
(DL)
has
recently
enabled
unprecedented
advances
in
one
of
the
grand
challenges
computational
biology:
half-century-old
problem
protein
structure
prediction.
In
this
paper
we
discuss
recent
advances,
limitations,
and
future
perspectives
DL
on
five
broad
areas:
prediction,
function
genome
engineering,
systems
biology
data
integration,
phylogenetic
inference.
We
each
application
area
cover
main
bottlenecks
approaches,
such
as
training
data,
scope,
ability
to
leverage
existing
architectures
new
contexts.
To
conclude,
provide
a
summary
subject-specific
general
for
across
biosciences.
Genome biology,
Journal Year:
2021,
Volume and Issue:
22(1)
Published: Dec. 28, 2021
Abstract
A
growing
number
of
single-cell
sequencing
platforms
enable
joint
profiling
multiple
omics
from
the
same
cells.
We
present
,
a
novel
method
that
not
only
allows
for
analyzing
data
joint-modality
platforms,
but
provides
coherent
framework
integration
datasets
measured
on
different
modalities.
demonstrate
its
performance
multi-modality
gene
expression
and
chromatin
accessibility
illustrate
abilities
by
jointly
this
with
RNA-seq
ATAC-seq
datasets.
Briefings in Bioinformatics,
Journal Year:
2022,
Volume and Issue:
23(5)
Published: April 27, 2022
Abstract
Precision
medicine
uses
genetic,
environmental
and
lifestyle
factors
to
more
accurately
diagnose
treat
disease
in
specific
groups
of
patients,
it
is
considered
one
the
most
promising
medical
efforts
our
time.
The
use
genetics
arguably
data-rich
complex
components
precision
medicine.
grand
challenge
today
successful
assimilation
into
that
translates
across
different
ancestries,
diverse
diseases
other
distinct
populations,
which
will
require
clever
artificial
intelligence
(AI)
machine
learning
(ML)
methods.
Our
goal
here
was
review
compare
scientific
objectives,
methodologies,
datasets,
data
sources,
ethics
gaps
AI/ML
approaches
used
genomics
We
selected
high-quality
literature
published
within
last
5
years
were
indexed
available
through
PubMed
Central.
scope
narrowed
articles
reported
application
algorithms
for
statistical
predictive
analyses
using
whole
genome
and/or
exome
sequencing
gene
variants,
RNA-seq
microarrays
expression.
did
not
limit
search
or
sources.
Based
on
comparative
analysis
criteria,
we
identified
32
applied
variable
studies
report
widely
adapted
diagnostics
several
diseases.
Biomedicine & Pharmacotherapy,
Journal Year:
2023,
Volume and Issue:
165, P. 115077 - 115077
Published: July 1, 2023
Traditional
bulk
sequencing
methods
are
limited
to
measuring
the
average
signal
in
a
group
of
cells,
potentially
masking
heterogeneity,
and
rare
populations.
The
single-cell
resolution,
however,
enhances
our
understanding
complex
biological
systems
diseases,
such
as
cancer,
immune
system,
chronic
diseases.
However,
technologies
generate
massive
amounts
data
that
often
high-dimensional,
sparse,
complex,
thus
making
analysis
with
traditional
computational
approaches
difficult
unfeasible.
To
tackle
these
challenges,
many
turning
deep
learning
(DL)
potential
alternatives
conventional
machine
(ML)
algorithms
for
studies.
DL
is
branch
ML
capable
extracting
high-level
features
from
raw
inputs
multiple
stages.
Compared
ML,
models
have
provided
significant
improvements
across
domains
applications.
In
this
work,
we
examine
applications
genomics,
transcriptomics,
spatial
multi-omics
integration,
address
whether
techniques
will
prove
be
advantageous
or
if
omics
domain
poses
unique
challenges.
Through
systematic
literature
review,
found
has
not
yet
revolutionized
most
pressing
challenges
field.
using
shown
promising
results
(in
cases
outperforming
previous
state-of-the-art
models)
preprocessing
downstream
analysis.
Although
developments
generally
been
gradual,
recent
advances
reveal
can
offer
valuable
resources
fast-tracking
advancing
research
single-cell.
Cell,
Journal Year:
2024,
Volume and Issue:
187(10), P. 2343 - 2358
Published: May 1, 2024
As
the
number
of
single-cell
datasets
continues
to
grow
rapidly,
workflows
that
map
new
data
well-curated
reference
atlases
offer
enormous
promise
for
biological
community.
In
this
perspective,
we
discuss
key
computational
challenges
and
opportunities
reference-mapping
algorithms.
We
how
mapping
algorithms
will
enable
integration
diverse
across
disease
states,
molecular
modalities,
genetic
perturbations,
species
eventually
replace
manual
laborious
unsupervised
clustering
pipelines.