PLOS Digital Health,
Journal Year:
2022,
Volume and Issue:
1(10), P. e0000102 - e0000102
Published: Oct. 6, 2022
The
availability
of
large,
deidentified
health
datasets
has
enabled
significant
innovation
in
using
machine
learning
(ML)
to
better
understand
patients
and
their
diseases.
However,
questions
remain
regarding
the
true
privacy
this
data,
patient
control
over
how
we
regulate
data
sharing
a
way
that
does
not
encumber
progress
or
further
potentiate
biases
for
underrepresented
populations.
After
reviewing
literature
on
potential
reidentifications
publicly
available
datasets,
argue
cost—measured
terms
access
future
medical
innovations
clinical
software—of
slowing
ML
is
too
great
limit
through
large
databases
concerns
imperfect
anonymization.
This
cost
especially
developing
countries
where
barriers
preventing
inclusion
such
will
continue
rise,
excluding
these
populations
increasing
existing
favor
high-income
countries.
Preventing
artificial
intelligence’s
towards
precision
medicine
sliding
back
practice
dogma
may
pose
larger
threat
than
reidentification
within
datasets.
While
risk
should
be
minimized,
believe
never
zero,
society
determine
an
acceptable
threshold
below
which
can
occur—for
benefit
global
knowledge
system.
Genome Medicine,
Journal Year:
2024,
Volume and Issue:
16(1)
Published: March 27, 2024
Abstract
Histopathology
and
genomic
profiling
are
cornerstones
of
precision
oncology
routinely
obtained
for
patients
with
cancer.
Traditionally,
histopathology
slides
manually
reviewed
by
highly
trained
pathologists.
Genomic
data,
on
the
other
hand,
is
evaluated
engineered
computational
pipelines.
In
both
applications,
advent
modern
artificial
intelligence
methods,
specifically
machine
learning
(ML)
deep
(DL),
have
opened
up
a
fundamentally
new
way
extracting
actionable
insights
from
raw
which
could
augment
potentially
replace
some
aspects
traditional
evaluation
workflows.
this
review,
we
summarize
current
emerging
applications
DL
in
genomics,
including
basic
diagnostic
as
well
advanced
prognostic
tasks.
Based
growing
body
evidence,
suggest
that
be
groundwork
kind
workflow
cancer
research.
However,
also
point
out
models
can
biases
flaws
users
healthcare
research
need
to
know
about,
propose
ways
address
them.
PLOS Digital Health,
Journal Year:
2022,
Volume and Issue:
1(10), P. e0000102 - e0000102
Published: Oct. 6, 2022
The
availability
of
large,
deidentified
health
datasets
has
enabled
significant
innovation
in
using
machine
learning
(ML)
to
better
understand
patients
and
their
diseases.
However,
questions
remain
regarding
the
true
privacy
this
data,
patient
control
over
how
we
regulate
data
sharing
a
way
that
does
not
encumber
progress
or
further
potentiate
biases
for
underrepresented
populations.
After
reviewing
literature
on
potential
reidentifications
publicly
available
datasets,
argue
cost—measured
terms
access
future
medical
innovations
clinical
software—of
slowing
ML
is
too
great
limit
through
large
databases
concerns
imperfect
anonymization.
This
cost
especially
developing
countries
where
barriers
preventing
inclusion
such
will
continue
rise,
excluding
these
populations
increasing
existing
favor
high-income
countries.
Preventing
artificial
intelligence’s
towards
precision
medicine
sliding
back
practice
dogma
may
pose
larger
threat
than
reidentification
within
datasets.
While
risk
should
be
minimized,
believe
never
zero,
society
determine
an
acceptable
threshold
below
which
can
occur—for
benefit
global
knowledge
system.