Cardiovascular
diseases
(CVD)
continue
to
pose
significant
health
risks
globally,
accentuating
the
need
for
early
and
precise
detection
mechanisms.
With
evolution
of
computational
methods
in
healthcare,
machine
learning
offers
transformative
solutions
diagnostic
accuracy.
This
research
aims
identify
an
algorithm
with
consistent
performance
across
multiple
datasets
potential
integration
into
a
cardiac
disease
prediction
platform.
We
examined
nine
prominent
algorithms,
namely
Support
Vector
Machine
(SVM),
Gradient
Boosting
(GB),
Random
Forest
(RF),
Logis-tic
Regression
(LR),
Decision
Tree
(DT),
K-Nearest
Neigh-bor
(KNN),
Naive
Bayes
(NB),
Extreme
(XGBoost),
Multilayer
Perceptron
(MLP),
evalu-ated
their
predictive
two
heterogeneous
datasets.
Both
encompass
14
attributes
but
differ
instance
sizes:
303
1025,
respectively.
Through
meticulous
methodological
framework,
data
underwent
preprocessing,
splitting,
model
training,
followed
by
validation
using
metrics
such
as
Precision,
Recall,
F1
score,
Accuracy,
coupled
confusion
matrix
detailed
class-based
evaluation.
Our
findings
revealed
that
MLP
algorithms
exhibited
superior
consistency
robustness
both
datasets,
achieving
peak
accuracy
95.14%.
While
XGBoost
performed
proficiently
on
one
dataset,
its
wavered
cross-dataset
scenario.
Based
these
findings,
either
or
models
are
recommended
developing
robust
heart
system.
not
only
affirms
revolutionizing
CVD
diagnostics
also
underscores
importance
selection
based
dataset
characteristics.
Informatics in Medicine Unlocked,
Journal Year:
2024,
Volume and Issue:
44, P. 101442 - 101442
Published: Jan. 1, 2024
Cardiovascular
disease
(CVD),
generally
called
heart
illness,
is
a
collective
term
for
various
ailments
that
affect
the
and
blood
vessels.
Heart
primary
cause
of
fatality
morbidity
in
people
worldwide,
resulting
18
million
deaths
per
year.
By
identifying
those
who
are
most
vulnerable
to
diseases
ensuring
they
receive
appropriate
care,
premature
demise
can
be
prevented.
Machine
learning
algorithms
now
crucial
medical
field,
especially
when
using
databases
diagnose
diseases.
Such
efficient
data
processing
techniques
applied
predict
offer
much
potential
accurate
prognosis.
Therefore,
this
study
compares
performance
logistic
regression,
decision
tree,
support
vector
machine
(SVM)
methods
with
without
Boruta
feature
selection.
The
Cleveland
clinic
dataset
acquired
from
Kaggle,
which
consists
14
features
303
instances,
was
used
investigation.
It
found
selection
algorithm,
selects
six
relevant
features,
improved
results
algorithms.
Among
these
classification
algorithms,
regression
produced
result,
an
accuracy
88.52
%.
Preventive Medicine Reports,
Journal Year:
2023,
Volume and Issue:
35, P. 102358 - 102358
Published: Aug. 19, 2023
Diabetes
is
a
chronic
metabolic
disease
characterized
by
hyperglycemia,
the
follow-up
management
of
diabetes
patients
mostly
in
community,
but
relationship
between
key
lifestyle
indicators
community
and
risk
unclear.
In
order
to
explore
association
life
characteristic
diabetes,
252,176
records
people
with
from
2016
2023
were
obtained
Haizhu
District,
Guangzhou.
According
data,
that
affect
are
determined,
optimal
feature
subset
through
selection
technology
accurately
assess
diabetes.
A
assessment
model
based
on
random
forest
classifier
was
designed,
which
used
parameter
algorithm
comparison,
an
accuracy
91.24%
AUC
corresponding
ROC
curve
97%.
improve
applicability
clinical
real
life,
score
card
designed
tested
using
original
95.15%,
reliability
high.
The
prediction
big
data
mining
can
be
for
large-scale
screening
early
warning
doctors
patient
further
promoting
prevention
control
strategies,
also
wearable
devices
or
intelligent
biosensors
individual
self
examination,
reduce
factor
levels.
Computer Methods in Biomechanics & Biomedical Engineering,
Journal Year:
2025,
Volume and Issue:
unknown, P. 1 - 22
Published: Feb. 5, 2025
Cardiovascular
disease
is
a
leading
cause
of
mortality,
necessitating
early
and
precise
prediction
for
improved
patient
outcomes.
This
study
proposes
Quantum-HeartDiseaseNet,
novel
heart
risk
framework
that
integrates
Dynamic
Opposite
Pufferfish
Optimization
Algorithm
feature
selection
Quantum
Attention-based
Bidirectional
Gated
Recurrent
Unit
(QABiGRU)
accurate
diagnosis.
The
method
enhances
diagnosis
accuracy
while
reducing
dimensionality,
Synthetic
Minority
Oversampling
Technique
(SMOTE)
addresses
data
imbalance.
Evaluated
on
three
datasets,
the
proposed
model
achieved
98.87%
accuracy,
98.74%
precision,
98.56%
recall,
outperforming
conventional
methods.
Experimental
results
validate
its
effectiveness
in
prediction.
PLoS ONE,
Journal Year:
2024,
Volume and Issue:
19(5), P. e0302595 - e0302595
Published: May 8, 2024
Diabetes
Mellitus
is
one
of
the
oldest
diseases
known
to
humankind,
dating
back
ancient
Egypt.
The
disease
a
chronic
metabolic
disorder
that
heavily
burdens
healthcare
providers
worldwide
due
steady
increment
patients
yearly.
Worryingly,
diabetes
affects
not
only
aging
population
but
also
children.
It
prevalent
control
this
problem,
as
can
lead
many
health
complications.
As
evolution
happens,
humankind
starts
integrating
computer
technology
with
system.
utilization
artificial
intelligence
assists
be
more
efficient
in
diagnosing
patients,
better
delivery,
and
patient
eccentric.
Among
advanced
data
mining
techniques
intelligence,
stacking
among
most
prominent
methods
applied
domain.
Hence,
study
opts
investigate
potential
ensembles.
aim
reduce
high
complexity
inherent
stacking,
problem
contributes
longer
training
time
reduces
outliers
improve
classification
performance.
In
addressing
concern,
novel
machine
learning
method
called
Stacking
Recursive
Feature
Elimination-Isolation
Forest
was
introduced
for
prediction.
application
Elimination
design
an
model
diagnosis
while
using
fewer
features
resources.
This
incorporates
Isolation
outlier
removal
method.
uses
accuracy,
precision,
recall,
F1
measure,
time,
standard
deviation
metrics
identify
performances.
proposed
acquired
accuracy
79.077%
PIMA
Indians
97.446%
Prediction
dataset,
outperforming
existing
demonstrating
effectiveness
International Journal of Intelligent Systems and Applications,
Journal Year:
2024,
Volume and Issue:
16(1), P. 11 - 23
Published: Jan. 30, 2024
Recently
data
clustering
algorithm
under
machine
learning
are
used
in
‘real-life
data’
to
segregate
them
based
on
the
outcome
of
a
phenomenon.
In
this
paper,
diabetes
is
detected
from
pathological
768
patients
using
four
algorithms:
Fuzzy
C-Means
(FCM),
K-means
clustering,
Inference
system
(FIS)
and
Support
Vector
Machine
(SVM).
Our
main
objective
make
binary
classification
table
sense
that
presence
or
absence
patient.
We
combined
algorithms
entropy-based
probability
enhance
accuracy
detection.
Before
applying
combining
scheme,
we
reduce
size
variables
multiple
linear
regression
(MLR)
then
logistic
again
applied
resultant
keep
outlier
within
narrow
range.
Finally,
entropy
scheme
with
some
modification
ML
got
detection
about
94%
technique.
Data
mining
involves
extracting
valuable
information
from
a
substantial
quantity
of
data.Numerous
applications
for
data
can
be
found
in
different
areas
such
as
finance,
healthcare
&
marketing.
Presently,
hold
major
significance
key
field
precise
disease
prognosis
and
in-depth
exploration
medical
information.
Researchers
employ
varied
minding
technique
help
diagnose
range
illnesses,
including
cancer,
diabetes,
heart
conditions.
Diabetes
has
emerged
modern-day
with
causing
almost
the
highest
number
deaths
around
globe.
It
severely
affects
renal
cardiac
functions
patient
along
eyesight
loss
leading
to
other
diseases
body.
In
this
paper,
we
made
use
Kaggle's
PIMA
Indian
Set.
We
deploy
four
strategies:
logistic
regression,
SVM
(Support
Vector
Machine),
KNN
(K-nearest
neighbor),
Random
Forest.
An
assessment
algorithms'
performances
is
based
on
an
array
metrics,
spanning
Recall,
Accuracy,
Precision,
F-measure.
The
Forest
Classifier
fares
better
compared
alternatives
classification
methods
when
speaking
accuracy
(80.08%).
IEEE Access,
Journal Year:
2024,
Volume and Issue:
12, P. 120454 - 120473
Published: Jan. 1, 2024
Cardiovascular
diseases
(CVD)
persist
as
a
formidable
global
health
challenge,
underscoring
the
imperative
for
advanced
early
detection
mechanisms.
The
evolution
of
computational
methods
within
healthcare
has
paved
way
transformative
applications
machine
learning,
offering
solutions
that
enhance
diagnostic
accuracy
and
contribute
to
SDG-3;
Good
Health
Well-Being.
This
study
aims
identify
an
algorithm
with
consistent
performance
across
diverse
datasets
integrate
it
into
comprehensive
user-centric
approach
heart
disease
prediction.
investigation
includes
examination
eight
learning
algorithms,
three
deep
four
heterogeneous
from
Kaggle.
predictive
these
algorithms
is
assessed
through
measures
include
Precision,
Recall,
F1
score,
Accuracy,
Area
Under
Curve
(AUC).
A
Principal
Component
Analysis
(PCA)
feature
engineering
presented
boost
performance.
An
alternative
selection
method,
Lasso,
was
explored,
PCA
emerging
optimal
choice
in
given
datasets.
As
such,
XGBoost
achieves
impressive
rate
score
around
99%
along
excellent
97%
AUC
prediction
on
other
dataset.
selected
model
integrated
user-friendly
web
application,
providing
holistic
platform
management.
Furthermore,
we
recommended
RPA,
IoTM,
AI-based
tailored
solution
make
our
application
more
reliable,
which
have
proven
attainable.