MPKU‐Net: A U‐Shaped Medical Image Segmentation Network Based on MLP and KAN
Peng Chen,
No information about this author
Huihui Wang,
No information about this author
Qin Jin
No information about this author
et al.
International Journal of Imaging Systems and Technology,
Journal Year:
2025,
Volume and Issue:
35(3)
Published: May 1, 2025
ABSTRACT
The
UNET
architecture
has
been
widely
adopted
for
image
segmentation
across
various
domains,
owing
to
its
efficient
and
powerful
performance
in
recent
years.
Its
application
enhancement
medical
primarily
involve
convolutional
neural
network
(CNN)
Transformer.
However,
both
methods
have
fundamental
limitations.
CNN
struggle
capture
global
features,
which
greatly
reduces
the
computational
complexity
but
compromises
effectiveness.
Transformers
excel
at
capturing
features
demand
substantial
parameters
computations
fail
effectively
extract
local
features.
To
address
these
challenges,
we
propose
a
U‐shaped
model,
MPKU‐NET,
integrates
multilayer
perception
(MLP)
with
Knowledge‐Aware
Networks
(KAN)
architecture,
aiming
characteristics
coordinated
manner.
MPKU‐NET
flexible
rolling
Flip
operation
that,
along
MLP
Network
(KAN),
creates
WE‐MPK
modules
thorough
learning
of
effectiveness
is
proven
by
extensive
testing
on
BUSI,
CVC,
GlaS
datasets.
results
demonstrate
that
MPKU‐Net
consistently
outperforms
several
used
networks,
including
U‐KAN,
Rolling‐U‐net,
U‐Net
++,
terms
model
accuracy,
highlighting
as
scalable
solution
segmentation.
code
uploaded:
https://github.com/cp668688/MPKU‐Net
.
Language: Английский
Attention-enhanced Separable Residual with Dilation Net for Medical Image Segmentation
Neurocomputing,
Journal Year:
2025,
Volume and Issue:
unknown, P. 130434 - 130434
Published: May 1, 2025
Language: Английский
Adapting Classification Neural Network Architectures for Medical Image Segmentation Using Explainable AI
Journal of Imaging,
Journal Year:
2025,
Volume and Issue:
11(2), P. 55 - 55
Published: Feb. 13, 2025
Segmentation
neural
networks
are
widely
used
in
medical
imaging
to
identify
anomalies
that
may
impact
patient
health.
Despite
their
effectiveness,
these
face
significant
challenges,
including
the
need
for
extensive
annotated
data,
time-consuming
manual
segmentation
processes
and
restricted
data
access
due
privacy
concerns.
In
contrast,
classification
networks,
similar
capture
essential
parameters
identifying
objects
during
training.
This
paper
leverages
this
characteristic,
combined
with
explainable
artificial
intelligence
(XAI)
techniques,
address
challenges
of
segmentation.
By
adapting
tasks,
proposed
approach
reduces
dependency
on
To
demonstrate
concept,
Medical
Decathlon
'Brain
Tumours'
dataset
was
utilised.
A
ResNet
network
trained,
XAI
tools
were
applied
generate
segmentation-like
outputs.
Our
findings
reveal
GuidedBackprop
is
among
most
efficient
effective
methods,
producing
heatmaps
closely
resemble
masks
by
accurately
highlighting
entirety
target
object.
Language: Английский
MSA‐MaxNet: Multi‐Scale Attention Enhanced Multi‐Axis Vision Transformer Network for Medical Image Segmentation
Journal of Cellular and Molecular Medicine,
Journal Year:
2024,
Volume and Issue:
28(24)
Published: Dec. 1, 2024
ABSTRACT
Convolutional
neural
networks
(CNNs)
are
well
established
in
handling
local
features
visual
tasks;
yet,
they
falter
managing
complex
spatial
relationships
and
long‐range
dependencies
that
crucial
for
medical
image
segmentation,
particularly
identifying
pathological
changes.
While
vision
transformer
(ViT)
excels
addressing
dependencies,
their
ability
to
leverage
remains
inadequate.
Recent
ViT
variants
have
merged
CNNs
improve
feature
representation
segmentation
outcomes,
yet
challenges
with
limited
receptive
fields
precise
persist.
In
this
work,
we
propose
MSA‐MaxNet.
Specifically,
our
model
utilises
an
encoder–decoder
structure,
using
MaxViT
blocks
apply
multi‐axis
self‐attention
(Max‐SA)
as
the
encoder
global
extraction.
To
restore
map's
resolution
during
upsampling
operations,
a
symmetric
block–based
decoder
layers
employed.
address
mismatches
skip
connections
of
UNet
architecture,
introduce
convolutional
block
attention
module
(CBAM).
Furthermore,
design
multi‐scale
(MCBAM)
based
on
CBAM,
which
enhance
refine
connection.
We
evaluate
performance
MSA‐MaxNet
three
publicly
available
imaging
datasets,
including
Synapse
multi‐organ
ACDC
cardiac
analysis
Kvasir‐SEG
gastrointestinal
polyp
detection.
Notably,
achieves
state‐of‐the‐art
(SOTA)
Dice
scores
85.59%
95.26%
respectively,
40.28
M
parameters.
Additionally,
two
smaller
versions
meet
demands
various
scenarios.
summary,
work
provides
robust
framework
diverse
tasks,
offering
potential
applications
early
cancer
detection,
cardiovascular
disease
diagnosis
comprehensive
organ‐level
assessments.
Language: Английский