BMC Biology,
Journal Year:
2025,
Volume and Issue:
23(1)
Published: April 23, 2025
Abstract
Background
Numerous
studies
have
shown
that
circRNA
can
act
as
a
miRNA
sponge,
competitively
binding
to
miRNAs,
thereby
regulating
gene
expression
and
disease
progression.
Due
the
high
cost
time-consuming
nature
of
traditional
wet
lab
experiments,
analyzing
circRNA-miRNA
associations
is
often
inefficient
labor-intensive.
Although
some
computational
models
been
developed
identify
these
associations,
they
fail
capture
deep
collaborative
features
between
interactions
do
not
guide
training
feature
extraction
networks
based
on
high-order
relationships,
leading
poor
prediction
performance.
Results
To
address
issues,
we
innovatively
propose
novel
graph
collaboration
learning
method
for
interaction,
called
DGCLCMI.
First,
it
uses
word2vec
encode
sequences
into
word
embeddings.
Next,
present
joint
model
combines
an
improved
neural
filtering
with
network
optimization.
Deep
interaction
information
embedded
informative
within
sequence
representations
prediction.
Comprehensive
experiments
three
well-established
datasets
across
seven
metrics
demonstrate
our
algorithm
significantly
outperforms
previous
models,
achieving
average
AUC
0.960.
In
addition,
case
study
reveals
18
out
20
predicted
unknown
CMI
data
points
are
accurate.
Conclusions
The
DGCLCMI
improves
representation
by
capturing
information,
superior
performance
compared
prior
methods.
It
facilitates
discovery
sheds
light
their
roles
in
physiological
processes.
BMC Biology,
Journal Year:
2024,
Volume and Issue:
22(1)
Published: Jan. 29, 2024
Abstract
Background
Circular
RNAs
(circRNAs)
have
been
confirmed
to
play
a
vital
role
in
the
occurrence
and
development
of
diseases.
Exploring
relationship
between
circRNAs
diseases
is
far-reaching
significance
for
studying
etiopathogenesis
treating
To
this
end,
based
on
graph
Markov
neural
network
algorithm
(GMNN)
constructed
our
previous
work
GMNN2CD,
we
further
considered
multisource
biological
data
that
affects
association
circRNA
disease
developed
an
updated
web
server
CircDA
human
hepatocellular
carcinoma
(HCC)
tissue
verify
prediction
results
CircDA.
Results
built
Tumarkov-based
deep
learning
framework.
The
regards
biomolecules
as
nodes
interactions
molecules
edges,
reasonably
abstracts
multiomics
data,
models
them
heterogeneous
biomolecular
network,
which
can
reflect
complex
different
biomolecules.
Case
studies
using
literature
from
HCC,
cervical,
gastric
cancers
demonstrate
predictor
identify
missing
associations
known
diseases,
quantitative
real-time
PCR
(RT-qPCR)
experiment
HCC
samples,
it
was
found
five
were
significantly
differentially
expressed,
proved
predict
related
new
circRNAs.
Conclusions
This
efficient
computational
case
analysis
with
sufficient
feedback
allows
us
circRNA-associated
disease-associated
Our
provides
method
provide
guidance
certain
For
ease
use,
online
(
http://server.malab.cn/CircDA
)
provided,
code
open-sourced
https://github.com/nmt315320/CircDA.git
convenience
improvement.
BMC Biology,
Journal Year:
2024,
Volume and Issue:
22(1)
Published: Jan. 2, 2024
Abstract
Intrinsically
disordered
proteins
and
regions
(IDPs/IDRs)
are
functionally
important
that
exist
as
highly
dynamic
conformations
under
natural
physiological
conditions.
IDPs/IDRs
exhibit
a
broad
range
of
molecular
functions,
their
functions
involve
binding
interactions
with
partners
remaining
native
structural
flexibility.
The
rapid
increase
in
the
number
sequence
databases
diversity
challenge
existing
computational
methods
for
predicting
protein
intrinsic
disorder
functions.
A
region
interacts
different
to
perform
multiple
these
dependencies
correlations.
In
this
study,
we
introduce
DisoFLAG,
method
leverages
graph-based
interaction
language
model
(GiPLM)
jointly
its
potential
GiPLM
integrates
semantic
information
based
on
pre-trained
models
into
units
enhance
correlation
representation
DisoFLAG
predictor
takes
amino
acid
sequences
only
inputs
provides
predictions
six
proteins,
including
protein-binding,
DNA-binding,
RNA-binding,
ion-binding,
lipid-binding,
flexible
linker.
We
evaluated
predictive
performance
following
Critical
Assessment
Intrinsic
Disorder
(CAID)
experiments,
results
demonstrated
offers
accurate
comprehensive
extending
current
coverage
computationally
predicted
function
categories.
standalone
package
web
server
have
been
established
provide
prediction
tools
disorders
associated
Computers in Biology and Medicine,
Journal Year:
2024,
Volume and Issue:
173, P. 108339 - 108339
Published: March 18, 2024
The
application
of
Artificial
Intelligence
(AI)
to
screen
drug
molecules
with
potential
therapeutic
effects
has
revolutionized
the
discovery
process,
significantly
lower
economic
cost
and
time
consumption
than
traditional
pipeline.
With
great
power
AI,
it
is
possible
rapidly
search
vast
chemical
space
for
drug-target
interactions
(DTIs)
between
candidate
disease
protein
targets.
However,
only
a
small
proportion
have
labelled
DTIs,
consequently
limiting
performance
AI-based
screening.
To
solve
this
problem,
machine
learning-based
approach
ability
generalize
DTI
prediction
across
desirable.
Many
existing
learning
approaches
identification
failed
exploit
full
information
respect
topological
structures
molecules.
develop
better
prediction,
we
propose
GraphormerDTI,
which
employs
powerful
Graph
Transformer
neural
network
model
molecular
structures.
GraphormerDTI
embeds
graphs
into
vector-format
representations
through
iterative
Transformer-based
message
passing,
encodes
molecules'
structural
characteristics
by
node
centrality
encoding,
spatial
encoding
edge
encoding.
strong
inductive
bias,
proposed
can
effectively
infer
informative
out-of-sample
as
such,
capable
predicting
DTIs
an
exceptional
performance.
integrates
1-dimensional
Convolutional
Neural
Network
(1D-CNN)
extract
drugs'
target
proteins'
leverages
attention
mechanism
them.
examine
GraphormerDTI's
conduct
experiments
on
three
benchmark
datasets,
where
achieves
superior
five
state-of-the-art
baselines
out-of-molecule
including
GNN-CPI,
GNN-PT,
DeepEmbedding-DTI,
MolTrans
HyperAttentionDTI,
par
best
baseline
transductive
prediction.
source
codes
datasets
are
publicly
accessible
at
https://github.com/mengmeng34/GraphormerDTI.
BMC Biology,
Journal Year:
2024,
Volume and Issue:
22(1)
Published: April 19, 2024
Abstract
Background
The
blood–brain
barrier
serves
as
a
critical
interface
between
the
bloodstream
and
brain
tissue,
mainly
composed
of
pericytes,
neurons,
endothelial
cells,
tightly
connected
basal
membranes.
It
plays
pivotal
role
in
safeguarding
from
harmful
substances,
thus
protecting
integrity
nervous
system
preserving
overall
homeostasis.
However,
this
remarkable
selective
transmission
also
poses
formidable
challenge
realm
central
diseases
treatment,
hindering
delivery
large-molecule
drugs
into
brain.
In
response
to
challenge,
many
researchers
have
devoted
themselves
developing
drug
systems
capable
breaching
barrier.
Among
these,
penetrating
peptides
emerged
promising
candidates.
These
had
advantages
high
biosafety,
ease
synthesis,
exceptional
penetration
efficiency,
making
them
an
effective
solution.
While
previous
studies
developed
few
prediction
models
for
peptides,
their
performance
has
often
been
hampered
by
issue
limited
positive
data.
Results
study,
we
present
Augur,
novel
model
using
borderline-SMOTE-based
data
augmentation
machine
learning.
extract
highly
interpretable
physicochemical
properties
while
solving
issues
small
sample
size
imbalance
negative
samples.
Experimental
results
demonstrate
superior
Augur
with
AUC
value
0.932
on
training
set
0.931
independent
test
set.
Conclusions
This
newly
demonstrates
predicting
offering
valuable
insights
development
targeting
neurological
disorders.
breakthrough
may
enhance
efficiency
peptide-based
discovery
pave
way
innovative
treatment
strategies
diseases.
BMC Biology,
Journal Year:
2024,
Volume and Issue:
22(1)
Published: Aug. 15, 2024
Plenty
of
clinical
and
biomedical
research
has
unequivocally
highlighted
the
tremendous
significance
human
microbiome
in
relation
to
health.
Identifying
microbes
associated
with
diseases
is
crucial
for
early
disease
diagnosis
advancing
precision
medicine.
Considering
that
information
about
changes
microbial
quantities
under
fine-grained
states
helps
enhance
a
comprehensive
understanding
overall
data
distribution,
this
study
introduces
MSignVGAE,
framework
predicting
microbe-disease
sign
associations
using
signed
message
propagation.
MSignVGAE
employs
graph
variational
autoencoder
model
noisy
association
extends
multi-scale
concept
representation
capabilities.
A
novel
strategy
propagating
networks
addresses
heterogeneity
consistency
among
nodes
connected
by
edges.
Additionally,
we
utilize
idea
denoising
handle
noise
similarity
feature
information,
which
overcome
biases
fused
data.
represents
as
heterogeneous
node
features.
The
multi-class
classifier
XGBoost
utilized
predict
between
microbes.
achieves
AUROC
AUPR
values
0.9742
0.9601,
respectively.
Case
studies
on
three
demonstrate
can
effectively
capture
distribution
leveraging
information.
ACS Omega,
Journal Year:
2024,
Volume and Issue:
unknown
Published: Feb. 8, 2024
In
biological
organisms,
metal
ion-binding
proteins
participate
in
numerous
metabolic
activities
and
are
closely
associated
with
various
diseases.
To
accurately
predict
whether
a
protein
binds
to
ions
the
type
of
protein,
this
study
proposed
classifier
named
MIBPred.
The
incorporated
advanced
Word2Vec
technology
from
field
natural
language
processing
extract
semantic
features
sequence
combined
them
position-specific
score
matrix
(PSSM)
features.
Furthermore,
an
ensemble
learning
model
was
employed
for
classification
task.
model,
we
independently
trained
XGBoost,
LightGBM,
CatBoost
algorithms
integrated
output
results
through
SVM
voting
mechanism.
This
innovative
combination
has
led
significant
breakthrough
predictive
performance
our
model.
As
result,
achieved
accuracies
95.13%
85.19%,
respectively,
predicting
their
types.
Our
research
not
only
confirms
effectiveness
extracting
information
sequences
but
also
highlights
outstanding
MIBPred
problem
provides
reliable
tool
method
in-depth
exploration
structure
function
proteins.
Bioinformatics,
Journal Year:
2024,
Volume and Issue:
40(5)
Published: May 1, 2024
Abstract
Motivation
Peptides
are
promising
agents
for
the
treatment
of
a
variety
diseases
due
to
their
specificity
and
efficacy.
However,
development
peptide-based
drugs
is
often
hindered
by
potential
toxicity
peptides,
which
poses
significant
barrier
clinical
application.
Traditional
experimental
methods
evaluating
peptide
time-consuming
costly,
making
process
inefficient.
Therefore,
there
an
urgent
need
computational
tools
specifically
designed
predict
accurately
rapidly,
facilitating
identification
safe
candidates
drug
development.
Results
We
provide
here
novel
approach,
CAPTP,
leverages
power
convolutional
self-attention
enhance
prediction
from
amino
acid
sequences.
CAPTP
demonstrates
outstanding
performance,
achieving
Matthews
correlation
coefficient
approximately
0.82
in
both
cross-validation
settings
on
independent
test
datasets.
This
performance
surpasses
that
existing
state-of-the-art
predictors.
Importantly,
maintains
its
robustness
generalizability
even
when
dealing
with
data
imbalances.
Further
analysis
reveals
certain
sequential
patterns,
particularly
head
central
regions
crucial
determining
toxicity.
insight
can
significantly
inform
guide
design
safer
drugs.
Availability
implementation
The
source
code
freely
available
at
https://github.com/jiaoshihu/CAPTP.
BMC Biology,
Journal Year:
2024,
Volume and Issue:
22(1)
Published: May 30, 2024
A
promoter
is
a
specific
sequence
in
DNA
that
has
transcriptional
regulatory
functions,
playing
role
initiating
gene
expression.
Identifying
promoters
and
their
strengths
can
provide
valuable
information
related
to
human
diseases.
In
recent
years,
computational
methods
have
gained
prominence
as
an
effective
means
for
identifying
promoter,
offering
more
efficient
alternative
labor-intensive
biological
approaches.