A Review of Large Language Models and Autonomous Agents in Chemistry
Chemical Science,
Journal Year:
2024,
Volume and Issue:
unknown
Published: Dec. 9, 2024
Large
language
models
(LLMs)
have
emerged
as
powerful
tools
in
chemistry,
significantly
impacting
molecule
design,
property
prediction,
and
synthesis
optimization.
This
review
highlights
LLM
capabilities
these
domains
their
potential
to
accelerate
scientific
discovery
through
automation.
We
also
LLM-based
autonomous
agents:
LLMs
with
a
broader
set
of
interact
surrounding
environment.
These
agents
perform
diverse
tasks
such
paper
scraping,
interfacing
automated
laboratories,
planning.
As
are
an
emerging
topic,
we
extend
the
scope
our
beyond
chemistry
discuss
across
any
domains.
covers
recent
history,
current
capabilities,
design
agents,
addressing
specific
challenges,
opportunities,
future
directions
chemistry.
Key
challenges
include
data
quality
integration,
model
interpretability,
need
for
standard
benchmarks,
while
point
towards
more
sophisticated
multi-modal
enhanced
collaboration
between
experimental
methods.
Due
quick
pace
this
field,
repository
has
been
built
keep
track
latest
studies:
https://github.com/ur-whitelab/LLMs-in-science.
Language: Английский
T-ALPHA: A Hierarchical Transformer-Based Deep Neural Network for Protein–Ligand Binding Affinity Prediction with Uncertainty-Aware Self-Learning for Protein-Specific Alignment
Journal of Chemical Information and Modeling,
Journal Year:
2025,
Volume and Issue:
unknown
Published: Feb. 18, 2025
There
is
significant
interest
in
targeting
disease-causing
proteins
with
small
molecule
inhibitors
to
restore
healthy
cellular
states.
The
ability
accurately
predict
the
binding
affinity
of
molecules
a
protein
target
silico
enables
rapid
identification
candidate
and
facilitates
optimization
on-target
potency.
In
this
work,
we
present
T-ALPHA,
novel
deep
learning
model
that
enhances
protein–ligand
prediction
by
integrating
multimodal
feature
representations
within
hierarchical
transformer
framework
capture
information
critical
predicting
affinity.
T-ALPHA
outperforms
all
existing
models
reported
literature
on
multiple
benchmarks
designed
evaluate
scoring
functions.
Remarkably,
maintains
state-of-the-art
performance
when
utilizing
predicted
structures
rather
than
crystal
structures,
powerful
capability
real-world
drug
discovery
applications
where
experimentally
determined
are
often
unavailable
or
incomplete.
Additionally,
an
uncertainty-aware
self-learning
method
for
protein-specific
alignment
does
not
require
additional
experimental
data
demonstrate
it
improves
T-ALPHA's
rank
compounds
biologically
targets
such
as
SARS-CoV-2
main
protease
epidermal
growth
factor
receptor.
To
facilitate
implementation
reproducibility
results
presented
paper,
made
our
software
available
at
https://github.com/gregory-kyro/T-ALPHA.
Language: Английский
Effective drug-target affinity prediction via generative active learning
Information Sciences,
Journal Year:
2024,
Volume and Issue:
679, P. 121135 - 121135
Published: July 3, 2024
Language: Английский
Activity cliff-aware reinforcement learning for de novo drug design
Journal of Cheminformatics,
Journal Year:
2025,
Volume and Issue:
17(1)
Published: April 21, 2025
Language: Английский
Molecular property prediction using pretrained-BERT and Bayesian active learning: a data-efficient approach to drug design
Journal of Cheminformatics,
Journal Year:
2025,
Volume and Issue:
17(1)
Published: April 23, 2025
Language: Английский
CardioGenAI: A Machine Learning-Based Framework for Re-Engineering Drugs for Reduced hERG Liability
Research Square (Research Square),
Journal Year:
2024,
Volume and Issue:
unknown
Published: Sept. 10, 2024
Abstract
The
link
between
in
vitro
hERG
ion
channel
inhibition
and
subsequent
vivo
QT
interval
prolongation,
a
critical
risk
factor
for
the
development
of
arrythmias
such
as
Torsade
de
Pointes,
is
so
well
established
that
activity
alone
often
sufficient
to
end
an
otherwise
promising
drug
candidate.
It
therefore
tremendous
interest
develop
advanced
methods
identifying
hERG-active
compounds
early
stages
development,
proposing
redesigned
with
reduced
liability
preserved
primary
pharmacology.
In
this
work,
we
present
CardioGenAI,
machine
learning-based
framework
re-engineering
both
developmental
commercially
available
drugs
while
preserving
their
pharmacological
activity.
incorporates
novel
state-of-the-art
discriminative
models
predicting
activity,
against
voltage-gated
Na
V1.5
Ca
V1.2
channels
due
potential
implications
modulating
arrhythmogenic
induced
by
blockade.
We
applied
complete
pimozide,
FDA-approved
antipsychotic
agent
demonstrates
high
affinity
channel,
generated
100
refined
candidates.
Remarkably,
among
candidates
fluspirilene,
compound
which
same
class
pimozide
(diphenylmethanes)
has
similar
yet
exhibits
over
700-fold
weaker
binding
hERG.
Furthermore,
demonstrated
framework's
ability
optimize
hERG,
profiles
multiple
maintaining
physicochemical
nature
original
drugs.
envision
method
can
effectively
be
exhibiting
liabilities
provide
means
rescuing
programs
have
stalled
hERG-related
safety
concerns.
Additionally,
also
serve
independently
effective
components
virtual
screening
pipelines.
made
all
our
software
open-source
at
https://github.com/gregory-kyro/CardioGenAI
facilitate
integration
CardioGenAI
molecular
hypothesis
generation
into
discovery
workflows.
Language: Английский
CardioGenAI: a machine learning-based framework for re-engineering drugs for reduced hERG liability
Journal of Cheminformatics,
Journal Year:
2025,
Volume and Issue:
17(1)
Published: March 5, 2025
The
link
between
in
vitro
hERG
ion
channel
inhibition
and
subsequent
vivo
QT
interval
prolongation,
a
critical
risk
factor
for
the
development
of
arrythmias
such
as
Torsade
de
Pointes,
is
so
well
established
that
activity
alone
often
sufficient
to
end
an
otherwise
promising
drug
candidate.
It
therefore
tremendous
interest
develop
advanced
methods
identifying
hERG-active
compounds
early
stages
development,
proposing
redesigned
with
reduced
liability
preserved
primary
pharmacology.
In
this
work,
we
present
CardioGenAI,
machine
learning-based
framework
re-engineering
both
developmental
commercially
available
drugs
while
preserving
their
pharmacological
activity.
incorporates
novel
state-of-the-art
discriminative
models
predicting
activity,
against
voltage-gated
NaV1.5
CaV1.2
channels
due
potential
implications
modulating
arrhythmogenic
induced
by
blockade.
We
applied
complete
pimozide,
FDA-approved
antipsychotic
agent
demonstrates
high
affinity
channel,
generated
100
refined
candidates.
Remarkably,
among
candidates
fluspirilene,
compound
which
same
class
pimozide
(diphenylmethanes)
has
similar
yet
exhibits
over
700-fold
weaker
binding
hERG.
Furthermore,
demonstrated
framework's
ability
optimize
hERG,
profiles
multiple
maintaining
physicochemical
nature
original
drugs.
envision
method
can
effectively
be
exhibiting
liabilities
provide
means
rescuing
programs
have
stalled
hERG-related
safety
concerns.
Additionally,
also
serve
independently
effective
components
virtual
screening
pipelines.
made
all
our
software
open-source
at
https://github.com/gregory-kyro/CardioGenAI
facilitate
integration
CardioGenAI
molecular
hypothesis
generation
into
discovery
workflows.Scientific
contributionThis
work
introduces
designed
re-engineer
facing
challenges.
addition,
function
Language: Английский
Generative AI in drug discovery and development: the next revolution of drug discovery and development would be directed by generative AI
Annals of Medicine and Surgery,
Journal Year:
2024,
Volume and Issue:
86(10), P. 6340 - 6343
Published: Aug. 14, 2024
Language: Английский
T-ALPHA: A Hierarchical Transformer-Based Deep Neural Network for Protein-Ligand Binding Affinity Prediction With Uncertainty-Aware Self-Learning for Protein-Specific Alignment
bioRxiv (Cold Spring Harbor Laboratory),
Journal Year:
2024,
Volume and Issue:
unknown
Published: Dec. 20, 2024
Abstract
There
is
significant
interest
in
targeting
disease-causing
proteins
with
small
molecule
inhibitors
to
restore
healthy
cellular
states.
The
ability
accurately
predict
the
binding
affinity
of
molecules
a
protein
target
silico
enables
rapid
identification
candidate
and
facilitates
optimization
on-target
potency.
In
this
work,
we
present
T-ALPHA,
novel
deep
learning
model
that
enhances
protein-ligand
prediction
by
integrating
multimodal
feature
representations
within
hierarchical
transformer
framework
capture
information
critical
predicting
affinity.
T-ALPHA
outperforms
all
existing
models
reported
literature
on
multiple
benchmarks
designed
evaluate
scoring
functions.
Remarkably,
maintains
state-of-the-art
performance
when
utilizing
predicted
structures
rather
than
crystal
structures,
powerful
capability
real-world
drug
discovery
applications
where
experimentally
determined
are
often
unavailable
or
incomplete.
Additionally,
an
uncertainty-aware
self-learning
method
for
protein-specific
alignment
does
not
require
additional
experimental
data,
demonstrate
it
improves
T-ALPHA’s
rank
compounds
biologically
targets
such
as
SARS-CoV-2
main
protease
epidermal
growth
factor
receptor.
To
facilitate
implementation
reproducibility
results
presented
paper,
have
made
our
software
available
at
https://github.com/gregory-kyro/T-ALPHA
.
Language: Английский