The
development
of
chiral
catalysts
that
can
provide
high
enantioselectivities
across
a
wide
assortment
substrates
or
reaction
range
is
priority
for
many
catalyst
design
efforts.
While
several
approaches
are
available
to
aid
in
the
identification
general
systems
there
currently
no
simple
procedure
directly
measuring
how
given
could
be.
Herein,
we
present
catalyst-agnostic
workflow
centered
on
unsupervised
machine
learning
enables
rapid
assessment
and
quantification
generality.
uses
curated
literature
data
sets
descriptors
visualize
cluster
chemical
space
coverage.
This
network
then
be
applied
derive
generality
metric
through
designer
equations
interfaced
with
other
regression
techniques
prediction.
As
validating
case
studies,
have
successfully
this
method
identify-through-quantification
most
chemotype
an
organocatalytic
asymmetric
Mannich
predicted
phosphoric
acid
addition
nucleophile
imines.
mechanistic
basis
gleaned
from
calculated
values
by
deconstructing
contributions
enantiomeric
excess
overall
result.
We
conclude
broadly
applicable
may
more
adaptative
changes
reactant
structure
because
enantioinduction
does
not
rely
single
set
noncovalent
interactions.
In
contrast,
some
work
engaging
robust
contacts
do
change
significantly
nature
when
component
altered.
Ultimately,
our
findings
represent
framework
interrogating
predicting
generality,
strategy
should
relevant
catalytic
widely
synthesis.
ACS Central Science,
Journal Year:
2023,
Volume and Issue:
9(12), P. 2196 - 2204
Published: Dec. 8, 2023
Models
can
codify
our
understanding
of
chemical
reactivity
and
serve
a
useful
purpose
in
the
development
new
synthetic
processes
via,
for
example,
evaluating
hypothetical
reaction
conditions
or
silico
substrate
tolerance.
Perhaps
most
determining
factor
is
composition
training
data
whether
it
sufficient
to
train
model
that
make
accurate
predictions
over
full
domain
interest.
Here,
we
discuss
design
datasets
ways
are
conducive
data-driven
modeling,
emphasizing
idea
set
diversity
generalizability
rely
on
choice
molecular
representation.
We
additionally
experimental
constraints
associated
with
generating
common
types
chemistry
how
these
considerations
should
influence
dataset
building.
Journal of the American Chemical Society,
Journal Year:
2023,
Volume and Issue:
145(23), P. 12870 - 12883
Published: June 2, 2023
The
development
of
chiral
catalysts
that
can
provide
high
enantioselectivities
across
a
wide
assortment
substrates
or
reaction
range
is
priority
for
many
catalyst
design
efforts.
While
several
approaches
are
available
to
aid
in
the
identification
general
systems,
there
currently
no
simple
procedure
directly
measuring
how
given
could
be.
Herein,
we
present
catalyst-agnostic
workflow
centered
on
unsupervised
machine
learning
enables
rapid
assessment
and
quantification
generality.
uses
curated
literature
data
sets
descriptors
visualize
cluster
chemical
space
coverage.
This
network
then
be
applied
derive
generality
metric
through
designer
equations
interfaced
with
other
regression
techniques
prediction.
As
validating
case
studies,
have
successfully
this
method
identify-through-quantification
most
chemotype
an
organocatalytic
asymmetric
Mannich
predicted
phosphoric
acid
addition
nucleophiles
imines.
mechanistic
basis
gleaned
from
calculated
values
by
deconstructing
contributions
enantiomeric
excess
overall
result.
Finally,
our
permitted
mechanistically
informative
screening
allow
experimentalists
rationally
select
highest
probability
achieving
good
result
first
round
development.
Overall,
findings
represent
framework
interrogating
generality,
strategy
should
relevant
catalytic
systems
widely
synthesis.
Scientific Data,
Journal Year:
2024,
Volume and Issue:
11(1)
Published: Jan. 19, 2024
Polycyclic
aromatic
systems
are
highly
important
to
numerous
applications,
in
particular
organic
electronics
and
optoelectronics.
High-throughput
screening
generative
models
that
can
help
identify
new
molecules
advance
these
technologies
require
large
amounts
of
high-quality
data,
which
is
expensive
generate.
In
this
report,
we
present
the
largest
freely
available
dataset
geometries
properties
cata-condensed
poly(hetero)cyclic
calculated
date.
Our
contains
~500k
comprising
11
types
antiaromatic
building
blocks
at
GFN1-xTB
level
representative
a
diverse
chemical
space.
We
detail
structure
enumeration
process
methods
used
provide
various
electronic
(including
HOMO-LUMO
gap,
adiabatic
ionization
potential,
electron
affinity).
Additionally,
benchmark
against
~50k
CAM-B3LYP-D3BJ/def2-SVP
develop
fitting
scheme
correct
xTB
values
higher
accuracy.
These
datasets
represent
second
installment
COMputational
database
Aromatic
Systems
(COMPAS)
Project.
The Journal of Physical Chemistry A,
Journal Year:
2025,
Volume and Issue:
unknown
Published: Feb. 27, 2025
The
extended
tight
binding
(xTB)
family
of
methods
opened
many
new
possibilities
in
the
field
computational
chemistry.
Within
just
5
years,
GFN2-xTB
parametrization
for
all
elements
up
to
Z
=
86
enabled
more
than
a
thousand
applications,
which
were
previously
not
feasible
with
other
electronic
structure
methods.
xTB
provide
robust
and
efficient
way
apply
quantum
mechanics-based
approaches
obtaining
molecular
geometries,
computing
free
energy
corrections
or
describing
noncovalent
interactions
found
applicability
targets.
A
crucial
contribution
success
is
availability
within
simulation
packages
frameworks,
supported
by
open
source
development
its
program
library
packages.
We
present
comprehensive
summary
applications
capabilities
different
fields
Moreover,
we
consider
main
software
calculations,
covering
their
current
ecosystem,
novel
features,
usage
scientific
community.
Digital Discovery,
Journal Year:
2024,
Volume and Issue:
3(8), P. 1638 - 1647
Published: Jan. 1, 2024
Exploiting
crystallographic
data
repositories
for
large-scale
quantum
chemical
computations
requires
the
rapid
and
accurate
extraction
of
molecular
structure,
charge
spin
from
information
file.
Here,
we
develop
a
general
approach
to
assign
ground
state
transition
metal
complexes,
in
complement
our
previous
efforts
on
determining
oxidation
states
bond
order
within