Data-Based Prediction of Redox Potentials via Introducing Chemical Features into the Transformer Architecture DOI
Zhan Si, Deguang Liu, Wan Nie

и другие.

Journal of Chemical Information and Modeling, Год журнала: 2024, Номер 64(22), С. 8453 - 8463

Опубликована: Ноя. 8, 2024

Rapid and accurate prediction of basic physicochemical parameters molecules will greatly accelerate the target-orientated design novel reactions materials but has been long challenging. Herein, a chemical language model-based deep learning method, TransChem, developed for redox potentials organic molecules. Embedding an effective molecular characterization (combining spatial electronic features), nonlinear messaging approach (Mol-Attention), perturbation shows high accuracy in predicting potential radicals comprising over 100,000 data (R2 > 0.97, MAE <0.09 V) is generalized to smaller 2,1,3-benzothiadiazole set (<3000 points) electron affinity (660 data) with low 0.07 V 0.18 eV, respectively. In this context, self-developed set, i.e., oxidation (OP) full-space disubstituted phenol (OPP-data total set: 74,529), predicted by TransChem high-throughput, active strategy. The rapid reliable OP could hopefully screening plausible reagents highly selective cross-coupling derivatives. This study presents important attempt guide modeling knowledge, while demonstrates state-of-the-art (SOTA) predictive performance on benchmark sets its better understanding conformational relationships.

Язык: Английский

Applying statistical modeling strategies to sparse datasets in synthetic chemistry DOI Creative Commons
Brittany C. Haas, Dipannita Kalyani, Matthew S. Sigman

и другие.

Science Advances, Год журнала: 2025, Номер 11(1)

Опубликована: Янв. 1, 2025

The application of statistical modeling in organic chemistry is emerging as a standard practice for probing structure-activity relationships and predictive tool many optimization objectives. This review aimed tutorial those entering the area chemistry. We provide case studies to highlight considerations approaches that can be used successfully analyze datasets low data regimes, common situation encountered given experimental demands Statistical hinges on (what being modeled), descriptors (how are represented), algorithms modeled). Herein, we focus how various reaction outputs (e.g., yield, rate, selectivity, solubility, stability, turnover number) structures binned, heavily skewed, distributed) influence choice algorithm constructing chemically insightful models.

Язык: Английский

Процитировано

3

Controlling Stereoselectivity with Noncovalent Interactions in Chiral Phosphoric Acid Organocatalysis DOI
Isaiah O. Betinol, Yutao Kuang,

Brian P. Mulley

и другие.

Chemical Reviews, Год журнала: 2025, Номер unknown

Опубликована: Март 18, 2025

Chiral phosphoric acids (CPAs) have emerged as highly effective Brønsted acid catalysts in an expanding range of asymmetric transformations, often through novel multifunctional substrate activation modes. Versatile and broadly appealing, these benefit from modular tunable structures, compatibility with additives. Given the unique types noncovalent interactions (NCIs) that can be established between CPAs various reactants─such hydrogen bonding, aromatic interactions, van der Waals forces─it is unsurprising catalyst systems become a promising approach for accessing diverse chiral product outcomes. This review aims to provide in-depth exploration mechanisms by which impart stereoselectivity, positioning NCIs central feature connects broad spectrum catalytic reactions. Spanning literature 2004 2024, it covers nucleophilic additions, radical atroposelective bond formations, highlighting applicability CPA organocatalysis. Special emphasis placed on structural mechanistic features govern CPA-substrate well tools techniques developed enhance our understanding their behavior. In addition emphasizing details stereocontrolling elements individual reactions, we carefully structured this natural progression specifics broader, class-level perspective. Overall, findings underscore critical role catalysis significant contributions advancing synthesis.

Язык: Английский

Процитировано

3

Materials Genes of CO2 Hydrogenation on Supported Cobalt Catalysts: An Artificial Intelligence Approach Integrating Theoretical and Experimental Data DOI Creative Commons
Ray Miyazaki, Kendra S. Belthle, Harun Tüysüz

и другие.

Journal of the American Chemical Society, Год журнала: 2024, Номер 146(8), С. 5433 - 5444

Опубликована: Фев. 20, 2024

Designing materials for catalysis is challenging because the performance governed by an intricate interplay of various multiscale phenomena, such as chemical reactions on surfaces and materials' restructuring during catalytic process. In case supported catalysts, role support material can be also crucial. Here, we address this intricacy challenge a symbolic-regression artificial intelligence (AI) approach. We identify key physicochemical parameters correlated with measured performance, out many offered candidate characterizing materials, reaction environment, possibly relevant underlying phenomena. Importantly, these are obtained both experiments ab initio simulations. The identified might called "materials genes", in analogy to genes biology: they correlate property or function interest, but explicit physical relationship not (necessarily) known. To demonstrate approach, investigate CO2 hydrogenation catalyzed cobalt nanoparticles silica. Crucially, silica modified additive metals magnesium, calcium, titanium, aluminum, zirconium, which results six significantly different performances. These systems mimic hydrothermal vents, have produced first organic molecules Earth. CH3OH selectivity reflect reducibility species, adsorption strength intermediates, nature metal. By using AI model trained basic elemental properties (e.g., ionization potential) parameters, new additives suggested. predicted catalysts vanadium zinc confirmed experiments.

Язык: Английский

Процитировано

17

Designing Target-specific Data Sets for Regioselectivity Predictions on Complex Substrates DOI Creative Commons
Jules Schleinitz, Alba Carretero‐Cerdán, Anjali Gurajapu

и другие.

Journal of the American Chemical Society, Год журнала: 2025, Номер 147(9), С. 7476 - 7484

Опубликована: Фев. 21, 2025

The development of machine learning models to predict the regioselectivity C(sp3)-H functionalization reactions is reported. A data set for dioxirane oxidations was curated from literature and used generate a model C-H oxidation. To assess whether smaller, intentionally designed sets could provide accuracy on complex targets, series acquisition functions were developed select most informative molecules specific target. Active learning-based that leverage predicted reactivity uncertainty found outperform those based molecular site similarity alone. use elaboration significantly reduced number points needed perform accurate prediction, it machine-designed can give predictions when larger, randomly selected fail. Finally, workflow experimentally validated five substrates shown be applicable predicting arene radical borylation. These studies quantitative alternative intuitive extrapolation "model substrates" frequently estimate molecules.

Язык: Английский

Процитировано

2

Chemoinformatic Catalyst Selection Methods for the Optimization of Copper–Bis(oxazoline)-Mediated, Asymmetric, Vinylogous Mukaiyama Aldol Reactions DOI
Casey L. Olen, Andrew F. Zahrt, Sean W. Reilly

и другие.

ACS Catalysis, Год журнала: 2024, Номер 14(4), С. 2642 - 2655

Опубликована: Фев. 6, 2024

A catalyst selection method for the optimization of an asymmetric, vinylogous Mukaiyama aldol reaction is described. large library commercially available and synthetically accessible copper–bis(oxazoline) catalysts was constructed in silico. Conformer-dependent, grid-based descriptors were calculated each catalyst, defining a chemical feature space suitable machine learning. Selection diverse subset produced initial training set 26 new bis(oxazoline) ligands that synthesized tested stereoselectivity copper-catalyzed, five substrate combinations. One ligand provided 88% average enantiomeric excess, exceeding performance identified through campaign. Supervised unsupervised methods, including quantitative structure–selectivity relationship modeling, nearest neighbors analysis, focused analogue clustering strategy, employed to identify additional 12 ligands. The selected outperformed hit four out product classes some cases demonstrated enantiocontrol 95% ee. effectiveness process discussed, expediency neighbor approaches are contrasted with supervised modeling approach.

Язык: Английский

Процитировано

5

C–H Aminoalkylation of 5-Membered Heterocycles: Influence of Descriptors, Data Set Size, and Data Quality on the Predictiveness of Machine Learning Models and Expansion of the Substrate Space Beyond 1,3-Azoles DOI
Stephanie Felten, Cyndi Qixin He, Marion H. Emmert

и другие.

The Journal of Organic Chemistry, Год журнала: 2025, Номер unknown

Опубликована: Фев. 11, 2025

We report a general C-H aminoalkylation of 5-membered heterocycles through combined machine learning/experimental workflow. Our work describes previously unknown functionalization reactivity and creates predictive learning (ML) model iterative refinement over 6 rounds active learning. The initial established with 1,3-azoles predicts the reactivities N-aryl indazoles, 1,2,4-triazolopyrazines, 1,2,3-thiadiazoles, 1,3,4-oxadiazoles, while other substrate classes (e.g., pyrazoles 1,2,4-triazoles) are not predicted well. final includes additional heterocyclic scaffolds in training data, which results high accuracy across all tested cores. prediction performance is shown both within set via cross-validation (CV R2 = 0.81) when predicting unseen substrates diverse molecular weight structure (Test 0.95). concept feature engineering discussed, we benchmark mechanistically related DFT-based features that more time-intensive laborious comparison descriptors fingerprints. Importantly, this establishes novel for methods underdeveloped. Since such key motifs drug discovery development, expect to be significant use synthetic synthesis-oriented ML communities.

Язык: Английский

Процитировано

0

Probability Guided Chemical Reaction Scopes DOI
Inbal Lorena Eshel,

Shahar Barkai,

Sergio Barranco

и другие.

Опубликована: Янв. 1, 2025

Язык: Английский

Процитировано

0

Computational Tools for the Prediction of Site- and Regioselectivity of Organic Reactions DOI Creative Commons
Lukas M. Sigmund,

Michele Assante,

Magnus J. Johansson

и другие.

Chemical Science, Год журнала: 2025, Номер unknown

Опубликована: Янв. 1, 2025

This article reviews computational tools for the prediction of regio- and site-selectivity organic reactions. It spans from quantum chemical procedures to deep learning models showcases application presented tools.

Язык: Английский

Процитировано

0

Optimising Materials Properties with Minimal Data: Lessons from Vanadium Catalyst Modelling DOI
José Ferraz-Caetano, Filipe Teixeira, M. Natália D. S. Cordeiro

и другие.

Challenges and advances in computational chemistry and physics, Год журнала: 2025, Номер unknown, С. 117 - 138

Опубликована: Янв. 1, 2025

Язык: Английский

Процитировано

0

Machine learning workflows beyond linear models in low-data regimes DOI Creative Commons
David Dalmau, Matthew S. Sigman, Juan V. Alegre‐Requena

и другие.

Chemical Science, Год журнала: 2025, Номер unknown

Опубликована: Янв. 1, 2025

This work presents automated non-linear workflows for studying problems in low-data regimes alongside traditional linear models.

Язык: Английский

Процитировано

0