Augmented Memory: Capitalizing on Experience Replay to Accelerate De Novo Molecular Design DOI Creative Commons
Jeff Guo, Philippe Schwaller

Опубликована: Май 22, 2023

Sample efficiency is a fundamental challenge in de novo molecular design. Ideally, generative models should learn to satisfy desired objectives under minimal oracle evaluations (computational prediction or wet-lab experiment). This problem becomes more apparent when using oracles that can provide increased predictive accuracy but impose significant cost. Molecular have shown remarkable sample coupled with reinforcement learn- ing, as demonstrated the Practical Optimization (PMO) benchmark. Here, we propose novel algorithm called Augmented Memory combines data augmentation experience replay. We show scores obtained from calls be reused update model multiple times. compare previously proposed algorithms and significantly enhanced an exploitation task drug discovery case study requiring both exploration exploitation. Our method achieves new state-of-the-art PMO benchmark which enforces computational budget, outperforms previous best performing on 19/23 tasks.

Язык: Английский

Self-Driving Laboratories for Chemistry and Materials Science DOI Creative Commons
Gary Tom, Stefan P. Schmid, Sterling G. Baird

и другие.

Chemical Reviews, Год журнала: 2024, Номер 124(16), С. 9633 - 9732

Опубликована: Авг. 13, 2024

Self-driving laboratories (SDLs) promise an accelerated application of the scientific method. Through automation experimental workflows, along with autonomous planning, SDLs hold potential to greatly accelerate research in chemistry and materials discovery. This review provides in-depth analysis state-of-the-art SDL technology, its applications across various disciplines, implications for industry. additionally overview enabling technologies SDLs, including their hardware, software, integration laboratory infrastructure. Most importantly, this explores diverse range domains where have made significant contributions, from drug discovery science genomics chemistry. We provide a comprehensive existing real-world examples different levels automation, challenges limitations associated each domain.

Язык: Английский

Процитировано

56

Fake it until you make it? Generative de novo design and virtual screening of synthesizable molecules DOI Open Access

Megan Stanley,

Marwin Segler

Current Opinion in Structural Biology, Год журнала: 2023, Номер 82, С. 102658 - 102658

Опубликована: Июль 18, 2023

Язык: Английский

Процитировано

25

Directional multiobjective optimization of metal complexes at the billion-system scale DOI
Hannes Kneiding, Ainara Nova, David Balcells

и другие.

Nature Computational Science, Год журнала: 2024, Номер 4(4), С. 263 - 273

Опубликована: Март 29, 2024

Язык: Английский

Процитировано

15

A Deep Generative Model for the Inverse Design of Transition Metal Ligands and Complexes DOI Creative Commons

Magnus Strandgaard,

Trond Linjordet, Hannes Kneiding

и другие.

JACS Au, Год журнала: 2025, Номер 5(5), С. 2294 - 2308

Опубликована: Апрель 23, 2025

Deep generative models yielding transition metal complexes (TMCs) remain scarce despite the key role of these compounds in industrial catalytic processes, anticancer therapies, and energy transition. Compared to drug discovery within chemical space organic molecules, TMCs pose further challenges, including encoding bonds higher complexity need optimize multiple properties. In this work, we developed a model for inverse design ligands complexes, based on junction tree variational autoencoder (JT-VAE). After implementing SMILES-based metal-ligand bonds, was trained with tmQMg-L ligand library, allowing generation thousands novel, highly diverse monodentate (κ1) bidentate (κ2) ligands, imines, phosphines, carbenes. Further, generated were labeled two target properties reflecting stability electron density associated homoleptic iridium TMCs: HOMO-LUMO gap (ϵ) charge center (q Ir). This data used implement conditional that from prompt, single- or dual-objective optimizing either both ϵ q Ir interpretation optimization trajectories. The optimizations also had an impact other properties, dissociation energies oxidative addition barriers. A similar implemented condition by solubility steric bulk.

Язык: Английский

Процитировано

1

Machine Learning Strategies for Reaction Development: Toward the Low-Data Limit DOI
Eunjae Shim, Ambuj Tewari, Tim Cernak

и другие.

Journal of Chemical Information and Modeling, Год журнала: 2023, Номер 63(12), С. 3659 - 3668

Опубликована: Июнь 14, 2023

Machine learning models are increasingly being utilized to predict outcomes of organic chemical reactions. A large amount reaction data is used train these models, which in stark contrast how expert chemists discover and develop new reactions by leveraging information from a small number relevant transformations. Transfer active two strategies that can operate low-data situations, may help fill this gap promote the use machine for tackling real-world challenges synthesis. This Perspective introduces transfer connects potential opportunities directions further research, especially area prospective development

Язык: Английский

Процитировано

18

A genetic optimization strategy with generality in asymmetric organocatalysis as a primary target DOI Creative Commons
Simone Gallarati, Puck van Gerwen, Rubén Laplaza

и другие.

Chemical Science, Год журнала: 2024, Номер 15(10), С. 3640 - 3660

Опубликована: Янв. 1, 2024

A genetic optimization strategy to discover asymmetric organocatalysts with high activity and enantioselectivity across a broad substrate scope.

Язык: Английский

Процитировано

8

Augmenting Genetic Algorithms with Machine Learning for Inverse Molecular Design DOI Creative Commons
Hannes Kneiding, David Balcells

Chemical Science, Год журнала: 2024, Номер unknown

Опубликована: Янв. 1, 2024

Evolutionary and machine learning methods have been successfully applied to the generation of molecules materials exhibiting desired properties. The combination these two paradigms in inverse design tasks can yield powerful that explore massive chemical spaces more efficiently, improving quality generated compounds. However, such synergistic approaches are still an incipient area research appear underexplored literature. This perspective covers different ways incorporating into evolutionary frameworks, with overall goal increasing optimization efficiency genetic algorithms. In particular, surrogate models for faster fitness function evaluation, discriminator control population diversity on-the-fly, based crossover operations, evolution latent space discussed. further potential generative is also assessed, outlining promising directions future developments.

Язык: Английский

Процитировано

5

Augmented Memory: Sample-Efficient Generative Molecular Design with Reinforcement Learning DOI Creative Commons
Jeff Guo, Philippe Schwaller

JACS Au, Год журнала: 2024, Номер 4(6), С. 2160 - 2172

Опубликована: Апрель 10, 2024

Sample efficiency is a fundamental challenge in

Язык: Английский

Процитировано

3

Discovery of molybdenum based nitrogen fixation catalysts with genetic algorithms DOI Creative Commons

Magnus Strandgaard,

Julius Seumer, Jan H. Jensen

и другие.

Chemical Science, Год журнала: 2024, Номер 15(27), С. 10638 - 10650

Опубликована: Янв. 1, 2024

Using genetic algorithms and semiempirical quantum mechanical methods for discovery of nitrogen fixation catalysts.

Язык: Английский

Процитировано

3

Beyond predefined ligand libraries: a genetic algorithm approach for de novo discovery of catalysts for the Suzuki coupling reactions DOI Creative Commons
Julius Seumer, Jan H. Jensen

PeerJ Physical Chemistry, Год журнала: 2025, Номер 7, С. e34 - e34

Опубликована: Янв. 6, 2025

This study introduces a novel approach for the de novo design of transition metal catalysts, leveraging power genetic algorithms and density functional theory calculations. By focusing on Suzuki reaction, known its significance in forming carbon-carbon bonds, we demonstrate effectiveness fragment-based graph-based identifying ligands palladium-based catalytic systems. Our research highlights capability these to generate with desired thermodynamic properties, moving beyond restriction enumerated chemical libraries. Limitations applicability machine learning models are overcome by calculating properties from first principle. The inclusion synthetic accessibility scores further refines search, steering it towards more practically feasible ligands. Through examination both palladium alternative catalysts like copper silver, our findings reveal algorithms’ ability uncover unique catalyst structures within target energy range, offering insights into electronic steric effects necessary effective catalysis. work not only proves potential cost-effective scalable discovery new but also sets stage future exploration predefined spaces, enhancing toolkit available design.

Язык: Английский

Процитировано

0