AlphaFold and Protein Folding: Not Dead Yet! The Frontier Is Conformational Ensembles DOI
Gregory R. Bowman

Annual Review of Biomedical Data Science, Год журнала: 2024, Номер 7(1), С. 51 - 57

Опубликована: Апрель 11, 2024

Like the black knight in classic Monty Python movie, grand scientific challenges such as protein folding are hard to finish off. Notably, AlphaFold is revolutionizing structural biology by bringing highly accurate structure prediction masses and opening up innumerable new avenues of research. Despite this enormous success, calling prediction, much less related problems, “solved” dangerous, doing so could stymie further progress. Imagine what world would be like if we had declared flight solved after first commercial airlines opened stopped investing research development. Likewise, there still important limitations that benefit from addressing. Moreover, limited our understanding diversity different structures a single can adopt (called conformational ensemble) dynamics which explores space. What clear ensembles critical function, aspect will advance ability design proteins drugs.

Язык: Английский

Modeling conformational states of proteins with AlphaFold DOI Creative Commons
Davide Sala, Felipe Engelberger, Hassane S. Mchaourab

и другие.

Current Opinion in Structural Biology, Год журнала: 2023, Номер 81, С. 102645 - 102645

Опубликована: Июнь 29, 2023

Язык: Английский

Процитировано

110

Direct prediction of intrinsically disordered protein conformational properties from sequence DOI Creative Commons
Jeffrey M. Lotthammer, Garrett M. Ginell, Daniel Griffith

и другие.

Nature Methods, Год журнала: 2024, Номер 21(3), С. 465 - 476

Опубликована: Янв. 31, 2024

Abstract Intrinsically disordered regions (IDRs) are ubiquitous across all domains of life and play a range functional roles. While folded generally well described by stable three-dimensional structure, IDRs exist in collection interconverting states known as an ensemble. This structural heterogeneity means that largely absent from the Protein Data Bank, contributing to lack computational approaches predict ensemble conformational properties sequence. Here we combine rational sequence design, large-scale molecular simulations deep learning develop ALBATROSS, deep-learning model for predicting dimensions IDRs, including radius gyration, end-to-end distance, polymer-scaling exponent asphericity, directly sequences at proteome-wide scale. ALBATROSS is lightweight, easy use accessible both locally installable software package point-and-click-style interface via Google Colab notebooks. We first demonstrate applicability our predictors examining generalizability sequence–ensemble relationships IDRs. Then, leverage high-throughput nature characterize sequence-specific biophysical behavior within between proteomes.

Язык: Английский

Процитировано

105

High-throughput prediction of protein conformational distributions with subsampled AlphaFold2 DOI Creative Commons
Gabriel Monteiro da Silva, Jennifer Y. Cui, David C. Dalgarno

и другие.

Nature Communications, Год журнала: 2024, Номер 15(1)

Опубликована: Март 27, 2024

Abstract This paper presents an innovative approach for predicting the relative populations of protein conformations using AlphaFold 2, AI-powered method that has revolutionized biology by enabling accurate prediction structures. While 2 shown exceptional accuracy and speed, it is designed to predict proteins’ ground state limited in its ability conformational landscapes. Here, we demonstrate how can directly different subsampling multiple sequence alignments. We tested our against nuclear magnetic resonance experiments on two proteins with drastically amounts available data, Abl1 kinase granulocyte-macrophage colony-stimulating factor, predicted changes their more than 80% accuracy. Our worked best when used qualitatively effects mutations or evolution landscape well-populated states proteins. It thus offers a fast cost-effective way at even single-point mutation resolution, making useful tool pharmacology, analysis experimental results, evolution.

Язык: Английский

Процитировано

78

Enhanced Sampling with Machine Learning DOI
Mehdi Shams, Zachary A. Smith, Lukas Herron

и другие.

Annual Review of Physical Chemistry, Год журнала: 2024, Номер 75(1), С. 347 - 370

Опубликована: Фев. 22, 2024

Molecular dynamics (MD) enables the study of physical systems with excellent spatiotemporal resolution but suffers from severe timescale limitations. To address this, enhanced sampling methods have been developed to improve exploration configurational space. However, implementing these is challenging and requires domain expertise. In recent years, integration machine learning (ML) techniques into different domains has shown promise, prompting their adoption in as well. Although ML often employed various fields primarily due its data-driven nature, more natural many common underlying synergies. This review explores merging MD by presenting shared viewpoints. It offers a comprehensive overview this rapidly evolving field, which can be difficult stay updated on. We highlight successful strategies such dimensionality reduction, reinforcement learning, flow-based methods. Finally, we discuss open problems at exciting ML-enhanced interface.

Язык: Английский

Процитировано

48

Navigating the landscape of enzyme design: from molecular simulations to machine learning DOI Creative Commons
Jiahui Zhou, Meilan Huang

Chemical Society Reviews, Год журнала: 2024, Номер 53(16), С. 8202 - 8239

Опубликована: Янв. 1, 2024

Global environmental issues and sustainable development call for new technologies fine chemical synthesis waste valorization. Biocatalysis has attracted great attention as the alternative to traditional organic synthesis. However, it is challenging navigate vast sequence space identify those proteins with admirable biocatalytic functions. The recent of deep-learning based structure prediction methods such AlphaFold2 reinforced by different computational simulations or multiscale calculations largely expanded 3D databases enabled structure-based design. While approaches shed light on site-specific enzyme engineering, they are not suitable large-scale screening potential biocatalysts. Effective utilization big data using machine learning techniques opens up a era accelerated predictions. Here, we review applications machine-learning guided We also provide our view challenges perspectives effectively employing design integrating molecular learning, importance database construction algorithm in attaining predictive ML models explore fitness landscape

Язык: Английский

Процитировано

26

Approximating Projections of Conformational Boltzmann Distributions with AlphaFold2 Predictions: Opportunities and Limitations DOI Creative Commons
Benjamin P. Brown, Richard A. Stein, Jens Meiler

и другие.

Journal of Chemical Theory and Computation, Год журнала: 2024, Номер 20(3), С. 1434 - 1447

Опубликована: Янв. 12, 2024

Protein thermodynamics is intimately tied to biological function and can enable processes such as signal transduction, enzyme catalysis, molecular recognition. The relative free energies of conformations that contribute these functional equilibria evolved for the physiology organism. Despite importance understanding developing treatments disease, computational experimental methods capable quantifying energetic determinants are limited systems modest size. Recently, it has been demonstrated artificial intelligence system AlphaFold2 be manipulated produce structurally valid protein conformational ensembles. Here, we extend studies explore extent which contact distance distributions approximate projections Boltzmann distributions. For this purpose, examine joint probability inter-residue distances along functionally relevant collective variables several systems. Our suggest normalized correlate with conformation probabilities obtained other but they suffer from peak broadening. We also find sensitive point mutations. Overall, anticipate our findings will valuable community seeks model changes in large biomolecular

Язык: Английский

Процитировано

20

AlphaFold prediction of structural ensembles of disordered proteins DOI Creative Commons
Z. Faidon Brotzakis, Shengyu Zhang, Mhd Hussein Murtada

и другие.

Nature Communications, Год журнала: 2025, Номер 16(1)

Опубликована: Фев. 14, 2025

Abstract Deep learning methods of predicting protein structures have reached an accuracy comparable to that high-resolution experimental methods. It is thus possible generate accurate models the native states hundreds millions proteins. An open question, however, concerns whether these advances can be translated disordered proteins, which should represented as structural ensembles because their heterogeneous and dynamical nature. To address this problem, we introduce AlphaFold-Metainference method use AlphaFold-derived distances restraints in molecular dynamics simulations construct ordered The results obtained using illustrate possibility making predictions conformational properties proteins deep trained on large databases available for folded

Язык: Английский

Процитировано

6

AlphaFold2 and Deep Learning for Elucidating Enzyme Conformational Flexibility and Its Application for Design DOI Creative Commons
Guillem Casadevall, Cristina Duran, Sílvia Osuna

и другие.

JACS Au, Год журнала: 2023, Номер 3(6), С. 1554 - 1562

Опубликована: Июнь 6, 2023

The recent success of AlphaFold2 (AF2) and other deep learning (DL) tools in accurately predicting the folded three-dimensional (3D) structure proteins enzymes has revolutionized structural biology protein design fields. 3D indeed reveals key information on arrangement catalytic machinery which elements gate active site pocket. However, comprehending enzymatic activity requires a detailed knowledge chemical steps involved along cycle exploration multiple thermally accessible conformations that adopt when solution. In this Perspective, some studies showing potential AF2 elucidating conformational landscape are provided. Selected examples developments AF2-based DL methods for discussed, as well few enzyme cases. These show allowing routine computational efficient enzymes.

Язык: Английский

Процитировано

34

Exploring Kinase Asp-Phe-Gly (DFG) Loop Conformational Stability with AlphaFold2-RAVE DOI
Bodhi P. Vani, Akashnathan Aranganathan, Pratyush Tiwary

и другие.

Journal of Chemical Information and Modeling, Год журнала: 2023, Номер 64(7), С. 2789 - 2797

Опубликована: Ноя. 20, 2023

Kinases compose one of the largest fractions human proteome, and their misfunction is implicated in many diseases, particular, cancers. The ubiquitousness structural similarities kinases make specific effective drug design difficult. In conformational variability due to evolutionarily conserved Asp-Phe-Gly (DFG) motif adopting out conformations relative stabilities thereof are key structure-based for ATP competitive drugs. These extremely sensitive small changes sequence provide an important problem sampling method development. Since invention AlphaFold2, world has noticeably changed. spite it being limited crystal-like structure prediction, several methods have also leveraged its underlying architecture improve dynamics enhanced ensembles, including AlphaFold2-RAVE. Here, we extend AlphaFold2-RAVE apply a set kinases: wild type DDR1 three mutants with single point mutations that known behave drastically differently. We show able efficiently recover stability using transferable learned order parameters potentials, thereby supplementing AlphaFold2 as tool exploration Boltzmann-weighted protein (Meller, A.; Bhakat, S.; Solieva, Bowman, G. R. Accelerating Cryptic Pocket Discovery Using AlphaFold. J. Chem. Theory Comput. 2023, 19, 4355–4363).

Язык: Английский

Процитировано

24

Information Bottleneck Approach for Markov Model Construction DOI
Dedi Wang, Yunrui Qiu, Eric R. Beyerle

и другие.

Journal of Chemical Theory and Computation, Год журнала: 2024, Номер 20(12), С. 5352 - 5367

Опубликована: Июнь 11, 2024

Markov state models (MSMs) have proven valuable in studying dynamics of protein conformational changes via statistical analysis molecular (MD) simulations. In MSMs, the complex configuration space is coarse-grained into states, with modeled by a series Markovian transitions among these states at discrete lag times. Constructing model specific time necessitates defining that circumvent significant internal energy barriers, enabling relaxation within time. This process effectively coarse-grains and space, integrating out rapid motions metastable states. Thus, MSMs possess multi-resolution nature, where granularity can be adjusted according to time-resolution, offering flexibility capturing system dynamics. work introduces continuous embedding approach for conformations using predictive information bottleneck (SPIB), framework unifies dimensionality reduction partitioning continuous, machine learned basis set. Without explicit optimization VAMP-based scores, SPIB demonstrates state-of-the-art performance identifying slow dynamical processes constructing models. Through applications well-validated mini-proteins, showcases unique advantages compared competing methods. It autonomously self-consistently adjusts number based on specified minimal resolution, eliminating need manual tuning. While maintaining efficacy properties, excels accurately distinguishing numerous well-populated macrostates. contrasts existing methods, which often emphasize expense incorporating sparsely populated Furthermore, SPIB's ability learn low-dimensional underlying enhances interpretation dynamic pathways. With benefits, we propose as an easy-to-implement methodology end-to-end construction.

Язык: Английский

Процитировано

12