AlphaFold2 and its applications in the fields of biology and medicine DOI Creative Commons
Zhenyu Yang, Xiaoxi Zeng, Yi Zhao

et al.

Signal Transduction and Targeted Therapy, Journal Year: 2023, Volume and Issue: 8(1)

Published: March 14, 2023

Abstract AlphaFold2 (AF2) is an artificial intelligence (AI) system developed by DeepMind that can predict three-dimensional (3D) structures of proteins from amino acid sequences with atomic-level accuracy. Protein structure prediction one the most challenging problems in computational biology and chemistry, has puzzled scientists for 50 years. The advent AF2 presents unprecedented progress protein attracted much attention. Subsequent release more than 200 million predicted further aroused great enthusiasm science community, especially fields medicine. thought to have a significant impact on structural research areas need information, such as drug discovery, design, function, et al. Though time not long since was developed, there are already quite few application studies medicine, many them having preliminarily proved potential AF2. To better understand promote its applications, we will this article summarize principle architecture well recipe success, particularly focus reviewing applications Limitations current also be discussed.

Language: Английский

Highly accurate protein structure prediction with AlphaFold DOI Creative Commons
John Jumper, Richard Evans, Alexander Pritzel

et al.

Nature, Journal Year: 2021, Volume and Issue: 596(7873), P. 583 - 589

Published: July 15, 2021

Abstract Proteins are essential to life, and understanding their structure can facilitate a mechanistic of function. Through an enormous experimental effort 1–4 , the structures around 100,000 unique proteins have been determined 5 but this represents small fraction billions known protein sequences 6,7 . Structural coverage is bottlenecked by months years painstaking required determine single structure. Accurate computational approaches needed address gap enable large-scale structural bioinformatics. Predicting three-dimensional that will adopt based solely on its amino acid sequence—the prediction component ‘protein folding problem’ 8 —has important open research problem for more than 50 9 Despite recent progress 10–14 existing methods fall far short atomic accuracy, especially when no homologous available. Here we provide first method regularly predict with accuracy even in cases which similar known. We validated entirely redesigned version our neural network-based model, AlphaFold, challenging 14th Critical Assessment Structure Prediction (CASP14) 15 demonstrating competitive majority greatly outperforming other methods. Underpinning latest AlphaFold novel machine learning approach incorporates physical biological knowledge about structure, leveraging multi-sequence alignments, into design deep algorithm.

Language: Английский

Citations

31046

ColabFold: making protein folding accessible to all DOI Creative Commons
Milot Mirdita, Konstantin Schütze, Yoshitaka Moriwaki

et al.

Nature Methods, Journal Year: 2022, Volume and Issue: 19(6), P. 679 - 682

Published: May 30, 2022

Abstract ColabFold offers accelerated prediction of protein structures and complexes by combining the fast homology search MMseqs2 with AlphaFold2 or RoseTTAFold. ColabFold’s 40−60-fold faster optimized model utilization enables close to 1,000 per day on a server one graphics processing unit. Coupled Google Colaboratory, becomes free accessible platform for folding. is open-source software available at https://github.com/sokrypton/ColabFold its novel environmental databases are https://colabfold.mmseqs.com .

Language: Английский

Citations

6749

Accurate prediction of protein structures and interactions using a three-track neural network DOI
Minkyung Baek, Frank DiMaio, Ivan Anishchenko

et al.

Science, Journal Year: 2021, Volume and Issue: 373(6557), P. 871 - 876

Published: July 15, 2021

Deep learning takes on protein folding In 1972, Anfinsen won a Nobel prize for demonstrating connection between protein’s amino acid sequence and its three-dimensional structure. Since 1994, scientists have competed in the biannual Critical Assessment of Structure Prediction (CASP) protein-folding challenge. methods took center stage at CASP14, with DeepMind’s Alphafold2 achieving remarkable accuracy. Baek et al . explored network architectures based DeepMind framework. They used three-track to process sequence, distance, coordinate information simultaneously achieved accuracies approaching those DeepMind. The method, RoseTTA fold, can solve challenging x-ray crystallography cryo–electron microscopy modeling problems generate accurate models protein-protein complexes. —VV

Language: Английский

Citations

4172

A guide to machine learning for biologists DOI
Joe G. Greener, Shaun M. Kandathil, Lewis Moffat

et al.

Nature Reviews Molecular Cell Biology, Journal Year: 2021, Volume and Issue: 23(1), P. 40 - 55

Published: Sept. 13, 2021

Language: Английский

Citations

1277

Fast and accurate protein structure search with Foldseek DOI Creative Commons
Michel van Kempen, Stephanie Kim, Charlotte Tumescheit

et al.

Nature Biotechnology, Journal Year: 2023, Volume and Issue: 42(2), P. 243 - 246

Published: May 8, 2023

Abstract As structure prediction methods are generating millions of publicly available protein structures, searching these databases is becoming a bottleneck. Foldseek aligns the query against database by describing tertiary amino acid interactions within proteins as sequences over structural alphabet. decreases computation times four to five orders magnitude with 86%, 88% and 133% sensitivities Dali, TM-align CE, respectively.

Language: Английский

Citations

1057

Accurate proteome-wide missense variant effect prediction with AlphaMissense DOI Open Access
Jun Cheng, Guido Novati,

Joshua Pan

et al.

Science, Journal Year: 2023, Volume and Issue: 381(6664)

Published: Sept. 19, 2023

The vast majority of missense variants observed in the human genome are unknown clinical significance. We present AlphaMissense, an adaptation AlphaFold fine-tuned on and primate variant population frequency databases to predict pathogenicity. By combining structural context evolutionary conservation, our model achieves state-of-the-art results across a wide range genetic experimental benchmarks, all without explicitly training such data. average pathogenicity score genes is also predictive for their cell essentiality, capable identifying short essential that existing statistical approaches underpowered detect. As resource community, we provide database predictions possible single amino acid substitutions classify 89% as either likely benign or pathogenic.

Language: Английский

Citations

882

VirSorter2: a multi-classifier, expert-guided approach to detect diverse DNA and RNA viruses DOI Creative Commons
Jiarong Guo, Benjamin Bolduc, Ahmed A. Zayed

et al.

Microbiome, Journal Year: 2021, Volume and Issue: 9(1)

Published: Feb. 1, 2021

Viruses are a significant player in many biosphere and human ecosystems, but most signals remain "hidden" metagenomic/metatranscriptomic sequence datasets due to the lack of universal gene markers, database representatives, insufficiently advanced identification tools.Here, we introduce VirSorter2, DNA RNA virus tool that leverages genome-informed advances across collection customized automatic classifiers improve accuracy range detection. When benchmarked against genomes from both isolated uncultivated viruses, VirSorter2 uniquely performed consistently with high (F1-score > 0.8) viral diversity, while all other tools under-detected viruses outside group represented reference databases (i.e., those order Caudovirales). Among evaluated, was also able minimize errors associated atypical cellular sequences including eukaryotic plasmids. Finally, as virosphere exploration unravels novel sequences, VirSorter2's modular design makes it inherently expand new types via maintain maximal sensitivity specificity.With multi-classifier design, demonstrates higher overall major groups will advance our knowledge evolution, virus-microbe interaction various ecosystems. Source code is freely available ( https://bitbucket.org/MAVERICLab/virsorter2 ), on bioconda an iVirus app CyVerse https://de.cyverse.org/de ). Video abstract.

Language: Английский

Citations

825

Improved prediction of protein-protein interactions using AlphaFold2 DOI Creative Commons
Patrick Bryant, Gabriele Pozzati, Arne Elofsson

et al.

Nature Communications, Journal Year: 2022, Volume and Issue: 13(1)

Published: March 10, 2022

Abstract Predicting the structure of interacting protein chains is a fundamental step towards understanding function. Unfortunately, no computational method can produce accurate structures complexes. AlphaFold2, has shown unprecedented levels accuracy in modelling single chain structures. Here, we apply AlphaFold2 for prediction heterodimeric We find that protocol together with optimised multiple sequence alignments, generate models acceptable quality (DockQ ≥ 0.23) 63% dimers. From predicted interfaces create simple function to predict DockQ score which distinguishes from incorrect as well non-interacting proteins state-of-art accuracy. that, using scores, identify 51% all pairs at 1% FPR.

Language: Английский

Citations

716

Macromolecular modeling and design in Rosetta: recent methods and frameworks DOI
Julia Koehler Leman, Brian D. Weitzner, Steven M. Lewis

et al.

Nature Methods, Journal Year: 2020, Volume and Issue: 17(7), P. 665 - 680

Published: June 1, 2020

Language: Английский

Citations

710

Dali server: structural unification of protein families DOI
Liisa Holm

Nucleic Acids Research, Journal Year: 2022, Volume and Issue: 50(W1), P. W210 - W215

Published: May 2, 2022

Protein structure is key to understanding biological function. Structure comparison deciphers deep phylogenies, providing insight into functional conservation and shifts during evolution. Until recently, structural coverage of the protein universe was limited by cost labour involved in experimental determination. Recent breakthroughs learning revolutionized bioinformatics accurate models numerous families for which no information existed. The Dali server 3D widely used crystallographers relate new structures pre-existing ones. Here, we report two most recent upgrades web server: (i) foldomes organisms AlphaFold Database (version 1) are searchable Dali, (ii) alignments annotated with families. Using these features, discovered a novel functionally diverse subgroup within WRKY/GCM1 clan. This accomplished linking structurally characterized SWI/SNF NAM as well CG-1 family uncharacterized proteins Gti1/Pac2, previously known member available at http://ekhidna2.biocenter.helsinki.fi/dali. website free open all users there login requirement.

Language: Английский

Citations

684