Comparative evaluation of methods for the prediction of protein-ligand binding sites DOI Creative Commons
Javier S. Utgés, Geoffrey J. Barton

Research Square (Research Square), Год журнала: 2024, Номер unknown

Опубликована: Авг. 8, 2024

Abstract The accurate identification of protein-ligand binding sites is critical importance in understanding and modulating protein function. Accordingly, ligand site prediction has remained a research focus for over three decades with 50 methods developed since the early 1990s. Over this time, paradigm changed from geometry-based to machine learning. In work, we collate 11 predictors, spanning 30 years, focusing on latest learning-based such as VN-EGNN, IF-SitePred, GrASP, PUResNet, DeepPocket compare them established P2Rank or fpocket earlier like PocketFinder, Ligsite Surfnet. We benchmark against human subset new curated reference dataset, LIGYSIS. LIGYSIS comprehensive complex dataset comprising 30,000 proteins bound ligands which aggregates biologically relevant unique interfaces across biological units multiple structures same protein. an improvement testing datasets sc-PDB, PDBbind, MOAD, COACH420 HOLO4K either include 1:1 complexes consider asymmetric units. Re-scoring predictions by PRANK display highest recall (60%) whilst VN-EGNN (46%) IF-SitePred (39%) present lowest recall. demonstrate detrimental effect that redundant performance well beneficial impact stronger pocket scoring schemes, improvements up 14% (IF-SitePred) 30% precision (Surfnet). Methods predicting few pockets per protein, e.g., GrASP PUResNet are very precise (> 90%) but limited Finally, propose universal metric urge authors share not only source code their methods, also benchmark.

Язык: Английский

A comprehensive survey on protein-ligand binding site prediction DOI
Ying Xia, Xiaoyong Pan, Hong‐Bin Shen

и другие.

Current Opinion in Structural Biology, Год журнала: 2024, Номер 86, С. 102793 - 102793

Опубликована: Март 5, 2024

Язык: Английский

Процитировано

10

AI-Driven Parametrization of Michaelis-Menten Maximal Velocity: Advancing in silico New Approach Methodologies (NAMs) DOI Creative Commons
Achilleas Karakoltzidis,

Spyros Karakitsios,

Dimosthenis Sarigiannis

и другие.

NAM journal., Год журнала: 2025, Номер unknown, С. 100012 - 100012

Опубликована: Фев. 1, 2025

Язык: Английский

Процитировано

2

Ligandability and druggability assessment via machine learning DOI Creative Commons
Francesco Di Palma, Carlo Abate, Sergio Decherchi

и другие.

Wiley Interdisciplinary Reviews Computational Molecular Science, Год журнала: 2023, Номер 13(5)

Опубликована: Июнь 4, 2023

Abstract Drug discovery is a daunting and failure‐prone task. A critical process in this research field represented by the biological target pocket identification steps as they heavily determine subsequent efforts selecting putative ligand, most often small molecule. Finding “ligandable” pockets, namely protein cavities that may accept drug‐like binder instrumental to more general drug oriented “druggability” estimation process. While high‐throughput experimental techniques exist identify binding sites other than orthosteric one, these are relatively expensive not so commonly available labs. In regard, computational means of detecting ligandable pockets advisable for their inexpensiveness speed. These methods can become, principle, particularly predictive when supported machine learning methodologies provide modeling framework. As with any data‐driven effort, outcome critically depends on input data, its featurization possible associated biases. Also, task, (supervised/unsupervised) method, usage molecular dynamics data considerably shape inherent assumptions step. Defining proper quantitative thermodynamic and/or kinetic score (or label) key process; here we revise literature propose residence time novel ideal indicator ligandability. Interestingly vast majority does keep into consideration kinetics nor thermodynamics devising predictors. This article categorized under: Data Science > Artificial Intelligence/Machine Learning Structure Mechanism Computational Biochemistry Biophysics Chemoinformatics

Язык: Английский

Процитировано

12

AF2BIND: Predicting ligand-binding sites using the pair representation of AlphaFold2 DOI Creative Commons
Artem Gazizov,

Anna Lian,

Casper A. Goverde

и другие.

bioRxiv (Cold Spring Harbor Laboratory), Год журнала: 2023, Номер unknown

Опубликована: Окт. 18, 2023

Abstract Predicting ligand-binding sites, particularly in the absence of previously resolved homologous structures, presents a significant challenge structural biology. Here, we leverage internal pairwise representation AlphaFold2 (AF2) to train model, AF2BIND, accurately predict small-molecule-binding residues given only target protein. AF2BIND uses 20 “bait” amino acids optimally extract binding signal small-molecule ligand. We find that AF2 pair outperforms other neural-network representations for binding-site prediction. Moreover, unique combinations bait are correlated with chemical properties

Язык: Английский

Процитировано

9

A Point Cloud Graph Neural Network for Protein–Ligand Binding Site Prediction DOI Open Access
Yanpeng Zhao, Song He, Yuting Xing

и другие.

International Journal of Molecular Sciences, Год журнала: 2024, Номер 25(17), С. 9280 - 9280

Опубликована: Авг. 27, 2024

Predicting protein-ligand binding sites is an integral part of structural biology and drug design. A comprehensive understanding these essential for advancing innovation, elucidating mechanisms biological function, exploring the nature disease. However, accurately identifying remains a challenging task. To address this, we propose PGpocket, geometric deep learning-based framework to improve site prediction. Initially, protein surface converted into point cloud, then chemical properties each are calculated. Subsequently, cloud graph constructed based on inter-point distances, neural network (GNN) applied extract analyze information predict potential sites. PGpocket trained scPDB dataset, its performance verified two independent test sets, Coach420 HOLO4K. The results show that achieves 58% success rate dataset 56% HOLO4K dataset. These surpass competing algorithms, demonstrating PGpocket's advancement practicality

Язык: Английский

Процитировано

3

Comparative evaluation of methods for the prediction of protein–ligand binding sites DOI Creative Commons
Javier S. Utgés, Geoffrey J. Barton

Journal of Cheminformatics, Год журнала: 2024, Номер 16(1)

Опубликована: Ноя. 11, 2024

The accurate identification of protein-ligand binding sites is critical importance in understanding and modulating protein function. Accordingly, ligand site prediction has remained a research focus for over three decades with 50 methods developed change paradigm from geometry-based to machine learning. In this work, we collate 13 predictors, spanning 30 years, focusing on the latest learning-based such as VN-EGNN, IF-SitePred, GrASP, PUResNet, DeepPocket compare them established P2Rank, PRANK fpocket earlier like PocketFinder, Ligsite Surfnet. We benchmark against human subset our new curated reference dataset, LIGYSIS. LIGYSIS comprehensive complex dataset comprising 30,000 proteins bound ligands which aggregates biologically relevant unique interfaces across biological units multiple structures same protein. an improvement testing datasets sc-PDB, PDBbind, MOAD, COACH420 HOLO4K either include 1:1 complexes or consider asymmetric units. Re-scoring predictions by display highest recall (60%) whilst IF-SitePred presents lowest (39%). demonstrate detrimental effect that redundant performance well beneficial impact stronger pocket scoring schemes, improvements up 14% (IF-SitePred) 30% precision (Surfnet). Finally, propose top-N+2 universal metric urge authors share not only source code their methods, but also benchmark.Scientific contributionsThis study conducts largest date, comparing original 15 variants using 10 informative metrics. introduced, highlights demonstrates significant through schemes. proposed prediction, recommendation open-source sharing both benchmarks.

Язык: Английский

Процитировано

3

Improving Identification of Drug-Target Binding Sites Based on Structures of Targets Using Residual Graph Transformer Network DOI Creative Commons

Shuang‐Qing Lv,

Xin Zeng,

Guang-Peng Su

и другие.

Biomolecules, Год журнала: 2025, Номер 15(2), С. 221 - 221

Опубликована: Фев. 3, 2025

Improving identification of drug-target binding sites can significantly aid in drug screening and design, thereby accelerating the development process. However, due to challenges such as insufficient fusion multimodal information from targets imbalanced datasets, enhancing performance prediction models remains exceptionally difficult. Leveraging structures targets, we proposed a novel deep learning framework, RGTsite, which employed Residual Graph Transformer Network improve sites. First, residual 1D convolutional neural network (1D-CNN) pre-trained model ProtT5 were extract local global sequence features target, respectively. These then combined with physicochemical properties amino acid residues serve vertex graph. Next, edge incorporated, graph transformer (GTN) was applied more comprehensive features. Finally, fully connected used classify whether site. Experimental results showed that RGTsite outperformed existing state-of-the-art methods key evaluation metrics, F1-score (F1) Matthews Correlation Coefficient (MCC), across multiple benchmark datasets. Additionally, conducted interpretability analysis for through real-world cases, confirmed effectively identify practical applications.

Язык: Английский

Процитировано

0

Recent advances in AI-driven protein-ligand interaction predictions DOI Creative Commons

JaeYong Sim,

Dongwoo Kim, Bomin Kim

и другие.

Current Opinion in Structural Biology, Год журнала: 2025, Номер 92, С. 103020 - 103020

Опубликована: Фев. 24, 2025

Язык: Английский

Процитировано

0

Skittles: GNN‐Assisted Pseudo‐Ligands Generation and Its Application for Binding Sites Classification and Affinity Prediction DOI
Sergei A. Evteev, Alexey V. Ereshchenko,

Denis Adjugim

и другие.

Proteins Structure Function and Bioinformatics, Год журнала: 2025, Номер unknown

Опубликована: Фев. 28, 2025

ABSTRACT Nowadays, multiple solutions are known for identifying ligand–protein binding sites. Another important task is labeling each point of a site with the appropriate atom type, process as pseudo‐ligand generation. The number generation limited, and, to our knowledge, influence machine learning techniques has not been studied previously. Here, we describe Skittles, new graph neural network‐assisted approach, and compare it force‐field‐based methods. We also demonstrate application Skittles‐based data solving several problems in structural biology, including classification affinity prediction.

Язык: Английский

Процитировано

0

bindNode24: Competitive binding residue prediction with 60% smaller model DOI Creative Commons
Kyra Erckert,

Franz Birkeneder,

Burkhard Rost

и другие.

Computational and Structural Biotechnology Journal, Год журнала: 2025, Номер 27, С. 1060 - 1066

Опубликована: Янв. 1, 2025

Many proteins function through ligand binding. Yet, reliable experimental binding data remains limited. Recent advances predict residues from sequences using protein Language Model embeddings. The AlphaFold Protein Structure Database, which has 3D structure predictions AlphaFold2, opens the way for graph neural networks that residues. Here, we introduce bindNode24, a new method Graph Neural Networks to whether residue binds any of three classes: small molecules, metal ions, and nucleic macromolecules. Compared state-of-the-art, this approach reduces number free parameters by almost 60 % at similar performance. Our findings also suggest secondary tertiary features AlphaFold2 are easy integrate into prediction tasks previously solely relied on

Язык: Английский

Процитировано

0