Machine learning methods for protein-protein binding affinity prediction in protein design

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

Protein-protein interactions govern a wide range of biological activity. A proper estimation of the protein-protein binding affinity is vital to design proteins with high specificity and binding affinity toward a target protein, which has a variety of applications including antibody design in immunotherapy, enzyme engineering for reaction optimization, and construction of biosensors. However, experimental and theoretical modelling methods are time-consuming, hinder the exploration of the entire protein space, and deter the identification of optimal proteins that meet the requirements of practical applications. In recent years, the rapid development in machine learning methods for protein-protein binding affinity prediction has revealed the potential of a paradigm shift in protein design. Here, we review the prediction methods and associated datasets and discuss the requirements and construction methods of binding affinity prediction models for protein design.

Related collections

Most cited references 98

Record: found
Abstract: found
Article: found

Is Open Access

Highly accurate protein structure prediction with AlphaFold

John Jumper, Richard Evans, Alexander Pritzel … (2021)

Proteins are essential to life, and understanding their structure can facilitate a mechanistic understanding of their function. Through an enormous experimental effort 1 – 4 , the structures of around 100,000 unique proteins have been determined 5 , but this represents a small fraction of the billions of known protein sequences 6 , 7 . Structural coverage is bottlenecked by the months to years of painstaking effort required to determine a single protein structure. Accurate computational approaches are needed to address this gap and to enable large-scale structural bioinformatics. Predicting the three-dimensional structure that a protein will adopt based solely on its amino acid sequence—the structure prediction component of the ‘protein folding problem’ 8 —has been an important open research problem for more than 50 years 9 . Despite recent progress 10 – 14 , existing methods fall far short of atomic accuracy, especially when no homologous structure is available. Here we provide the first computational method that can regularly predict protein structures with atomic accuracy even in cases in which no similar structure is known. We validated an entirely redesigned version of our neural network-based model, AlphaFold, in the challenging 14th Critical Assessment of protein Structure Prediction (CASP14) 15 , demonstrating accuracy competitive with experimental structures in a majority of cases and greatly outperforming other methods. Underpinning the latest version of AlphaFold is a novel machine learning approach that incorporates physical and biological knowledge about protein structure, leveraging multi-sequence alignments, into the design of the deep learning algorithm. AlphaFold predicts protein structures with an accuracy competitive with experimental structures in the majority of cases using a novel deep learning architecture.

0 comments Cited 10464 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

Deep learning.

Yann LeCun, Yoshua Bengio, Geoffrey E Hinton (2015)

Deep learning allows computational models that are composed of multiple processing layers to learn representations of data with multiple levels of abstraction. These methods have dramatically improved the state-of-the-art in speech recognition, visual object recognition, object detection and many other domains such as drug discovery and genomics. Deep learning discovers intricate structure in large data sets by using the backpropagation algorithm to indicate how a machine should change its internal parameters that are used to compute the representation in each layer from the representation in the previous layer. Deep convolutional nets have brought about breakthroughs in processing images, video, speech and audio, whereas recurrent nets have shone light on sequential data such as text and speech.

0 comments Cited 9296 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: not found
Article: not found

Support-vector networks

Corinna Cortes, Vladimir Vapnik (1995)

0 comments Cited 3059 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Contributors

Zhongliang Guo: URI : https://loop.frontiersin.org/people/2131572/overview

Rui Yamaguchi: URI : https://loop.frontiersin.org/people/894481/overview

Journal

Journal ID (nlm-ta): Front Bioinform

Journal ID (iso-abbrev): Front Bioinform

Journal ID (publisher-id): Front. Bioinform.

Title: Frontiers in Bioinformatics

Publisher: Frontiers Media S.A.

ISSN (Electronic): 2673-7647

Publication date (Electronic): 16 December 2022

Publication date Collection: 2022

Volume: 2

Electronic Location Identifier: 1065703

Affiliations

[1] ¹ Division of Cancer Systems Biology , Aichi Cancer Center Research Institute , Nagoya, Aichi, Japan

[2] ² Division of Cancer Informatics , Nagoya University Graduate School of Medicine , Nagoya, Aichi, Japan

Author notes

Edited by: Kenji Mizuguchi, Health and Nutrition, Japan

Reviewed by: Sandeep Tiwari, Federal University of Minas Gerais, Brazil

*Correspondence: Rui Yamaguchi, r.yamaguchi@ 123456aichi-cc.jp

This article was submitted to Drug Discovery in Bioinformatics, a section of the journal Frontiers in Bioinformatics

Article

Publisher ID: 1065703

DOI: 10.3389/fbinf.2022.1065703

PMC ID: 9800603

PubMed ID: 36591334

SO-VID: 2dc592f9-cf2b-471a-831a-a42ee9fef78b

License:

This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

History

Date received : 10 October 2022

Date accepted : 01 December 2022

Funding

Funded by: Japan Society for the Promotion of Science , doi 10.13039/501100001691;

Award ID: 22K18003 21K19939

Funded by: Uehara Memorial Foundation , doi 10.13039/100008732;

Comments

Comment on this article

scite_

Cited by 3

See all cited by

Most referenced authors 1,606

See all reference authors

Machine learning methods for protein-protein binding affinity prediction in protein design

Read this article at

Abstract

Related collections

Annual Reviews AI, Machine Learning, and Society

Most cited references 98

Highly accurate protein structure prediction with AlphaFold

Deep learning.

Support-vector networks

Author and article information

Contributors

Journal

Affiliations

Author notes

Article

History

Funding

Categories

Comments

Comment on this article

Similar content 163

Cited by 3

Most referenced authors 1,606