8
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Machine learning methods for protein-protein binding affinity prediction in protein design

      brief-report

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Protein-protein interactions govern a wide range of biological activity. A proper estimation of the protein-protein binding affinity is vital to design proteins with high specificity and binding affinity toward a target protein, which has a variety of applications including antibody design in immunotherapy, enzyme engineering for reaction optimization, and construction of biosensors. However, experimental and theoretical modelling methods are time-consuming, hinder the exploration of the entire protein space, and deter the identification of optimal proteins that meet the requirements of practical applications. In recent years, the rapid development in machine learning methods for protein-protein binding affinity prediction has revealed the potential of a paradigm shift in protein design. Here, we review the prediction methods and associated datasets and discuss the requirements and construction methods of binding affinity prediction models for protein design.

          Related collections

          Most cited references98

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          Highly accurate protein structure prediction with AlphaFold

          Proteins are essential to life, and understanding their structure can facilitate a mechanistic understanding of their function. Through an enormous experimental effort 1 – 4 , the structures of around 100,000 unique proteins have been determined 5 , but this represents a small fraction of the billions of known protein sequences 6 , 7 . Structural coverage is bottlenecked by the months to years of painstaking effort required to determine a single protein structure. Accurate computational approaches are needed to address this gap and to enable large-scale structural bioinformatics. Predicting the three-dimensional structure that a protein will adopt based solely on its amino acid sequence—the structure prediction component of the ‘protein folding problem’ 8 —has been an important open research problem for more than 50 years 9 . Despite recent progress 10 – 14 , existing methods fall far short of atomic accuracy, especially when no homologous structure is available. Here we provide the first computational method that can regularly predict protein structures with atomic accuracy even in cases in which no similar structure is known. We validated an entirely redesigned version of our neural network-based model, AlphaFold, in the challenging 14th Critical Assessment of protein Structure Prediction (CASP14) 15 , demonstrating accuracy competitive with experimental structures in a majority of cases and greatly outperforming other methods. Underpinning the latest version of AlphaFold is a novel machine learning approach that incorporates physical and biological knowledge about protein structure, leveraging multi-sequence alignments, into the design of the deep learning algorithm. AlphaFold predicts protein structures with an accuracy competitive with experimental structures in the majority of cases using a novel deep learning architecture.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Deep learning.

            Deep learning allows computational models that are composed of multiple processing layers to learn representations of data with multiple levels of abstraction. These methods have dramatically improved the state-of-the-art in speech recognition, visual object recognition, object detection and many other domains such as drug discovery and genomics. Deep learning discovers intricate structure in large data sets by using the backpropagation algorithm to indicate how a machine should change its internal parameters that are used to compute the representation in each layer from the representation in the previous layer. Deep convolutional nets have brought about breakthroughs in processing images, video, speech and audio, whereas recurrent nets have shone light on sequential data such as text and speech.
              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Support-vector networks

                Bookmark

                Author and article information

                Contributors
                Journal
                Front Bioinform
                Front Bioinform
                Front. Bioinform.
                Frontiers in Bioinformatics
                Frontiers Media S.A.
                2673-7647
                16 December 2022
                2022
                : 2
                : 1065703
                Affiliations
                [1] 1 Division of Cancer Systems Biology , Aichi Cancer Center Research Institute , Nagoya, Aichi, Japan
                [2] 2 Division of Cancer Informatics , Nagoya University Graduate School of Medicine , Nagoya, Aichi, Japan
                Author notes

                Edited by: Kenji Mizuguchi, Health and Nutrition, Japan

                Reviewed by: Sandeep Tiwari, Federal University of Minas Gerais, Brazil

                *Correspondence: Rui Yamaguchi, r.yamaguchi@ 123456aichi-cc.jp

                This article was submitted to Drug Discovery in Bioinformatics, a section of the journal Frontiers in Bioinformatics

                Article
                1065703
                10.3389/fbinf.2022.1065703
                9800603
                36591334
                2dc592f9-cf2b-471a-831a-a42ee9fef78b
                Copyright © 2022 Guo and Yamaguchi.

                This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

                History
                : 10 October 2022
                : 01 December 2022
                Funding
                Funded by: Japan Society for the Promotion of Science , doi 10.13039/501100001691;
                Award ID: 22K18003 21K19939
                Funded by: Uehara Memorial Foundation , doi 10.13039/100008732;
                Categories
                Bioinformatics
                Perspective

                machine learning,deep neural network,protein-protein interaction,binding affinity,protein design

                Comments

                Comment on this article