11
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      DrugEx v3: scaffold-constrained drug design with graph transformer-based reinforcement learning

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Rational drug design often starts from specific scaffolds to which side chains/substituents are added or modified due to the large drug-like chemical space available to search for novel drug-like molecules. With the rapid growth of deep learning in drug discovery, a variety of effective approaches have been developed for de novo drug design. In previous work we proposed a method named DrugEx, which can be applied in polypharmacology based on multi-objective deep reinforcement learning. However, the previous version is trained under fixed objectives and does not allow users to input any prior information ( i.e. a desired scaffold). In order to improve the general applicability, we updated DrugEx to design drug molecules based on scaffolds which consist of multiple fragments provided by users. Here, a  Transformer model was employed to generate molecular structures. The Transformer is a multi-head self-attention deep learning model containing an encoder to receive scaffolds as input and a decoder to generate molecules as output. In order to deal with the graph representation of molecules a novel positional encoding for each atom and bond based on an adjacency matrix was proposed, extending the architecture of the Transformer. The graph Transformer model contains growing and connecting procedures for molecule generation starting from  a given scaffold based on fragments. Moreover, the generator was trained under a reinforcement learning framework to increase the number of desired ligands. As a proof of concept, the method was applied to design ligands for the adenosine A 2A receptor (A 2AAR) and compared with SMILES-based methods. The results show that 100% of the generated molecules are valid and most of them had a high predicted affinity value towards A 2AAR with given scaffolds.

          Supplementary Information

          The online version contains supplementary material available at 10.1186/s13321-023-00694-z.

          Related collections

          Most cited references30

          • Record: found
          • Abstract: found
          • Article: not found

          Deep learning.

          Deep learning allows computational models that are composed of multiple processing layers to learn representations of data with multiple levels of abstraction. These methods have dramatically improved the state-of-the-art in speech recognition, visual object recognition, object detection and many other domains such as drug discovery and genomics. Deep learning discovers intricate structure in large data sets by using the backpropagation algorithm to indicate how a machine should change its internal parameters that are used to compute the representation in each layer from the representation in the previous layer. Deep convolutional nets have brought about breakthroughs in processing images, video, speech and audio, whereas recurrent nets have shone light on sequential data such as text and speech.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Automatic Chemical Design Using a Data-Driven Continuous Representation of Molecules

            We report a method to convert discrete representations of molecules to and from a multidimensional continuous representation. This model allows us to generate new molecules for efficient exploration and optimization through open-ended spaces of chemical compounds. A deep neural network was trained on hundreds of thousands of existing chemical structures to construct three coupled functions: an encoder, a decoder, and a predictor. The encoder converts the discrete representation of a molecule into a real-valued continuous vector, and the decoder converts these continuous vectors back to discrete molecular representations. The predictor estimates chemical properties from the latent continuous vector representation of the molecule. Continuous representations of molecules allow us to automatically generate novel chemical structures by performing simple operations in the latent space, such as decoding random vectors, perturbing known chemical structures, or interpolating between molecules. Continuous representations also allow the use of powerful gradient-based optimization to efficiently guide the search for optimized functional compounds. We demonstrate our method in the domain of drug-like molecules and also in a set of molecules with fewer that nine heavy atoms.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              ChEMBL: a large-scale bioactivity database for drug discovery

              ChEMBL is an Open Data database containing binding, functional and ADMET information for a large number of drug-like bioactive compounds. These data are manually abstracted from the primary published literature on a regular basis, then further curated and standardized to maximize their quality and utility across a wide range of chemical biology and drug-discovery research problems. Currently, the database contains 5.4 million bioactivity measurements for more than 1 million compounds and 5200 protein targets. Access is available through a web-based interface, data downloads and web services at: https://www.ebi.ac.uk/chembldb.
                Bookmark

                Author and article information

                Contributors
                x.liu@lacdr.leidenuniv.nl
                kaiye@xjtu.edu.cn
                hvvlijme@its.jnj.com
                ijzerman@lacdr.leidenuniv.nl
                gerard@lacdr.leidenuniv.nl
                Journal
                J Cheminform
                J Cheminform
                Journal of Cheminformatics
                Springer International Publishing (Cham )
                1758-2946
                20 February 2023
                20 February 2023
                2023
                : 15
                : 24
                Affiliations
                [1 ]GRID grid.5132.5, ISNI 0000 0001 2312 1970, Drug Discovery and Safety, , Leiden Academic Centre for Drug Research, ; Einsteinweg 55, Leiden, The Netherlands
                [2 ]GRID grid.43169.39, ISNI 0000 0001 0599 1243, School of Electrics and Information Engineering, , Xi’an Jiaotong University, ; 28 XianningW Rd, Xi’an, China
                [3 ]GRID grid.419619.2, ISNI 0000 0004 0623 0341, Janssen Pharmaceutica NV, ; Turnhoutseweg 30, B-2340 Beerse, Belgium
                Article
                694
                10.1186/s13321-023-00694-z
                9940339
                36803659
                2727d9f3-5f35-4599-abb0-7b512aee4298
                © The Author(s) 2023

                Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

                History
                : 9 April 2022
                : 6 February 2023
                Funding
                Funded by: Chinese Scholarship Council
                Funded by: FundRef http://dx.doi.org/10.13039/501100003246, Nederlandse Organisatie voor Wetenschappelijk Onderzoek;
                Award ID: STW-Veni #14410
                Award Recipient :
                Categories
                Research
                Custom metadata
                © The Author(s) 2023

                Chemoinformatics
                deep learning,reinforcement learning,policy gradient,drug design,transformer,multi-objective optimization,adenosine a2a receptor

                Comments

                Comment on this article