2
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Prediction of Antimalarial Drug-Decorated Nanoparticle Delivery Systems with Random Forest Models

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Drug-decorated nanoparticles (DDNPs) have important medical applications. The current work combined Perturbation Theory with Machine Learning and Information Fusion (PTMLIF). Thus, PTMLIF models were proposed to predict the probability of nanoparticle–compound/drug complexes having antimalarial activity (against Plasmodium). The aim is to save experimental resources and time by using a virtual screening for DDNPs. The raw data was obtained by the fusion of experimental data for nanoparticles with compound chemical assays from the ChEMBL database. The inputs for the eight Machine Learning classifiers were transformed features of drugs/compounds and nanoparticles as perturbations of molecular descriptors in specific experimental conditions (experiment-centered features). The resulting dataset contains 107 input features and 249,992 examples. The best classification model was provided by Random Forest, with 27 selected features of drugs/compounds and nanoparticles in all experimental conditions considered. The high performance of the model was demonstrated by the mean Area Under the Receiver Operating Characteristics (AUC) in a test subset with a value of 0.9921 ± 0.000244 (10-fold cross-validation). The results demonstrated the power of information fusion of the experimental-centered features of drugs/compounds and nanoparticles for the prediction of nanoparticle–compound antimalarial activity. The scripts and dataset for this project are available in the open GitHub repository.

          Related collections

          Most cited references34

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          ChEMBL web services: streamlining access to drug discovery data and utilities

          ChEMBL is now a well-established resource in the fields of drug discovery and medicinal chemistry research. The ChEMBL database curates and stores standardized bioactivity, molecule, target and drug data extracted from multiple sources, including the primary medicinal chemistry literature. Programmatic access to ChEMBL data has been improved by a recent update to the ChEMBL web services (version 2.0.x, https://www.ebi.ac.uk/chembl/api/data/docs), which exposes significantly more data from the underlying database and introduces new functionality. To complement the data-focused services, a utility service (version 1.0.x, https://www.ebi.ac.uk/chembl/api/utils/docs), which provides RESTful access to commonly used cheminformatics methods, has also been concurrently developed. The ChEMBL web services can be used together or independently to build applications and data processing workflows relevant to drug discovery and chemical biology.
            Bookmark
            • Record: found
            • Abstract: not found
            • Conference Proceedings: not found

            XGBoost: A scalable tree boosting system.

              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Letter to the editor: Stability of Random Forest importance measures.

              The goal of this article (letter to the editor) is to emphasize the value of exploring ranking stability when using the importance measures, mean decrease accuracy (MDA) and mean decrease Gini (MDG), provided by Random Forest. We illustrate with a real and a simulated example that ranks based on the MDA are unstable to small perturbations of the dataset and ranks based on the MDG provide more robust results.
                Bookmark

                Author and article information

                Journal
                Biology (Basel)
                Biology (Basel)
                biology
                Biology
                MDPI
                2079-7737
                30 July 2020
                August 2020
                : 9
                : 8
                : 198
                Affiliations
                [1 ]Department of Organic Chemistry II, University of Basque Country (UPV/EHU), Sarriena w/n, 48940 Leioa, Spain; diana_urista_marquez@ 123456hotmail.com (D.V.U.); sonia.arrasate@ 123456ehu.es (S.A.); humberto.gonzalezdiaz@ 123456ehu.es (H.G.-D.)
                [2 ]RNASA-IMEDIR, Computer Science Faculty, CITIC, University of A Coruna, Campus Elviña s/n, 15071 A Coruña, Spain; d.bcarrue@ 123456gmail.com (D.B.C.); i.otero.coto@ 123456gmail.com (I.O.); viviana.quevedo@ 123456udc.es (V.F.Q.-T.); marcos.gestal@ 123456udc.es (M.G.)
                [3 ]Universidad Estatal Amazónica UEA, Km. 2 1/2 vía Puyo a Tena (paso lateral), Puyo 160150, Pastaza, Ecuador
                [4 ]Biomedical Research Institute of A Coruña (INIBIC), Hospital Teresa Herrera, Xubias de Arriba 84, 15006 A Coruña, Spain
                [5 ]IKERBASQUE, Basque Foundation for Science, Alameda Urquijo 36, 48011 Bilbao, Spain
                [6 ]Basque Centre for Biophysics CSIC-UPVEHU, University of Basque Country UPV/EHU, Barrio Sarriena, 48940 Leioa, Spain
                Author notes
                [* ]Correspondence: c.munteanu@ 123456udc.es
                Author information
                https://orcid.org/0000-0002-7489-3775
                https://orcid.org/0000-0002-7924-693X
                https://orcid.org/0000-0002-4371-8632
                https://orcid.org/0000-0002-9392-2797
                https://orcid.org/0000-0002-5628-2268
                Article
                biology-09-00198
                10.3390/biology9080198
                7465777
                32751710
                43aac30b-b0ae-4dae-aae3-27605da776c5
                © 2020 by the authors.

                Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license ( http://creativecommons.org/licenses/by/4.0/).

                History
                : 24 June 2020
                : 27 July 2020
                Categories
                Article

                decorated nanoparticles,drug delivery,antimalarial compounds,big data,perturbation theory,machine learning,chembl database

                Comments

                Comment on this article