34
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: not found

      QSAR with experimental and predictive distributions: an information theoretic approach for assessing model quality

      research-article

      Read this article at

      ScienceOpenPublisherPMC
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          We propose that quantitative structure–activity relationship (QSAR) predictions should be explicitly represented as predictive (probability) distributions. If both predictions and experimental measurements are treated as probability distributions, the quality of a set of predictive distributions output by a model can be assessed with Kullback–Leibler (KL) divergence: a widely used information theoretic measure of the distance between two probability distributions. We have assessed a range of different machine learning algorithms and error estimation methods for producing predictive distributions with an analysis against three of AstraZeneca’s global DMPK datasets. Using the KL-divergence framework, we have identified a few combinations of algorithms that produce accurate and valid compound-specific predictive distributions. These methods use reliability indices to assign predictive distributions to the predictions output by QSAR models so that reliable predictions have tight distributions and vice versa. Finally we show how valid predictive distributions can be used to estimate the probability that a test compound has properties that hit single- or multi- objective target profiles.

          Electronic supplementary material

          The online version of this article (doi:10.1007/s10822-013-9639-5) contains supplementary material, which is available to authorized users.

          Related collections

          Author and article information

          Contributors
          davejwood@gmail.com
          Journal
          J Comput Aided Mol Des
          J. Comput. Aided Mol. Des
          Journal of Computer-Aided Molecular Design
          Springer Netherlands (Dordrecht )
          0920-654X
          1573-4951
          16 March 2013
          16 March 2013
          March 2013
          : 27
          : 3
          : 203-219
          Affiliations
          [ ]Novartis, Horsham, UK
          [ ]AstraZeneca, Mölndal, Sweden
          [ ]Lundbeck, Copenhagen, Denmark
          Article
          9639
          10.1007/s10822-013-9639-5
          3639359
          23504478
          e99cea39-eea8-47d4-ba44-6ba257402017
          © The Author(s) 2013

          Open AccessThis article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.

          History
          : 23 December 2012
          : 5 March 2013
          Categories
          Article
          Custom metadata
          © Springer Science+Business Media Dordrecht 2013

          Biomedical engineering
          quantitative structure–activity relationships,qsar,kullback–leibler divergence,predictive distributions,applicability domain,prediction errors,prediction confidence

          Comments

          Comment on this article