17
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Automated biomarker candidate discovery in imaging mass spectrometry data through spatially localized Shapley additive explanations

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          The search for molecular species that are differentially expressed between biological states is an important step towards discovering promising biomarker candidates. In imaging mass spectrometry (IMS), performing this search manually is often impractical due to the large size and high-dimensionality of IMS datasets. Instead, we propose an interpretable machine learning workflow that automatically identifies biomarker candidates by their mass-to-charge ratios, and that quantitatively estimates their relevance to recognizing a given biological class using Shapley additive explanations (SHAP). The task of biomarker candidate discovery is translated into a feature ranking problem: given a classification model that assigns pixels to different biological classes on the basis of their mass spectra, the molecular species that the model uses as features are ranked in descending order of relative predictive importance such that the top-ranking features have a higher likelihood of being useful biomarkers. Besides providing the user with an experiment-wide measure of a molecular species’ biomarker potential, our workflow delivers spatially localized explanations of the classification model’s decision-making process in the form of a novel representation called SHAP maps. SHAP maps deliver insight into the spatial specificity of biomarker candidates by highlighting in which regions of the tissue sample each feature provides discriminative information and in which regions it does not. SHAP maps also enable one to determine whether the relationship between a biomarker candidate and a biological state of interest is correlative or anticorrelative. Our automated approach to estimating a molecular species’ potential for characterizing a user-provided biological class, combined with the untargeted and multiplexed nature of IMS, allows for the rapid screening of thousands of molecular species and the obtention of a broader biomarker candidate shortlist than would be possible through targeted manual assessment. Our biomarker candidate discovery workflow is demonstrated on mouse-pup and rat kidney case studies.

          Graphical Abstract

          Related collections

          Most cited references79

          • Record: found
          • Abstract: not found
          • Article: not found

          Random Forests

            Bookmark
            • Record: found
            • Abstract: not found
            • Conference Proceedings: not found

            XGBoost

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Greedy function approximation: A gradient boosting machine.

                Bookmark

                Author and article information

                Journal
                0370534
                520
                Anal Chim Acta
                Anal Chim Acta
                Analytica chimica acta
                0003-2670
                1873-4324
                18 April 2023
                08 September 2021
                26 April 2021
                24 April 2023
                : 1177
                : 338522
                Affiliations
                [a ]Delft Center for Systems and Control, Delft University of Technology, Delft, Netherlands
                [b ]Mass Spectrometry Research Center, Vanderbilt University, Nashville, TN, USA
                [c ]Department of Biochemistry, Vanderbilt University, Nashville, TN, USA
                [d ]Department of Chemistry, Vanderbilt University, Nashville, TN, USA
                [e ]Department of Pharmacology, Vanderbilt University, Nashville, TN, USA
                [f ]Department of Medicine, Vanderbilt University, Nashville, TN, USA
                Author notes
                [* ]Corresponding author. Delft Center for Systems and Control (Delft University of Technology, 3ME faculty, Mekelweg 2 - Gebouw 34, 2628 CD, Delft, the Netherlands). raf.vandeplas@ 123456tudelft.nl (R. Van de Plas).
                Article
                NIHMS1892535
                10.1016/j.aca.2021.338522
                10124144
                34482894
                fad93796-f8f2-4481-8e50-4a01c9aaad71

                This is an open access article under the CC BY license ( http://creativecommons.org/licenses/by/4.0/).

                History
                Categories
                Article

                Analytical chemistry
                supervised machine learning,model interpretability,biomarker discovery,imaging mass spectrometry,shapley additive explanations,explainable artificial intelligence

                Comments

                Comment on this article