21
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Evaluating performance of existing computational models in predicting CD8+ T cell pathogenic epitopes and cancer neoantigens

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          T cell recognition of a cognate peptide–major histocompatibility complex (pMHC) presented on the surface of infected or malignant cells is of the utmost importance for mediating robust and long-term immune responses. Accurate predictions of cognate pMHC targets for T cell receptors would greatly facilitate identification of vaccine targets for both pathogenic diseases and personalized cancer immunotherapies. Predicting immunogenic peptides therefore has been at the center of intensive research for the past decades but has proven challenging. Although numerous models have been proposed, performance of these models has not been systematically evaluated and their success rate in predicting epitopes in the context of human pathology has not been measured and compared. In this study, we evaluated the performance of several publicly available models, in identifying immunogenic CD8+ T cell targets in the context of pathogens and cancers. We found that for predicting immunogenic peptides from an emerging virus such as severe acute respiratory syndrome coronavirus 2, none of the models perform substantially better than random or offer considerable improvement beyond HLA ligand prediction. We also observed suboptimal performance for predicting cancer neoantigens. Through investigation of potential factors associated with ill performance of models, we highlight several data- and model-associated issues. In particular, we observed that cross-HLA variation in the distribution of immunogenic and non-immunogenic peptides in the training data of the models seems to substantially confound the predictions. We additionally compared key parameters associated with immunogenicity between pathogenic peptides and cancer neoantigens and observed evidence for differences in the thresholds of binding affinity and stability, which suggested the need to modulate different features in identifying immunogenic pathogen versus cancer peptides. Overall, we demonstrate that accurate and reliable predictions of immunogenic CD8+ T cell targets remain unsolved; thus, we hope our work will guide users and model developers regarding potential pitfalls and unsettled questions in existing immunogenicity predictors.

          Related collections

          Most cited references41

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets

          Binary classifiers are routinely evaluated with performance measures such as sensitivity and specificity, and performance is frequently illustrated with Receiver Operating Characteristics (ROC) plots. Alternative measures such as positive predictive value (PPV) and the associated Precision/Recall (PRC) plots are used less frequently. Many bioinformatics studies develop and evaluate classifiers that are to be applied to strongly imbalanced datasets in which the number of negatives outweighs the number of positives significantly. While ROC plots are visually appealing and provide an overview of a classifier's performance across a wide range of specificities, one can ask whether ROC plots could be misleading when applied in imbalanced classification scenarios. We show here that the visual interpretability of ROC plots in the context of imbalanced datasets can be deceptive with respect to conclusions about the reliability of classification performance, owing to an intuitive but wrong interpretation of specificity. PRC plots, on the other hand, can provide the viewer with an accurate prediction of future classification performance due to the fact that they evaluate the fraction of true positives among positive predictions. Our findings have potential implications for the interpretation of a large number of studies that use ROC plots on imbalanced datasets.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            NetMHCpan-4.1 and NetMHCIIpan-4.0: improved predictions of MHC antigen presentation by concurrent motif deconvolution and integration of MS MHC eluted ligand data

            Abstract Major histocompatibility complex (MHC) molecules are expressed on the cell surface, where they present peptides to T cells, which gives them a key role in the development of T-cell immune responses. MHC molecules come in two main variants: MHC Class I (MHC-I) and MHC Class II (MHC-II). MHC-I predominantly present peptides derived from intracellular proteins, whereas MHC-II predominantly presents peptides from extracellular proteins. In both cases, the binding between MHC and antigenic peptides is the most selective step in the antigen presentation pathway. Therefore, the prediction of peptide binding to MHC is a powerful utility to predict the possible specificity of a T-cell immune response. Commonly MHC binding prediction tools are trained on binding affinity or mass spectrometry-eluted ligands. Recent studies have however demonstrated how the integration of both data types can boost predictive performances. Inspired by this, we here present NetMHCpan-4.1 and NetMHCIIpan-4.0, two web servers created to predict binding between peptides and MHC-I and MHC-II, respectively. Both methods exploit tailored machine learning strategies to integrate different training data types, resulting in state-of-the-art performance and outperforming their competitors. The servers are available at http://www.cbs.dtu.dk/services/NetMHCpan-4.1/ and http://www.cbs.dtu.dk/services/NetMHCIIpan-4.0/.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Predicting immunogenic tumour mutations by combining mass spectrometry and exome sequencing.

              Human tumours typically harbour a remarkable number of somatic mutations. If presented on major histocompatibility complex class I molecules (MHCI), peptides containing these mutations could potentially be immunogenic as they should be recognized as 'non-self' neo-antigens by the adaptive immune system. Recent work has confirmed that mutant peptides can serve as T-cell epitopes. However, few mutant epitopes have been described because their discovery required the laborious screening of patient tumour-infiltrating lymphocytes for their ability to recognize antigen libraries constructed following tumour exome sequencing. We sought to simplify the discovery of immunogenic mutant peptides by characterizing their general properties. We developed an approach that combines whole-exome and transcriptome sequencing analysis with mass spectrometry to identify neo-epitopes in two widely used murine tumour models. Of the >1,300 amino acid changes identified, ∼13% were predicted to bind MHCI, a small fraction of which were confirmed by mass spectrometry. The peptides were then structurally modelled bound to MHCI. Mutations that were solvent-exposed and therefore accessible to T-cell antigen receptors were predicted to be immunogenic. Vaccination of mice confirmed the approach, with each predicted immunogenic peptide yielding therapeutically active T-cell responses. The predictions also enabled the generation of peptide-MHCI dextramers that could be used to monitor the kinetics and distribution of the anti-tumour T-cell response before and after vaccination. These findings indicate that a suitable prediction algorithm may provide an approach for the pharmacodynamic monitoring of T-cell responses as well as for the development of personalized vaccines in cancer patients.
                Bookmark

                Author and article information

                Journal
                Brief Bioinform
                Brief Bioinform
                bib
                Briefings in Bioinformatics
                Oxford University Press
                1467-5463
                1477-4054
                May 2022
                25 April 2022
                25 April 2022
                : 23
                : 3
                : bbac141
                Affiliations
                [1 ] MRC Human Immunology Unit, Medical Research Council (MRC) Human Immunology Unit, MRC Weatherall Institute of Molecular Medicine (WIMM), John Radcliffe Hospital, University of Oxford , Oxford, United Kingdom
                [2 ] MRC WIMM Centre for Computational Biology, Medical Research Council (MRC) Weatherall Institute of Molecular Medicine, John Radcliffe Hospital, University of Oxford , Oxford, United Kingdom
                [3 ] Department of Neurosurgery, Oxford University Hospitals NHS Foundation Trust, John Radcliffe Hospital, University of Oxford , Oxford, United Kingdom
                [4 ] Nuffield Department of Surgical Sciences, University of Oxford , Oxford, United Kingdom
                [5 ] Centre for Immuno-Oncology, Nuffield Department of Medicine, University of Oxford , Oxford, United Kingdom
                [6 ] Chief Scientific Officer, ImmunoMind Inc , Berkeley, CA, USA
                [7 ] Department of Genomics of Adaptive Immunity, Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry , Moscow, 117997, Russia
                [8 ] Institute of Translational Medicine, Pirogov Russian National Research Medical University , Moscow, 117997, Russia
                [9 ] The Ludwig Institute for Cancer Research, Old Road Campus Research Building, University of Oxford , Oxford, United Kingdom
                [10 ] Alan Turning Fellow, University of Oxford , Oxford, United Kingdom
                Author notes
                Corresponding author: Hashem Koohy, Associate Professor of Systems immunology, Alan Turing Fellow, Group Head, MRC Human Immunology Unit, MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford OX3 9DS, UK. Tel: 44(0)1865222430; E-mail: hashem.koohy@ 123456rdm.ox.ac.uk

                Paul R. Buckley, Chloe H. Lee and Ruichong Ma authors contributed equally to this work.

                Author information
                https://orcid.org/0000-0002-3640-7043
                Article
                bbac141
                10.1093/bib/bbac141
                9116217
                35471658
                533cd845-6092-48c4-a5d9-0c3ff3bb5533
                © The Author(s) 2022. Published by Oxford University Press.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 20 January 2022
                : 9 March 2022
                : 26 March 2022
                Page count
                Pages: 18
                Funding
                Funded by: UK Medical Research Council, DOI 10.13039/501100000265;
                Award ID: MC_UU_12010/3
                Funded by: Ministry of Science and Higher Education of the Russian Federation, DOI 10.13039/501100012190;
                Award ID: 075-15-2020-807
                Categories
                Problem Solving Protocol
                AcademicSubjects/SCI01060

                Bioinformatics & Computational biology
                peptide immunogenicity,cd8 t-cell target,peptide presentation,t-cell response,cancer neoantigen,immunotherapy

                Comments

                Comment on this article