9
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: not found

      A survey of computational methods and error rate estimation procedures for peptide and protein identification in shotgun proteomics.

      Journal of Proteomics
      Computational Biology, methods, statistics & numerical data, Databases, Protein, False Positive Reactions, Mass Spectrometry, Peptides, analysis, chemistry, Proteins, Proteomics

      Read this article at

      ScienceOpenPublisherPMC
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          This manuscript provides a comprehensive review of the peptide and protein identification process using tandem mass spectrometry (MS/MS) data generated in shotgun proteomic experiments. The commonly used methods for assigning peptide sequences to MS/MS spectra are critically discussed and compared, from basic strategies to advanced multi-stage approaches. A particular attention is paid to the problem of false-positive identifications. Existing statistical approaches for assessing the significance of peptide to spectrum matches are surveyed, ranging from single-spectrum approaches such as expectation values to global error rate estimation procedures such as false discovery rates and posterior probabilities. The importance of using auxiliary discriminant information (mass accuracy, peptide separation coordinates, digestion properties, and etc.) is discussed, and advanced computational approaches for joint modeling of multiple sources of information are presented. This review also includes a detailed analysis of the issues affecting the interpretation of data at the protein level, including the amplification of error rates when going from peptide to protein level, and the ambiguities in inferring the identifies of sample proteins in the presence of shared peptides. Commonly used methods for computing protein-level confidence scores are discussed in detail. The review concludes with a discussion of several outstanding computational issues. Copyright © 2010 Elsevier B.V. All rights reserved.

          Related collections

          Author and article information

          Journal
          20816881
          2956504
          10.1016/j.jprot.2010.08.009

          Chemistry
          Computational Biology,methods,statistics & numerical data,Databases, Protein,False Positive Reactions,Mass Spectrometry,Peptides,analysis,chemistry,Proteins,Proteomics

          Comments

          Comment on this article