25
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      A curated gluten protein sequence database to support development of proteomics methods for determination of gluten in gluten-free foods

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          The unique physiochemical properties of wheat gluten enable a diverse range of food products to be manufactured. However, gluten triggers coeliac disease, a condition which is treated using a gluten-free diet. Analytical methods are required to confirm if foods are gluten-free, but current immunoassay-based methods can unreliable and proteomic methods offer an alternative but require comprehensive and well annotated sequence databases which are lacking for gluten. A manually a curated database (GluPro V1.0) of gluten proteins, comprising 630 discrete unique full length protein sequences has been compiled. It is representative of the different types of gliadin and glutenin components found in gluten. An in silico comparison of their coeliac toxicity was undertaken by analysing the distribution of coeliac toxic motifs. This demonstrated that whilst the α-gliadin proteins contained more toxic motifs, these were distributed across all gluten protein sub-types. Comparison of annotations observed using a discovery proteomics dataset acquired using ion mobility MS/MS showed that more reliable identifications were obtained using the GluPro V1.0 database compared to the complete reviewed Viridiplantae database. This highlights the value of a curated sequence database specifically designed to support the proteomic workflows and the development of methods to detect and quantify gluten.

          Significance

          We have constructed the first manually curated open-source wheat gluten protein sequence database (GluPro V1.0) in a FASTA format to support the application of proteomic methods for gluten protein detection and quantification. We have also analysed the manually verified sequences to give the first comprehensive overview of the distribution of sequences able to elicit a reaction in coeliac disease, the prevalent form of gluten intolerance. Provision of this database will improve the reliability of gluten protein identification by proteomic analysis, and aid the development of targeted mass spectrometry methods in line with Codex Alimentarius Commission requirements for foods designed to meet the needs of gluten intolerant individuals.

          Graphical abstract

          Highlights

          • A curated database comprising 630 gluten protein sequences has been compiled.

          • Sequences represent the gliadin and glutenin protein species found in gluten.

          • Coeliac toxic motifs were distributed across all gluten protein sub-types.

          • It gave more reliable identifications than the reviewed Viridiplantae database.

          Related collections

          Most cited references43

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          UniProt: a hub for protein information

          UniProt is an important collection of protein sequences and their annotations, which has doubled in size to 80 million sequences during the past year. This growth in sequences has prompted an extension of UniProt accession number space from 6 to 10 characters. An increasing fraction of new sequences are identical to a sequence that already exists in the database with the majority of sequences coming from genome sequencing projects. We have created a new proteome identifier that uniquely identifies a particular assembly of a species and strain or subspecies to help users track the provenance of sequences. We present a new website that has been designed using a user-experience design process. We have introduced an annotation score for all entries in UniProt to represent the relative amount of knowledge known about each protein. These scores will be helpful in identifying which proteins are the best characterized and most informative for comparative analysis. All UniProt data is provided freely and is available on the web at http://www.uniprot.org/.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            Alpha-gliadin genes from the A, B, and D genomes of wheat contain different sets of celiac disease epitopes

            Background Bread wheat (Triticum aestivum) is an important staple food. However, wheat gluten proteins cause celiac disease (CD) in 0.5 to 1% of the general population. Among these proteins, the α-gliadins contain several peptides that are associated to the disease. Results We obtained 230 distinct α-gliadin gene sequences from severaldiploid wheat species representing the ancestral A, B, and D genomes of the hexaploid bread wheat. The large majority of these sequences (87%) contained an internal stop codon. All α-gliadin sequences could be distinguished according to the genome of origin on the basis of sequence similarity, of the average length of the polyglutamine repeats, and of the differences in the presence of four peptides that have been identified as T cell stimulatory epitopes in CD patients through binding to HLA-DQ2/8. By sequence similarity, α-gliadins from the public database of hexaploid T. aestivum could be assigned directly to chromosome 6A, 6B, or 6D. T. monococcum (A genome) sequences, as well as those from chromosome 6A of bread wheat, almost invariably contained epitope glia-α9 and glia-α20, but never the intact epitopes glia-α and glia-α2. A number of sequences from T. speltoides, as well as a number of sequences fromchromosome 6B of bread wheat, did not contain any of the four T cell epitopes screened for. The sequences from T. tauschii (D genome), as well as those from chromosome 6D of bread wheat, were found to contain all of these T cell epitopes in variable combinations per gene. The differences in epitope composition resulted mainly from point mutations. These substitutions appeared to be genome specific. Conclusion Our analysis shows that α-gliadin sequences from the three genomes of bread wheat form distinct groups. The four known T cell stimulatory epitopes are distributed non-randomly across the sequences, indicating that the three genomes contribute differently to epitope content. A systematic analysis of all known epitopes in gliadins and glutenins will lead to better understanding of the differences in toxicity among wheat varieties. On the basis of such insight, breeding strategies can be designed to generate less toxic varieties of wheat which may be tolerated by at least part of the CD patient population.
              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Genetics of Wheat Storage Proteins and the Effect of Allelic Variation on Bread-Making Quality

              P Payne (1987)
                Bookmark

                Author and article information

                Contributors
                Journal
                J Proteomics
                J Proteomics
                Journal of Proteomics
                Elsevier
                1874-3919
                1876-7737
                23 June 2017
                23 June 2017
                : 163
                : 67-75
                Affiliations
                [a ]School of Biological Sciences, Manchester Institute of Biotechnology, Manchester Academic Health Sciences Centre, University of Manchester, M17DN, UK
                [b ]School of Earth and Environmental Sciences, Manchester Institute of Biotechnology, University of Manchester, M17DN, UK
                [c ]Waters Corporation, Stamford Avenue, Altrincham Road, Wilmslow SK9 4AX, UK
                [d ]Genon Laboratories Limited, Cragg Vale, Halifax, UK
                [e ]Rothamsted Research, Harpenden, UK
                Author notes
                [* ]Corresponding author. clare.mills@ 123456manchester.ac.uk
                Article
                S1874-3919(17)30117-3
                10.1016/j.jprot.2017.03.026
                5479479
                28385663
                d3837228-490c-4fe5-9c90-9e00c32332f4
                © 2017 The Authors. Published by Elsevier B.V.

                This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).

                History
                : 24 January 2017
                : 20 March 2017
                : 28 March 2017
                Categories
                Article

                Molecular biology
                ao, allergen online database,bp, biopep database,blast, basic local alignment search tool,cdna, complementary dna,falcpa, food allergen labelling and consumer protection act,hmw, high molecular weight,im, ion mobility,lmw, low molecular weight,ppm, parts per million,triticum aestivum,gluten,celiac toxic motif,database,proteomics

                Comments

                Comment on this article