0
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      MilkOligoThesaurus, a dataset of mammalian milk oligosaccharide synonyms

      data-paper

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          There is a growing interest in milk oligosaccharides (MOs) because of their numerous benefits for newborns’ and long-term health. A large number of MO structures have been identified in mammalian milk. Mostly described in human milk, the oligosaccharide richness, although less broad, has also been reported for a wide range of mammalian species. The structure of MOs is particularly difficult to report as it results from the combination of 5 monosaccharides linked by various glycosidic bonds forming structurally diverse and complex matrices of linear and branched oligosaccharides. Exploring the literature and extracting relevant information on MO diversity within or across species appears promising to elucidate structure-function role of MOs. Currently, given the complexity of these molecules, the main issues in exploring literature to extract relevant information on MO diversity within or across species relate to the heterogeneity in the way authors refer to these molecules. Herein, we provide a thesaurus (MilkOligoThesaurus) including the names and synonyms of MOs collected from key selected articles on mammalian milk analyses. MilkOligoThesaurus gathers the names of the MOs with a complete description of their monosaccharide composition and structures. When available, each unique MO molecule is linked to its ID from the NCBI PubChem and ChEBI databases. MilkOligoThesaurus is provided in a tabular format. It gathers 245 unique oligosaccharide structures described by 22 features (columns) including the name of the molecule, its abbreviation, the chemical database IDs if available, the monosaccharide composition, chemical information (molecular formula, monoisotopic mass), synonyms, its formula in condensed form, and in abbreviated condensed form, the abbreviated systematic name, the systematic name, the isomer group, and scientific article sources. MilkOligoThesaurus is also provided in the SKOS (Simple Knowledge Organization System) format. This thesaurus is a valuable resource gathering MO naming variations that are not found elsewhere for (i) Text and Data Mining to enable automatic annotation and rapid extraction of milk oligosaccharide data from scientific papers; (ii) biology researchers aiming to search for or decipher the structure of milk oligosaccharides based on any of their names, abbreviations or monosaccharide compositions and linkages.

          Related collections

          Most cited references12

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          The FAIR Guiding Principles for scientific data management and stewardship

          There is an urgent need to improve the infrastructure supporting the reuse of scholarly data. A diverse set of stakeholders—representing academia, industry, funding agencies, and scholarly publishers—have come together to design and jointly endorse a concise and measureable set of principles that we refer to as the FAIR Data Principles. The intent is that these may act as a guideline for those wishing to enhance the reusability of their data holdings. Distinct from peer initiatives that focus on the human scholar, the FAIR Principles put specific emphasis on enhancing the ability of machines to automatically find and use the data, in addition to supporting its reuse by individuals. This Comment is the first formal publication of the FAIR Principles, and includes the rationale behind them, and some exemplar implementations in the community.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Symbol Nomenclature for Graphical Representations of Glycans.

              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Proposal for a standard system for drawing structural diagrams of N- and O-linked carbohydrates and related compounds.

              Symbolic diagrams are commonly used to depict N- and O-linked glycans but there is no general consensus as to how individual constituent monosaccharides or linkages are shown. This article proposes a system that avoids ambiguities inherent in most other systems and is appropriate for both hand drawing and computer applications. Constituent monosaccharides are depicted by shapes modified to show OAc, deoxy, etc. Linkage is indicated by the bond angle and anomericity by solid (beta) or dashed (alpha) lines.
                Bookmark

                Author and article information

                Contributors
                @vloux
                Journal
                Data Brief
                Data Brief
                Data in Brief
                Elsevier
                2352-3409
                09 April 2024
                June 2024
                09 April 2024
                : 54
                : 110404
                Affiliations
                [a ]GenPhySE, Université de Toulouse, INRAE, ENVT, 31326, Castanet-Tolosan, France
                [b ]Université Paris-Saclay, CEA, INRAE, Département Médicaments et Technologies pour la Santé (DMTS), MetaboHUB, 91191 Gif sur Yvette
                [c ]INRAE, LPGP, 35000 Rennes, France
                [d ]Université Paris-Saclay, INRAE, BioinfOmics, MIGALE Bioinformatics Facility, Jouy-en-Josas, France
                [e ]Université Paris-Saclay, INRAE, MaIAGE, Jouy-en-Josas, France
                [f ]INRAE, DipSO, 42 rue Georges Morel, 49070 Beaucouzé, France
                Author notes
                [* ]Corresponding author. sylvie.combes@ 123456inrae.fr
                Article
                S2352-3409(24)00373-1 110404
                10.1016/j.dib.2024.110404
                11043833
                38665156
                bc85762e-c72c-45ee-914a-71c618a22f3c
                © 2024 The Author(s)

                This is an open access article under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

                History
                : 3 October 2023
                : 22 February 2024
                : 5 April 2024
                Categories
                Data Article

                chemical nomenclature,normalized milk oligosaccharide name,milk oligosaccharide monoisotopic mass,milk oligosaccharide monosaccharide composition,oligosaccharide isomer name,vocabulary extraction,systematic names

                Comments

                Comment on this article