13
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      FEDRR: fast, exhaustive detection of redundant hierarchical relations for quality improvement of large biomedical ontologies

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          Redundant hierarchical relations refer to such patterns as two paths from one concept to another, one with length one (direct) and the other with length greater than one (indirect). Each redundant relation represents a possibly unintended defect that needs to be corrected in the ontology quality assurance process. Detecting and eliminating redundant relations would help improve the results of all methods relying on the relevant ontological systems as knowledge source, such as the computation of semantic distance between concepts and for ontology matching and alignment.

          Results

          This paper introduces a novel and scalable approach, called FEDRR – Fast, Exhaustive Detection of Redundant Relations – for quality assurance work during ontological evolution. FEDRR combines the algorithm ideas of Dynamic Programming with Topological Sort, for exhaustive mining of all redundant hierarchical relations in ontological hierarchies, in O( c·| V|+| E|) time, where | V| is the number of concepts, | E| is the number of the relations, and c is a constant in practice. Using FEDRR, we performed exhaustive search of all redundant is-a relations in two of the largest ontological systems in biomedicine: SNOMED CT and Gene Ontology (GO). 372 and 1609 redundant is-a relations were found in the 2015-09-01 version of SNOMED CT and 2015-05-01 version of GO, respectively. We have also performed FEDRR on over 190 source vocabularies in the UMLS - a large integrated repository of biomedical ontologies, and identified six sources containing redundant is-a relations. Randomly generated ontologies have also been used to further validate the efficiency of FEDRR.

          Conclusions

          FEDRR provides a generally applicable, effective tool for systematic detecting redundant relations in large ontological systems for quality improvement.

          Related collections

          Most cited references28

          • Record: found
          • Abstract: found
          • Article: not found

          SNOMED-CT: The advanced terminology and coding system for eHealth.

          A clinical terminology is essential for Electronic Health records. It represents clinical information input into clinical IT systems by clinicians in a machine-readable manner. Use of a Clinical Terminology, implemented within a clinical information system, will enable the delivery of many patient health benefits including electronic clinical decision support, disease screening and enhanced patient safety. For example, it will help reduce medication-prescribing errors, which are currently known to kill or injure many citizens. It will also reduce clinical administration effort and the overall costs of healthcare.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Measuring semantic similarity between Gene Ontology terms

              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Biomedical ontologies in action: role in knowledge management, data integration and decision support.

              To provide typical examples of biomedical ontologies in action, emphasizing the role played by biomedical ontologies in knowledge management, data integration and decision support. Biomedical ontologies selected for their practical impact are examined from a functional perspective. Examples of applications are taken from operational systems and the biomedical literature, with a bias towards recent journal articles. The ontologies under investigation in this survey include SNOMED CT, the Logical Observation Identifiers, Names, and Codes (LOINC), the Foundational Model of Anatomy, the Gene Ontology, RxNorm, the National Cancer Institute Thesaurus, the International Classification of Diseases, the Medical Subject Headings (MeSH) and the Unified Medical Language System (UMLS). The roles played by biomedical ontologies are classified into three major categories: knowledge management (indexing and retrieval of data and information, access to information, mapping among ontologies); data integration, exchange and semantic interoperability; and decision support and reasoning (data selection and aggregation, decision support, natural language processing applications, knowledge discovery). Ontologies play an important role in biomedical research through a variety of applications. While ontologies are used primarily as a source of vocabulary for standardization and integration purposes, many applications also use them as a source of computable knowledge. Barriers to the use of ontologies in biomedical applications are discussed.
                Bookmark

                Author and article information

                Contributors
                guangming.xing@wku.edu
                gqatcase@gmail.com
                licong.cui@uky.edu
                Journal
                BioData Min
                BioData Min
                BioData Mining
                BioMed Central (London )
                1756-0381
                10 October 2016
                10 October 2016
                2016
                : 9
                : 31
                Affiliations
                [1 ]Department of Computer Science, Western Kentucky University, Bowling Green, 42101 KY USA
                [2 ]Institute of Biomedical Informatics, University of Kentucky, Lexington, 40536 KY USA
                [3 ]Department of Computer Science, University of Kentucky, Lexington, 40506 KY USA
                Author information
                http://orcid.org/0000-0001-5549-8780
                Article
                110
                10.1186/s13040-016-0110-8
                5057496
                da55c3e2-c4f4-441a-aef9-e66d71de1f69
                © The Author(s) 2016

                Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License ( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

                History
                : 2 March 2016
                : 3 October 2016
                Categories
                Research
                Custom metadata
                © The Author(s) 2016

                Bioinformatics & Computational biology
                redundant relations,dynamic programming,snomed ct,gene ontology,umls

                Comments

                Comment on this article