159
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      On the Use of Gene Ontology Annotations to Assess Functional Similarity among Orthologs and Paralogs: A Short Report

      research-article
      1 , * , 2 , 3 , 3 , 4 , on behalf of the Gene Ontology Consortium
      PLoS Computational Biology
      Public Library of Science

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          A recent paper (Nehrt et al., PLoS Comput. Biol. 7:e1002073, 2011) has proposed a metric for the “functional similarity” between two genes that uses only the Gene Ontology (GO) annotations directly derived from published experimental results. Applying this metric, the authors concluded that paralogous genes within the mouse genome or the human genome are more functionally similar on average than orthologous genes between these genomes, an unexpected result with broad implications if true. We suggest, based on both theoretical and empirical considerations, that this proposed metric should not be interpreted as a functional similarity, and therefore cannot be used to support any conclusions about the “ortholog conjecture” (or, more properly, the “ortholog functional conservation hypothesis”). First, we reexamine the case studies presented by Nehrt et al. as examples of orthologs with divergent functions, and come to a very different conclusion: they actually exemplify how GO annotations for orthologous genes provide complementary information about conserved biological functions. We then show that there is a global ascertainment bias in the experiment-based GO annotations for human and mouse genes: particular types of experiments tend to be performed in different model organisms. We conclude that the reported statistical differences in annotations between pairs of orthologous genes do not reflect differences in biological function, but rather complementarity in experimental approaches. Our results underscore two general considerations for researchers proposing novel types of analysis based on the GO: 1) that GO annotations are often incomplete, potentially in a biased manner, and subject to an “open world assumption” (absence of an annotation does not imply absence of a function), and 2) that conclusions drawn from a novel, large-scale GO analysis should whenever possible be supported by careful, in-depth examination of examples, to help ensure the conclusions have a justifiable biological basis.

          Author Summary

          Understanding gene function—how individual genes contribute to the biology of an organism at the molecular, cellular and organism levels—is one of the primary aims of biomedical research. It has been a longstanding tenet of model organism research that experimental knowledge obtained in one organism is often applicable to other organisms, particularly if the organisms share the relevant genes because they inherited them from their common ancestor. Nevertheless this tenet is, like any hypothesis, not beyond question. A recent paper has termed this hypothesis a “conjecture,” and performed a statistical analysis, the results of which were interpreted as evidence against the hypothesis. This statistical analysis relied on a computational representation of gene function, the Gene Ontology (GO). As representatives of the international consortium that produces the GO, we show how the apparent evidence against the “ortholog conjecture” can be better explained as an artifact of how molecular biology knowledge is accumulated. In short, a complementarity between knowledge obtained in mouse and human experimental systems was incorrectly interpreted as a disagreement. We discuss the proper interpretation of GO annotations and potential sources of bias, with an eye toward enhancing the informed use of the GO by the scientific community.

          Related collections

          Most cited references15

          • Record: found
          • Abstract: found
          • Article: not found

          Gene Ontology: tool for the unification of biology

          Genomic sequencing has made it clear that a large fraction of the genes specifying the core biological functions are shared by all eukaryotes. Knowledge of the biological role of such shared proteins in one organism can often be transferred to other organisms. The goal of the Gene Ontology Consortium is to produce a dynamic, controlled vocabulary that can be applied to all eukaryotes even as knowledge of gene and protein roles in cells is accumulating and changing. To this end, three independent ontologies accessible on the World-Wide Web (http://www.geneontology.org) are being constructed: biological process, molecular function and cellular component.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Distinguishing homologous from analogous proteins.

              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Physiological and molecular basis of thyroid hormone action.

              P M Yen (2001)
              Thyroid hormones (THs) play critical roles in the differentiation, growth, metabolism, and physiological function of virtually all tissues. TH binds to receptors that are ligand-regulatable transcription factors belonging to the nuclear hormone receptor superfamily. Tremendous progress has been made recently in our understanding of the molecular mechanisms that underlie TH action. In this review, we present the major advances in our knowledge of the molecular mechanisms of TH action and their implications for TH action in specific tissues, resistance to thyroid hormone syndrome, and genetically engineered mouse models.
                Bookmark

                Author and article information

                Contributors
                Role: Editor
                Journal
                PLoS Comput Biol
                plos
                ploscomp
                PLoS Computational Biology
                Public Library of Science (San Francisco, USA )
                1553-734X
                1553-7358
                February 2012
                February 2012
                16 February 2012
                : 8
                : 2
                : e1002386
                Affiliations
                [1 ]Division of Bioinformatics, Department of Preventive Medicine, University of Southern California, Los Angeles, California, United States of America
                [2 ]Cambridge Systems Biology Centre and Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
                [3 ]Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, California, United States of America
                [4 ]Bioinformatics and Computational Biology, The Jackson Laboratory, Bar Harbor, Maine, United States of America
                University of California San Diego, United States of America
                Author notes

                Conceived and designed the experiments: PDT VW. Performed the experiments: PDT VW CJM. Analyzed the data: PDT VW CJM SEL JAB. Wrote the paper: PDT VW SEL JAB.

                Article
                PCOMPBIOL-D-11-01744
                10.1371/journal.pcbi.1002386
                3280971
                22359495
                63bde014-1d7e-4c38-a06d-ebafc2520e9a
                This is an open-access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication.
                History
                : 23 November 2011
                : 21 December 2011
                Page count
                Pages: 7
                Categories
                Research Article
                Biology
                Computational Biology
                Genomics
                Model Organisms

                Quantitative & Systems biology
                Quantitative & Systems biology

                Comments

                Comment on this article

                scite_
                0
                0
                0
                0
                Smart Citations
                0
                0
                0
                0
                Citing PublicationsSupportingMentioningContrasting
                View Citations

                See how this article has been cited at scite.ai

                scite shows how a scientific paper has been cited by providing the context of the citation, a classification describing whether it supports, mentions, or contrasts the cited claim, and a label indicating in which section the citation was made.

                Similar content15

                Cited by45

                Most referenced authors789