1
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Disease gene prioritization using network topological analysis from a sequence based human functional linkage network

      Preprint
      , , ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Sequencing large number of candidate disease genes which cause diseases in order to identify the relationship between them is an expensive and time-consuming task. To handle these challenges, different computational approaches have been developed. Based on the observation that genes associated with similar diseases have a higher likelihood of interaction, a large class of these approaches relay on analyzing the topological properties of biological networks. However, the incomplete and noisy nature of biological networks is known as an important challenge in these approaches. In this paper, we propose a two-step framework for disease gene prioritization: (1) construction of a reliable human FLN using sequence information and machine learning techniques, (2) prioritizing the disease gene relations based on the constructed FLN. On our framework, unlike other FLN based frameworks that using FLNs based on integration of various low quality biological data, the sequence of proteins is used as the comprehensive data to construct a reliable initial network. In addition, the physicochemical properties of amino-acids are employed to describe the functionality of proteins. All in all, the proposed approach is evaluated and the results indicate the high efficiency and validity of the FLN in disease gene prioritization.

          Related collections

          Most cited references24

          • Record: found
          • Abstract: not found
          • Article: not found

          A Review on Ensembles for the Class Imbalance Problem: Bagging-, Boosting-, and Hybrid-Based Approaches

            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders.

            A Hamosh (2002)
            Online Mendelian Inheritance in Man (OMIM) is a comprehensive, authoritative and timely knowledgebase of human genes and genetic disorders compiled to support research and education in human genomics and the practice of clinical genetics. Started by Dr Victor A. McKusick as the definitive reference Mendelian Inheritance in Man, OMIM (www.ncbi.nlm.nih.gov/omim) is now distributed electronically by the National Center for Biotechnology Information (NCBI), where it is integrated with the Entrez suite of databases. Derived from the biomedical literature, OMIM is written and edited at Johns Hopkins University with input from scientists and physicians around the world. Each OMIM entry has a full-text summary of a genetically determined phenotype and/or gene and has numerous links to other genetic databases such as DNA and protein sequence, PubMed references, general and locus-specific mutation databases, approved gene nomenclature, and the highly detailed mapviewer, as well as patient support groups and many others. OMIM is an easy and straightforward portal to the burgeoning information in human genetics.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Predicting disease genes using protein-protein interactions.

              The responsible genes have not yet been identified for many genetically mapped disease loci. Physically interacting proteins tend to be involved in the same cellular process, and mutations in their genes may lead to similar disease phenotypes. To investigate whether protein-protein interactions can predict genes for genetically heterogeneous diseases. 72,940 protein-protein interactions between 10,894 human proteins were used to search 432 loci for candidate disease genes representing 383 genetically heterogeneous hereditary diseases. For each disease, the protein interaction partners of its known causative genes were compared with the disease associated loci lacking identified causative genes. Interaction partners located within such loci were considered candidate disease gene predictions. Prediction accuracy was tested using a benchmark set of known disease genes. Almost 300 candidate disease gene predictions were made. Some of these have since been confirmed. On average, 10% or more are expected to be genuine disease genes, representing a 10-fold enrichment compared with positional information only. Examples of interesting candidates are AKAP6 for arrythmogenic right ventricular dysplasia 3 and SYN3 for familial partial epilepsy with variable foci. Exploiting protein-protein interactions can greatly increase the likelihood of finding positional candidate disease genes. When applied on a large scale they can lead to novel candidate gene predictions.
                Bookmark

                Author and article information

                Journal
                15 April 2019
                Article
                1904.06973
                c483f215-98a8-4c63-8c76-6f0c3cac66a8

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                q-bio.MN q-bio.GN

                Molecular biology,Genetics
                Molecular biology, Genetics

                Comments

                Comment on this article