31
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Integrated Bayesian analysis of rare exonic variants to identify risk genes for schizophrenia and neurodevelopmental disorders

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          Integrating rare variation from trio family and case–control studies has successfully implicated specific genes contributing to risk of neurodevelopmental disorders (NDDs) including autism spectrum disorders (ASD), intellectual disability (ID), developmental disorders (DDs), and epilepsy (EPI). For schizophrenia (SCZ), however, while sets of genes have been implicated through the study of rare variation, only two risk genes have been identified.

          Methods

          We used hierarchical Bayesian modeling of rare-variant genetic architecture to estimate mean effect sizes and risk-gene proportions, analyzing the largest available collection of whole exome sequence data for SCZ (1,077 trios, 6,699 cases, and 13,028 controls), and data for four NDDs (ASD, ID, DD, and EPI; total 10,792 trios, and 4,058 cases and controls).

          Results

          For SCZ, we estimate there are 1,551 risk genes. There are more risk genes and they have weaker effects than for NDDs. We provide power analyses to predict the number of risk-gene discoveries as more data become available. We confirm and augment prior risk gene and gene set enrichment results for SCZ and NDDs. In particular, we detected 98 new DD risk genes at FDR < 0.05. Correlations of risk-gene posterior probabilities are high across four NDDs ( ρ>0.55), but low between SCZ and the NDDs ( ρ<0.3). An in-depth analysis of 288 NDD genes shows there is highly significant protein–protein interaction (PPI) network connectivity, and functionally distinct PPI subnetworks based on pathway enrichment, single-cell RNA-seq cell types, and multi-region developmental brain RNA-seq.

          Conclusions

          We have extended a pipeline used in ASD studies and applied it to infer rare genetic parameters for SCZ and four NDDs ( https://github.com/hoangtn/extTADA). We find many new DD risk genes, supported by gene set enrichment and PPI network connectivity analyses. We find greater similarity among NDDs than between NDDs and SCZ. NDD gene subnetworks are implicated in postnatally expressed presynaptic and postsynaptic genes, and for transcriptional and post-transcriptional gene regulation in prenatal neural progenitor and stem cells.

          Electronic supplementary material

          The online version of this article (doi:10.1186/s13073-017-0497-y) contains supplementary material, which is available to authorized users.

          Related collections

          Most cited references38

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          A step-by-step workflow for low-level analysis of single-cell RNA-seq data with Bioconductor

          Single-cell RNA sequencing (scRNA-seq) is widely used to profile the transcriptome of individual cells. This provides biological resolution that cannot be matched by bulk RNA sequencing, at the cost of increased technical noise and data complexity. The differences between scRNA-seq and bulk RNA-seq data mean that the analysis of the former cannot be performed by recycling bioinformatics pipelines for the latter. Rather, dedicated single-cell methods are required at various steps to exploit the cellular resolution while accounting for technical noise. This article describes a computational workflow for low-level analyses of scRNA-seq data, based primarily on software packages from the open-source Bioconductor project. It covers basic steps including quality control, data exploration and normalization, as well as more complex procedures such as cell cycle phase assignment, identification of highly variable and correlated genes, clustering into subpopulations and marker gene detection. Analyses were demonstrated on gene-level count data from several publicly available datasets involving haematopoietic stem cells, brain-derived cells, T-helper cells and mouse embryonic stem cells. This will provide a range of usage scenarios from which readers can construct their own analysis pipelines.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Systematic discovery of regulatory motifs in human promoters and 3' UTRs by comparison of several mammals.

            Comprehensive identification of all functional elements encoded in the human genome is a fundamental need in biomedical research. Here, we present a comparative analysis of the human, mouse, rat and dog genomes to create a systematic catalogue of common regulatory motifs in promoters and 3' untranslated regions (3' UTRs). The promoter analysis yields 174 candidate motifs, including most previously known transcription-factor binding sites and 105 new motifs. The 3'-UTR analysis yields 106 motifs likely to be involved in post-transcriptional regulation. Nearly one-half are associated with microRNAs (miRNAs), leading to the discovery of many new miRNA genes and their likely target genes. Our results suggest that previous estimates of the number of human miRNA genes were low, and that miRNAs regulate at least 20% of human genes. The overall results provide a systematic view of gene regulation in the human, which will be refined as additional mammalian genomes become available.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              dbNSFP v3.0: A One-Stop Database of Functional Predictions and Annotations for Human Nonsynonymous and Splice-Site SNVs.

              The purpose of the dbNSFP is to provide a one-stop resource for functional predictions and annotations for human nonsynonymous single-nucleotide variants (nsSNVs) and splice-site variants (ssSNVs), and to facilitate the steps of filtering and prioritizing SNVs from a large list of SNVs discovered in an exome-sequencing study. A list of all potential nsSNVs and ssSNVs based on the human reference sequence were created and functional predictions and annotations were curated and compiled for each SNV. Here, we report a recent major update of the database to version 3.0. The SNV list has been rebuilt based on GENCODE 22 and currently the database includes 82,832,027 nsSNVs and ssSNVs. An attached database dbscSNV, which compiled all potential human SNVs within splicing consensus regions and their deleteriousness predictions, add another 15,030,459 potentially functional SNVs. Eleven prediction scores (MetaSVM, MetaLR, CADD, VEST3, PROVEAN, 4× fitCons, fathmm-MKL, and DANN) and allele frequencies from the UK10K cohorts and the Exome Aggregation Consortium (ExAC), among others, have been added. The original seven prediction scores in v2.0 (SIFT, 2× Polyphen2, LRT, MutationTaster, MutationAssessor, and FATHMM) as well as many SNV and gene functional annotations have been updated. dbNSFP v3.0 is freely available at http://sites.google.com/site/jpopgen/dbNSFP.
                Bookmark

                Author and article information

                Contributors
                tan-hoang.nguyen@mssm.edu
                julien.bryois@ki.se
                aprilkim@broadinstitute.org
                amanda.dobbyn@mssm.edu
                laura.huckins@mssm.edu
                ana.munoz.manchado@ki.se
                douglas.ruderfer@vanderbilt.edu
                giulio.genovese@gmail.com
                fromer@broadinstitute.org
                xinyi.xu@mssm.edu
                dalila.pinto@mssm.edu
                Sten.Linnarsson@ki.se
                matthijsverhage@me.com
                guus.smit@vu.nl
                hjerling.leffler@gmail.com
                joseph.buxbaum@mssm.edu
                Christina.Hultman@mep.ki.se
                shaun.purcell@mssm.edu
                Lage.Kasper@mgh.harvard.edu
                xinhe@uchicago.edu
                pfs9999@gmail.com
                eli.stahl@mssm.edu
                Journal
                Genome Med
                Genome Med
                Genome Medicine
                BioMed Central (London )
                1756-994X
                20 December 2017
                20 December 2017
                2017
                : 9
                : 114
                Affiliations
                [1 ]ISNI 0000 0001 0670 2351, GRID grid.59734.3c, Division of Psychiatric Genomics, Department of Genetics and Genomic Sciences, Institute for Genomics and Multiscale Biology, , Icahn School of Medicine at Mount Sinai, ; New York, 10029 NY USA
                [2 ]ISNI 0000 0004 1937 0626, GRID grid.4714.6, Department of Medical Epidemiology and Biostatistics, , Karolinska Institutet, ; Stockholm, Sweden
                [3 ]GRID grid.66859.34, Stanley Center for Psychiatric Research, , Broad Institute of MIT and Harvard, ; Cambridge, Massachusetts USA
                [4 ]ISNI 0000 0004 0386 9924, GRID grid.32224.35, Department of Surgery, , Massachusetts General Hospital, ; Boston, 02114 MA USA
                [5 ]ISNI 0000 0001 0670 2351, GRID grid.59734.3c, Charles Bronfman Institute for Personalized Medicine, , Icahn School of Medicine at Mount Sinai, ; New York, 10029 NY USA
                [6 ]ISNI 0000 0004 1937 0626, GRID grid.4714.6, Laboratory of Molecular Neurobiology, Department of Medical Biochemistry and Biophysics, , Karolinska Institutet, ; Stockholm, SE-17177 Sweden
                [7 ]ISNI 0000 0004 1936 9916, GRID grid.412807.8, Division of Genetic Medicine, Departments of Medicine, Psychiatry and Biomedical Informatics, Vanderbilt Genetics Institute, , Vanderbilt University Medical Center, ; Nashville, 37235 TN USA
                [8 ]ISNI 000000041936754X, GRID grid.38142.3c, Department of Genetics, , Harvard Medical School, ; Cambridge, Massachusetts USA
                [9 ]Verily Life Sciences, 269 E Grand Ave, South San Francisco, 94080 CA USA
                [10 ]ISNI 0000 0001 0670 2351, GRID grid.59734.3c, Seaver Autism Center, Department of Psychiatry, , Icahn School of Medicine at Mount Sinai, ; New York, 10029 NY USA
                [11 ]ISNI 0000 0001 0670 2351, GRID grid.59734.3c, The Mindich Child Health and Development Institute, , Icahn School of Medicine at Mount Sinai, ; New York, 10029 NY USA
                [12 ]ISNI 0000 0001 0670 2351, GRID grid.59734.3c, Friedman Brain Institute, , Icahn School of Medicine at Mount Sinai, ; New York, 10029 NY USA
                [13 ]ISNI 0000 0004 0435 165X, GRID grid.16872.3a, Department of Functional Genomics, The Center for Neurogenomics and Cognitive Research, , VU University and VU Medical Center, ; Amsterdam, The Netherlands
                [14 ]ISNI 0000 0004 1754 9227, GRID grid.12380.38, Department of Molecular and Cellular Neurobiology, The Center for Neurogenomics and Cognitive Research, , VU University, ; Amsterdam, The Netherlands
                [15 ]ISNI 000000041936754X, GRID grid.38142.3c, Sleep Center, Brigham and Women’s Hospital, , Harvard Medical School, ; Boston, Massachusetts USA
                [16 ]ISNI 0000 0004 1936 7822, GRID grid.170205.1, Department of Human Genetics, , University of Chicago, ; Chicago, 60637 IL USA
                [17 ]ISNI 0000 0001 1034 1720, GRID grid.410711.2, Departments of Genetics and Psychiatry, , University of North Carolina, ; Chapel Hill, 27599-7264 North Carolina USA
                Article
                497
                10.1186/s13073-017-0497-y
                5738153
                29262854
                fe55e5ab-85af-48c1-9e19-6e9c3a8a1ec6
                © The Author(s) 2017

                Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License ( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

                History
                : 21 August 2017
                : 16 November 2017
                Funding
                Funded by: FundRef http://dx.doi.org/10.13039/100000025, National Institute of Mental Health;
                Award ID: R01MH105554
                Award Recipient :
                Funded by: FundRef http://dx.doi.org/10.13039/100000025, National Institute of Mental Health;
                Award ID: R01MH110555
                Award Recipient :
                Funded by: FundRef http://dx.doi.org/10.13039/501100000272, National Institute for Health Research;
                Award ID: R01 MH077139
                Award Recipient :
                Funded by: FundRef http://dx.doi.org/10.13039/100000002, National Institutes of Health;
                Award ID: R01 MH077139
                Award Recipient :
                Funded by: National Institutes of Health (US)
                Award ID: R01 MH077139
                Award Recipient :
                Funded by: FundRef http://dx.doi.org/10.13039/501100001711, Schweizerischer Nationalfonds zur Förderung der Wissenschaftlichen Forschung;
                Categories
                Research
                Custom metadata
                © The Author(s) 2017

                Molecular medicine
                de novo mutations,rare variants,schizophrenia,autism,developmental disorders,intellectual disability,epilepsy,hierarchical model

                Comments

                Comment on this article