6
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Fine-mapping across diverse ancestries drives the discovery of putative causal variants underlying human complex traits and diseases

      Preprint
      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Genome-wide association studies (GWAS) of human complex traits or diseases often implicate genetic loci that span hundreds or thousands of genetic variants, many of which have similar statistical significance. While statistical fine-mapping in individuals of European ancestries has made important discoveries, cross-population fine-mapping has the potential to improve power and resolution by capitalizing on the genomic diversity across ancestries. Here we present SuSiEx, an accurate and computationally efficient method for cross-population fine-mapping, which builds on the single-population fine-mapping framework, Sum of Single Effects (SuSiE). SuSiEx integrates data from an arbitrary number of ancestries, explicitly models population-specific allele frequencies and LD patterns, accounts for multiple causal variants in a genomic region, and can be applied to GWAS summary statistics. We comprehensively evaluated SuSiEx using simulations, a range of quantitative traits measured in both UK Biobank and Taiwan Biobank, and schizophrenia GWAS across East Asian and European ancestries. In all evaluations, SuSiEx fine-mapped more association signals, produced smaller credible sets and higher posterior inclusion probability (PIP) for putative causal variants, and captured population-specific causal variants.

          Related collections

          Most cited references45

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          A global reference for human genetic variation

          The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            Second-generation PLINK: rising to the challenge of larger and richer datasets

            PLINK 1 is a widely used open-source C/C++ toolset for genome-wide association studies (GWAS) and research in population genetics. However, the steady accumulation of data from imputation and whole-genome sequencing studies has exposed a strong need for even faster and more scalable implementations of key functions. In addition, GWAS and population-genetic data now frequently contain probabilistic calls, phase information, and/or multiallelic variants, none of which can be represented by PLINK 1's primary data format. To address these issues, we are developing a second-generation codebase for PLINK. The first major release from this codebase, PLINK 1.9, introduces extensive use of bit-level parallelism, O(sqrt(n))-time/constant-space Hardy-Weinberg equilibrium and Fisher's exact tests, and many other algorithmic improvements. In combination, these changes accelerate most operations by 1-4 orders of magnitude, and allow the program to handle datasets too large to fit in RAM. This will be followed by PLINK 2.0, which will introduce (a) a new data format capable of efficiently representing probabilities, phase, and multiallelic variants, and (b) extensions of many functions to account for the new types of information. The second-generation versions of PLINK will offer dramatic improvements in performance and compatibility. For the first time, users without access to high-end computing resources can perform several essential analyses of the feature-rich and very large genetic datasets coming into use.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              UK Biobank: An Open Access Resource for Identifying the Causes of a Wide Range of Complex Diseases of Middle and Old Age

              Cathie Sudlow and colleagues describe the UK Biobank, a large population-based prospective study, established to allow investigation of the genetic and non-genetic determinants of the diseases of middle and old age.
                Bookmark

                Author and article information

                Journal
                medRxiv
                MEDRXIV
                medRxiv
                Cold Spring Harbor Laboratory
                09 July 2023
                : 2023.01.07.23284293
                Affiliations
                [1 ]Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
                [2 ]Stanley Center for Psychiatric Research, the Broad Institute of MIT and Harvard, Cambridge, MA, USA
                [3 ]Department of Medicine, Harvard Medical School, Boston, MA, USA
                [4 ]MRC Centre for Neuropsychiatric Genetics and Genomics, Cardiff University School of Medicine, Cardiff, UK
                [5 ]Center for Neuropsychiatric Research, National Health Research Institutes, Miaoli, Taiwan
                [6 ]Center for Medical Genetics & Hunan Key Laboratory of Medical Genetics, School of Life Sciences and Department of Psychiatry, The Second Xiangya Hospital, Central South University, Changsha, China
                [7 ]Human Genetics, Genome Institute of Singapore, Singapore, Singapore
                [8 ]Division of Psychiatry Research, the Zucker Hillside Hospital, Northwell Health, Glen Oaks, NY, USA
                [9 ]Research Division Institute of Mental Health Singapore, Singapore, Singapore
                [10 ]Digital Health China Technologies Corp. Ltd., Beijing, China
                [11 ]Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan
                [12 ]Department of Public Health & Medical Humanities, School of Medicine, National Yang Ming Chiao Tung University, Taipei, Taiwan
                [13 ]Institute of Behavioral Medicine, College of Medicine, National Cheng Kung University
                [14 ]Biogen, Cambridge, MA, USA
                [15 ]Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
                [16 ]Department of Psychiatry, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
                Author notes
                [§]

                A list of members is available in the supplementary information

                [*]

                These authors jointly supervised this work.

                AUTHOR CONTRIBUTIONS

                T.G. and H.H. designed the project; T.G. developed the statistical methods. K.Y. and T.G. programmed the code for SuSiEx; K.Y., R.J.L. and T.G. conducted simulation studies; K.Y., R.J.L. and M.Y. performed the biobank fine-mapping analysis; T.T.C., S.C.L., Y.A.F., Y.F.L. and C.Y.C. performed the analysis in the Taiwan Biobank; K.Y. and A.F.P. performed the analysis in the schizophrenia cohorts; M.J.D. and B.M.N. provided critical suggestions for the study design; M.Y., Y.C., M.L., R.L. and Y.X. took part in the testing of the code; M.O.D. and Z.G. made significant contributions to the generation and management of schizophrenia data. K.Y., T.G. and H.H. wrote the manuscript; All the authors reviewed and approved the final version of the manuscript.

                Article
                10.1101/2023.01.07.23284293
                9882563
                36711496
                e04a15c7-02aa-4c70-9027-29b95044acb3

                This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.

                History
                Categories
                Article

                Comments

                Comment on this article