3
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Genetic sensitivity analysis: Adjusting for genetic confounding in epidemiological associations

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Associations between exposures and outcomes reported in epidemiological studies are typically unadjusted for genetic confounding. We propose a two-stage approach for estimating the degree to which such observed associations can be explained by genetic confounding. First, we assess attenuation of exposure effects in regressions controlling for increasingly powerful polygenic scores. Second, we use structural equation models to estimate genetic confounding using heritability estimates derived from both SNP-based and twin-based studies. We examine associations between maternal education and three developmental outcomes – child educational achievement, Body Mass Index, and Attention Deficit Hyperactivity Disorder. Polygenic scores explain between 14.3% and 23.0% of the original associations, while analyses under SNP- and twin-based heritability scenarios indicate that observed associations could be almost entirely explained by genetic confounding. Thus, caution is needed when interpreting associations from non-genetically informed epidemiology studies. Our approach, akin to a genetically informed sensitivity analysis can be applied widely.

          Author summary

          An objective shared across the life, behavioural, and social sciences is to identify factors that increase risk for a particular disease or trait. However, identifying true risk factors is challenging. Often, a risk factor is statistically associated with a disease even if it is not really relevant, meaning that even successfully improving the risk factor will not impact the disease. One reason for the existence of such misleading associations stems from genetic confounding. This is when genetic factors influence directly both the risk factor and the disease, which generates a statistical association even in the absence of a true effect of the risk factor. Here, we propose a method to estimate genetic confounding and quantify its effect on observed associations. We show that a large part of the associations between maternal education and three child outcomes—educational achievement, body mass index and Attention-Deficit Hyperactivity Disorder—is explained by genetic confounding. Our findings can be applied to better understand the role of genetics in explaining associations of key risk factors with diseases and traits.

          Related collections

          Most cited references38

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          Second-generation PLINK: rising to the challenge of larger and richer datasets

          PLINK 1 is a widely used open-source C/C++ toolset for genome-wide association studies (GWAS) and research in population genetics. However, the steady accumulation of data from imputation and whole-genome sequencing studies has exposed a strong need for even faster and more scalable implementations of key functions. In addition, GWAS and population-genetic data now frequently contain probabilistic calls, phase information, and/or multiallelic variants, none of which can be represented by PLINK 1's primary data format. To address these issues, we are developing a second-generation codebase for PLINK. The first major release from this codebase, PLINK 1.9, introduces extensive use of bit-level parallelism, O(sqrt(n))-time/constant-space Hardy-Weinberg equilibrium and Fisher's exact tests, and many other algorithmic improvements. In combination, these changes accelerate most operations by 1-4 orders of magnitude, and allow the program to handle datasets too large to fit in RAM. This will be followed by PLINK 2.0, which will introduce (a) a new data format capable of efficiently representing probabilities, phase, and multiallelic variants, and (b) extensions of many functions to account for the new types of information. The second-generation versions of PLINK will offer dramatic improvements in performance and compatibility. For the first time, users without access to high-end computing resources can perform several essential analyses of the feature-rich and very large genetic datasets coming into use.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found

            LD Score regression distinguishes confounding from polygenicity in genome-wide association studies.

            Both polygenicity (many small genetic effects) and confounding biases, such as cryptic relatedness and population stratification, can yield an inflated distribution of test statistics in genome-wide association studies (GWAS). However, current methods cannot distinguish between inflation from a true polygenic signal and bias. We have developed an approach, LD Score regression, that quantifies the contribution of each by examining the relationship between test statistics and linkage disequilibrium (LD). The LD Score regression intercept can be used to estimate a more powerful and accurate correction factor than genomic control. We find strong evidence that polygenicity accounts for the majority of the inflation in test statistics in many GWAS of large sample size.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              An Atlas of Genetic Correlations across Human Diseases and Traits

              Identifying genetic correlations between complex traits and diseases can provide useful etiological insights and help prioritize likely causal relationships. The major challenges preventing estimation of genetic correlation from genome-wide association study (GWAS) data with current methods are the lack of availability of individual genotype data and widespread sample overlap among meta-analyses. We circumvent these difficulties by introducing a technique – cross-trait LD Score regression – for estimating genetic correlation that requires only GWAS summary statistics and is not biased by sample overlap. We use this method to estimate 276 genetic correlations among 24 traits. The results include genetic correlations between anorexia nervosa and schizophrenia, anorexia and obesity and associations between educational attainment and several diseases. These results highlight the power of genome-wide analyses, since there currently are no significantly associated SNPs for anorexia nervosa and only three for educational attainment.
                Bookmark

                Author and article information

                Contributors
                Role: ConceptualizationRole: Data curationRole: Formal analysisRole: Funding acquisitionRole: InvestigationRole: MethodologyRole: SoftwareRole: SupervisionRole: VisualizationRole: Writing – original draftRole: Writing – review & editing
                Role: ConceptualizationRole: Formal analysisRole: MethodologyRole: Writing – review & editing
                Role: ConceptualizationRole: Data curationRole: SoftwareRole: VisualizationRole: Writing – review & editing
                Role: MethodologyRole: Writing – review & editing
                Role: Data curationRole: Writing – review & editing
                Role: Data curationRole: Writing – review & editing
                Role: MethodologyRole: Writing – review & editing
                Role: ConceptualizationRole: Formal analysisRole: Funding acquisitionRole: MethodologyRole: SoftwareRole: SupervisionRole: Writing – original draftRole: Writing – review & editing
                Role: Editor
                Journal
                PLoS Genet
                PLoS Genet
                plos
                PLoS Genetics
                Public Library of Science (San Francisco, CA USA )
                1553-7390
                1553-7404
                11 June 2021
                June 2021
                : 17
                : 6
                : e1009590
                Affiliations
                [1 ] Department of Clinical, Educational and Health Psychology, University College London, London, United Kingdom
                [2 ] Social, Genetic, and Developmental Psychiatry Centre, King’s College London, London, United Kingdom
                [3 ] Department of Genetics and Genomic Sciences, Icahn School of Medicine, Mount Sinai, New York City, New York, United States of America
                [4 ] Department of Health Sciences, University of Leicester, Leicester, United Kingdom
                Case Western Reserve University, UNITED STATES
                Author notes

                The authors have declared that no competing interests exist.

                Author information
                https://orcid.org/0000-0003-2557-4716
                https://orcid.org/0000-0003-4762-2803
                https://orcid.org/0000-0003-2215-3238
                https://orcid.org/0000-0003-4985-8174
                https://orcid.org/0000-0002-6809-7908
                https://orcid.org/0000-0001-7515-0845
                https://orcid.org/0000-0002-8817-8908
                Article
                PGENETICS-D-20-01757
                10.1371/journal.pgen.1009590
                8238188
                34115765
                b1e4fb2d-d478-4cbc-9cac-6f7dd3c3b18e
                © 2021 Pingault et al

                This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

                History
                : 18 November 2020
                : 7 May 2021
                Page count
                Figures: 8, Tables: 3, Pages: 22
                Funding
                Funded by: funder-id http://dx.doi.org/10.13039/501100000265, Medical Research Council;
                Award ID: MR/M021475/1, previously G0901245
                Funded by: funder-id http://dx.doi.org/10.13039/100000002, National Institutes of Health;
                Award ID: AG046938
                Funded by: funder-id http://dx.doi.org/10.13039/501100000781, European Research Council;
                Award ID: 863981
                Award Recipient :
                Funded by: funder-id http://dx.doi.org/10.13039/501100000265, Medical Research Council;
                Award ID: MR/S037055/1
                Award Recipient :
                TEDS is supported by a programme grant from the UK Medical Research Council (MR/M021475/1 and previously G0901245, https://mrc.ukri.org/), with additional support from the US National Institutes of Health (AG046938, https://www.nih.gov/). This project has received funding from the European Research Council (ERC) attributed to JBP under the European Union’s Horizon 2020 research and innovation programme (grant agreement No. 863981, https://erc.europa.eu/). FD is funded by the MRC (MR/S037055/1, https://mrc.ukri.org/). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
                Categories
                Research Article
                Biology and Life Sciences
                Genetics
                Biology and Life Sciences
                Genetics
                Heredity
                Biology and Life Sciences
                Genetics
                Human Genetics
                Medicine and Health Sciences
                Epidemiology
                Genetic Epidemiology
                Medicine and Health Sciences
                Medical Conditions
                Neurodevelopmental Disorders
                Adhd
                Biology and Life Sciences
                Neuroscience
                Developmental Neuroscience
                Neurodevelopmental Disorders
                Adhd
                Medicine and Health Sciences
                Neurology
                Neurodevelopmental Disorders
                Adhd
                Medicine and Health Sciences
                Mental Health and Psychiatry
                Neuropsychiatric Disorders
                Adhd
                Biology and Life Sciences
                Computational Biology
                Genome Analysis
                Genome-Wide Association Studies
                Biology and Life Sciences
                Genetics
                Genomics
                Genome Analysis
                Genome-Wide Association Studies
                Biology and Life Sciences
                Genetics
                Human Genetics
                Genome-Wide Association Studies
                Biology and Life Sciences
                Developmental Biology
                Twins
                Social Sciences
                Sociology
                Education
                Educational Attainment
                Custom metadata
                vor-update-to-uncorrected-proof
                2021-06-28
                The dataset used to produce the results described in this manuscript will be made available, subject to a suitable data sharing agreement, in compliance with (EU) General Data Protection Regulations. Requests for the data used in this study should be made to the Twins Early Development Study (TEDS) at teds-project@ 123456kcl.ac.uk , and is not subject to the general TEDS data access policy: http://www.teds.ac.uk/researchers/teds-data-access-policy.

                Genetics
                Genetics

                Comments

                Comment on this article