16
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Robust methods in Mendelian randomization via penalization of heterogeneous causal estimates

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Methods have been developed for Mendelian randomization that can obtain consistent causal estimates under weaker assumptions than the standard instrumental variable assumptions. The median-based estimator and MR-Egger are examples of such methods. However, these methods can be sensitive to genetic variants with heterogeneous causal estimates. Such heterogeneity may arise from over-dispersion in the causal estimates, or specific variants with outlying causal estimates. In this paper, we develop three extensions to robust methods for Mendelian randomization with summarized data: 1) robust regression (MM-estimation); 2) penalized weights; and 3) Lasso penalization. Methods using these approaches are considered in two applied examples: one where there is evidence of over-dispersion in the causal estimates (the causal effect of body mass index on schizophrenia risk), and the other containing outliers (the causal effect of low-density lipoprotein cholesterol on Alzheimer’s disease risk). Through an extensive simulation study, we demonstrate that robust regression applied to the inverse-variance weighted method with penalized weights is a worthwhile additional sensitivity analysis for Mendelian randomization to provide robustness to variants with outlying causal estimates. The results from the applied examples and simulation study highlight the importance of using methods that make different assumptions to assess the robustness of findings from Mendelian randomization investigations with multiple genetic variants.

          Related collections

          Most cited references20

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          Assessing the suitability of summary data for two-sample Mendelian randomization analyses using MR-Egger regression: the role of the I2 statistic

          Background MR-Egger regression has recently been proposed as a method for Mendelian randomization (MR) analyses incorporating summary data estimates of causal effect from multiple individual variants, which is robust to invalid instruments. It can be used to test for directional pleiotropy and provides an estimate of the causal effect adjusted for its presence. MR-Egger regression provides a useful additional sensitivity analysis to the standard inverse variance weighted (IVW) approach that assumes all variants are valid instruments. Both methods use weights that consider the single nucleotide polymorphism (SNP)-exposure associations to be known, rather than estimated. We call this the `NO Measurement Error’ (NOME) assumption. Causal effect estimates from the IVW approach exhibit weak instrument bias whenever the genetic variants utilized violate the NOME assumption, which can be reliably measured using the F-statistic. The effect of NOME violation on MR-Egger regression has yet to be studied. Methods An adaptation of the I 2 statistic from the field of meta-analysis is proposed to quantify the strength of NOME violation for MR-Egger. It lies between 0 and 1, and indicates the expected relative bias (or dilution) of the MR-Egger causal estimate in the two-sample MR context. We call it I G X 2 . The method of simulation extrapolation is also explored to counteract the dilution. Their joint utility is evaluated using simulated data and applied to a real MR example. Results In simulated two-sample MR analyses we show that, when a causal effect exists, the MR-Egger estimate of causal effect is biased towards the null when NOME is violated, and the stronger the violation (as indicated by lower values of I G X 2 ), the stronger the dilution. When additionally all genetic variants are valid instruments, the type I error rate of the MR-Egger test for pleiotropy is inflated and the causal effect underestimated. Simulation extrapolation is shown to substantially mitigate these adverse effects. We demonstrate our proposed approach for a two-sample summary data MR analysis to estimate the causal effect of low-density lipoprotein on heart disease risk. A high value of I G X 2 close to 1 indicates that dilution does not materially affect the standard MR-Egger analyses for these data. Conclusions Care must be taken to assess the NOME assumption via the I G X 2 statistic before implementing standard MR-Egger regression in the two-sample summary data context. If I G X 2 is sufficiently low (less than 90%), inferences from the method should be interpreted with caution and adjustment methods considered.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Using published data in Mendelian randomization: a blueprint for efficient identification of causal risk factors

            Finding individual-level data for adequately-powered Mendelian randomization analyses may be problematic. As publicly-available summarized data on genetic associations with disease outcomes from large consortia are becoming more abundant, use of published data is an attractive analysis strategy for obtaining precise estimates of the causal effects of risk factors on outcomes. We detail the necessary steps for conducting Mendelian randomization investigations using published data, and present novel statistical methods for combining data on the associations of multiple (correlated or uncorrelated) genetic variants with the risk factor and outcome into a single causal effect estimate. A two-sample analysis strategy may be employed, in which evidence on the gene-risk factor and gene-outcome associations are taken from different data sources. These approaches allow the efficient identification of risk factors that are suitable targets for clinical intervention from published data, although the ability to assess the assumptions necessary for causal inference is diminished. Methods and guidance are illustrated using the example of the causal effect of serum calcium levels on fasting glucose concentrations. The estimated causal effect of a 1 standard deviation (0.13 mmol/L) increase in calcium levels on fasting glucose (mM) using a single lead variant from the CASR gene region is 0.044 (95 % credible interval −0.002, 0.100). In contrast, using our method to account for the correlation between variants, the corresponding estimate using 17 genetic variants is 0.022 (95 % credible interval 0.009, 0.035), a more clearly positive causal effect. Electronic supplementary material The online version of this article (doi:10.1007/s10654-015-0011-z) contains supplementary material, which is available to authorized users.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              Evaluating the potential role of pleiotropy in Mendelian randomization studies

              Abstract Pleiotropy, the phenomenon of a single genetic variant influencing multiple traits, is likely widespread in the human genome. If pleiotropy arises because the single nucleotide polymorphism (SNP) influences one trait, which in turn influences another (‘vertical pleiotropy’), then Mendelian randomization (MR) can be used to estimate the causal influence between the traits. Of prime focus among the many limitations to MR is the unprovable assumption that apparent pleiotropic associations are mediated by the exposure (i.e. reflect vertical pleiotropy), and do not arise due to SNPs influencing the two traits through independent pathways (‘horizontal pleiotropy’). The burgeoning treasure trove of genetic associations yielded through genome wide association studies makes for a tantalizing prospect of phenome-wide causal inference. Recent years have seen substantial attention devoted to the problem of horizontal pleiotropy, and in this review we outline how newly developed methods can be used together to improve the reliability of MR.
                Bookmark

                Author and article information

                Contributors
                Role: Formal analysisRole: MethodologyRole: Writing – original draft
                Role: InvestigationRole: Writing – review & editing
                Role: InvestigationRole: Writing – review & editing
                Role: ConceptualizationRole: Data curationRole: MethodologyRole: SupervisionRole: Writing – original draft
                Role: Editor
                Journal
                PLoS One
                PLoS ONE
                plos
                plosone
                PLoS ONE
                Public Library of Science (San Francisco, CA USA )
                1932-6203
                2019
                23 September 2019
                : 14
                : 9
                : e0222362
                Affiliations
                [1 ] Cardiovascular Epidemiology Unit, Department of Public Health and Primary Care, University of Cambridge, Cambridge, CB1 8RN, United Kingdom
                [2 ] Edinburgh Clinical Trials Unit, Usher Institute of Population Health Sciences and Informatics, University of Edinburgh, Edinburgh, EH16 4UX, United Kingdom
                [3 ] Department of Health Sciences, University of Leicester, Leicester, LE1 7RH, United Kingdom
                [4 ] MRC Biostatistics Unit, University of Cambridge, Cambridge, CB2 0SR, United Kingdom
                Universitair Medisch Centrum Utrecht, NETHERLANDS
                Author notes

                Competing Interests: I have read the journal’s policy and the authors of this manuscript have the following competing interests: Stephen Burgess is a paid statistical reviewer for PLOS Medicine. This does not alter our adherence to PLOS ONE policies on sharing data and materials.

                Author information
                http://orcid.org/0000-0001-5365-8760
                Article
                PONE-D-19-09699
                10.1371/journal.pone.0222362
                6756542
                31545794
                f2e269d5-c170-4817-8d01-eb9839a7c403
                © 2019 Rees et al

                This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

                History
                : 6 April 2019
                : 27 August 2019
                Page count
                Figures: 6, Tables: 5, Pages: 24
                Funding
                Funded by: funder-id http://dx.doi.org/10.13039/100004440, Wellcome Trust;
                Award ID: 204623/Z/16/Z
                Award Recipient :
                Stephen Burgess is supported by Sir Henry Dale Fellowship jointly funded by the Wellcome Trust and the Royal Society (Grant Number 204623/Z/16/Z).
                Categories
                Research Article
                Medicine and Health Sciences
                Mental Health and Psychiatry
                Dementia
                Alzheimer's Disease
                Medicine and Health Sciences
                Neurology
                Dementia
                Alzheimer's Disease
                Medicine and Health Sciences
                Neurology
                Neurodegenerative Diseases
                Alzheimer's Disease
                Biology and Life Sciences
                Genetics
                Genetics of Disease
                Genetic Predisposition
                Medicine and Health Sciences
                Mental Health and Psychiatry
                Schizophrenia
                Research and Analysis Methods
                Simulation and Modeling
                Biology and Life Sciences
                Biochemistry
                Proteins
                Lipoproteins
                Medicine and Health Sciences
                Epidemiology
                Medical Risk Factors
                Research and Analysis Methods
                Research Assessment
                Research Errors
                Research and Analysis Methods
                Mathematical and Statistical Techniques
                Statistical Methods
                Instrumental Variable Analysis
                Physical Sciences
                Mathematics
                Statistics
                Statistical Methods
                Instrumental Variable Analysis
                Custom metadata
                All data can be found in the PhenoScanner database ( http://www.phenoscanner.medschl.cam.ac.uk/). R code for performing the approaches outlined in the paper, and extracting genetic association estimates are found in S1 Appendix.

                Uncategorized
                Uncategorized

                Comments

                Comment on this article