45
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Genome-wide association study identifies 143 loci associated with 25 hydroxyvitamin D concentration

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Vitamin D deficiency is a candidate risk factor for a range of adverse health outcomes. In a genome-wide association study of 25 hydroxyvitamin D (25OHD) concentration in 417,580 Europeans we identify 143 independent loci in 112 1-Mb regions, providing insights into the physiology of vitamin D and implicating genes involved in lipid and lipoprotein metabolism, dermal tissue properties, and the sulphonation and glucuronidation of 25OHD. Mendelian randomization models find no robust evidence that 25OHD concentration has causal effects on candidate phenotypes (e.g. BMI, psychiatric disorders), but many phenotypes have (direct or indirect) causal effects on 25OHD concentration, clarifying the epidemiological relationship between 25OHD status and the health outcomes examined in this study.

          Abstract

          Vitamin D is a precursor of the steroid hormone 1,25-dihydroxyvitamin D3, and its deficiency is associated with many adverse health outcomes. Here, Revez et al. perform a genome-wide association study for circulating 25-hydroxyvitamin D in 417,580 individuals and test for potential causal relationships with other traits using Mendelian randomization.

          Related collections

          Most cited references36

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          GWAS on family history of Alzheimer’s disease

          Alzheimer’s disease (AD) is a public health priority for the 21st century. Risk reduction currently revolves around lifestyle changes with much research trying to elucidate the biological underpinnings. We show that self-report of parental history of Alzheimer’s dementia for case ascertainment in a genome-wide association study of 314,278 participants from UK Biobank (27,696 maternal cases, 14,338 paternal cases) is a valid proxy for an AD genetic study. After meta-analysing with published consortium data (n = 74,046 with 25,580 cases across the discovery and replication analyses), three new AD-associated loci (P < 5 × 10−8) are identified. These contain genes relevant for AD and neurodegeneration: ADAM10, BCKDK/KAT8 and ACE. Novel gene-based loci include drug targets such as VKORC1 (warfarin dose). We report evidence that the association of SNPs in the TOMM40 gene with AD is potentially mediated by both gene expression and DNA methylation in the prefrontal cortex. However, it is likely that multiple variants are affecting the trait and gene methylation/expression. Our discovered loci may help to elucidate the biological mechanisms underlying AD and, as they contain genes that are drug targets for other diseases and disorders, warrant further exploration for potential precision medicine applications.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            A resource-efficient tool for mixed model association analysis of large-scale data

              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Genome-wide association meta-analysis of 78,308 individuals identifies new loci and genes influencing human intelligence

              Intelligence is associated with important economic and health-related life outcomes 1 . Despite substantial heritability 2 (0.54) and confirmed polygenic nature, initial genetic studies were mostly underpowered 3–5 . Here we report a meta-analysis for intelligence of 78,308 individuals. We identify 336 single nucleotide polymorphisms (SNPs) (METAL P 500,000 participants. All participants provided written informed consent; the UK Biobank received ethical approval from the National Research Ethics Service Committee North West–Haydock (reference 11/NW/0382), and all study procedures were performed in accordance with the World Medical Association Declaration of Helsinki ethical principles for medical research. The current study was conducted under the UK Biobank application number 16406. The study design of the UK Biobank has been described in detail elsewhere 35,36 . Briefly, invitation letters were sent out in 2006–2010 to ~9.2 million individuals including all people aged 40–69 years who were registered with the National Health Service and living up to ~25 miles from one of the 22 study assessment centers. A total of 503,325 participants were subsequently recruited into the study 35 . Apart from registry based phenotypic information, extensive self-reported baseline data have been collected by questionnaire, in addition to anthropometric assessments and DNA collection. For the present study we used imputed data obtained from UK Biobank (May 2015 release) including ~73 million genetic variants in 152,249 individuals. Details on the data are provided elsewhere (see URLs). In summary, the first ~50,000 samples were genotyped on the UK BiLEVE Axiom array, and the remaining ~100,000 samples were genotyped on the UK Biobank Axiom array. After standard quality control of the SNPs and samples, which was centrally performed by UK Biobank, the dataset comprised 641,018 autosomal SNPs in 152,256 samples for phasing and imputation. Imputation was performed with a reference panel that included the UK10K haplotype panel and the 1000 Genomes Project Phase 3 reference panel. We used two fluid intelligence phenotypes from the Biobank data set. These are based on questionnaires that were taken either in the assessment center at the initial intake (‘touchscreen’, field 20016) or at a later moment at home (‘web-based’, field 20191). The measures indicate the number of correct answers out of 13 fluid intelligence questions. The data distribution roughly approximates a normal distribution. For the analyses in our study, we only included individuals of Caucasian descent. After removal of related individuals, discordant sex, withdrawn consent, and missing phenotype data, 36,257 individuals remained for analysis for the fluid intelligence touchscreen measure and 28,846 for the web-based version. As 10,984 individuals had taken both the touchscreen and the web-based test, we only included the data from the touchscreen test for these individuals. This resulted in 54,119 individuals with a score on either the fluid intelligence web-based (UKB-wb) or touchscreen (UKB-ts) version (Supplementary Table 1). At the time of taking the test, participants’ ages ranged between 40 and 78. Half of the participants were between 40 and 60 years old, 44% between 60 and 70 and 6% were older than 70. The mean age was 58.98 with a standard deviation of 8.19. Summary statistics from CHIC consortium We downloaded the publicly available combined GWAS results from the meta-analyses as reported by CHIC 5 (see URLs). Details on the included cohorts and performed analyses are reported in the original publication 5 . Briefly, CHIC includes 6 cohorts totaling 12,441 individuals: the Avon Longitudinal Study of Parents and Children (ALSPAC, N = 5,517), the Lothian Birth Cohorts of 1921 and 1936 (LBC1921, N = 464; LBC1936, N = 947), the Brisbane Adolescent Twin Study subsample of Queensland Institute of Medical Research (QIMR, N = 1,752), the Western Australian Pregnancy Cohort Study (Raine, N = 936), and the Twins Early Development Study (TEDS, N = 2,825). All individuals are children aged between 6–18 years. Within each cohort the cognitive performance measure was adjusted for sex and age and principal components were included to adjust for population stratification. See also Supplementary Table 1. Full GWAS data from additional cohorts We used the same additional (non-CHIC) cohorts as described in detail in ref. 7 , which included 11,748 individuals from 5 cohorts. In ref. 7 , results were only reported for 69 SNPs, as these served as a secondary analysis for a look-up effort. In the current study we use the full genome-wide results from these cohorts. GWAS were conducted in 2013 and summary statistics were obtained from the PIs of the 5 cohorts. The quality control protocol entailed excluding SNPs with MAF 0.01. Positional annotations for all lead SNPs and SNPs in LD with the lead SNPs were obtained by performing ANNOVAR gene-based annotation using refSeq genes. In addition, CADD scores 38 , and RegulomeDB 15 scores were annotated to SNPs by matching chromosome, position, reference and alternative alleles. For each SNP eQTLs were extracted from GTEx (44 tissue types) 39 , Blood eQTL browser 40 and BIOS gene-level eQTLs 41 . The eQTLs obtained from GTEx were filtered on gene P-value < 0.05 and eQTLs obtained from the other two databases were filtered on FDR < 0.05. The FDR values were provided by GTEx, BIOS and Blood eQTL browser. For GTEx eQTLs, there is one FDR value available per gene-tissue pair. As such, the FDR is identical for all eQTLs belonging to the same gene-tissue pair. For BIOS and Blood eQTL browser, an FDR value was computed per SNP. To test whether the SNPs were functionally active by means of histone modifications, we obtained epigenetic data from the NIH Roadmap Epigenomics Mapping Consortium 42 and ENCODE 43 . For every 200bp of the genome a 15-core chromatin state was predicted by a Hidden Markov Model based on 5 histone marks (i.e. H3K4me3, H3K4me1, H3K27me3, H3K9me3, and H3K36me3) for 127 tissue/cell types 44 . We annotated chromatin states (15 states in total) to SNPs by matching chromosome and position for every tissue/cell type. We computed the minimum state (1: the most active state) and the consensus state (majority of states) across 127 tissue/cell types for each SNP. Chromatin states were also determined for the 52 genes (47 from the gene-based test + 5 additional genes implicated by single SNP GWAS). For each gene and tissue, the chromatin state was obtained per 200 bp interval in the gene. We then annotated the genes by means of a consensus decision when multiple states were present for a single gene; i.e. the state of the gene was defined as the modus of all states present in the gene. Tissue expression of genes RNA sequencing data of 1,641 tissue samples with 45 unique tissue labels was derived from the GTEx consortium 39 . This set includes 313 brain samples over 13 unique brain regions (see Supplementary Table 18 for sample size per tissue). Of the 52 genes implicated by either the GWAS or the GWGWAS, 44 were included in the GTEx data. Normalization of the data was performed as described previously 45 . Briefly, genes with RPKM (Reads Per Kilobase Million) value smaller than 0.1 in at least 80% of the samples were removed. The remaining genes were log2 transformed (after using a pseudocount of 1), and finally a zero-mean normalization was applied. Proxy-replication in educational attainment For the replication analysis we used a subset of the data from ref. 21. In particular, we excluded the Erasmus Rucphen Family, the Minnesota Center for Twin and Family Research Study, the Swedish Twin Registry Study, the 23andMe data and all individuals from UK Biobank, to make sure there was no sample overlap with our IQ dataset. Genetic correlation between intelligence and EA in this non-overlapping subsample was rg=0.73, SE=0.03, P=1.4×10−163. The replication analysis was based on the phenotype EduYears, which measures the number of years of schooling completed. A total of 306 out of our 336 top SNPs (and 16 out of 18 independent lead SNPs) was available in the educational attainment sample. We performed a sign concordance analysis for the 16 independent lead SNPs, using the exact binomial test. For each independent signal we determined whether either the lead SNP had a P-value smaller than 0.05/16 in the educational attainment analysis, or another (correlated) top SNP in the same locus if this was not the case. All 47 genes implicated in the GWGAS for intelligence were available for look-up in the EA sample. For each gene we determined whether it had a P-value smaller than 0.05/47 in the EA analysis. Polygenic Risk Score analysis We used LDpred 16 to calculate the variance explained in intelligence in independent samples by a polygenic risk score based on our discovery analysis, as well as based on two previous GWAS studies for intelligence 5,6 . LDpred adjusts GWAS summary statistics for the effects of linkage disequilibrium (LD) by using an approximate Gibbs sampler that calculates posterior means of effects, conditional on LD information, when calculating polygenic risk scores. We used varying priors for the fraction of SNPs with non-zero effects (prior: 0.01, 0.05, 0.1, 0.5, 1, and an infinitesimal prior). Independent datasets available for PRS analyses are described in the Supplementary Note. Supplementary Material 1 2 3 4
                Bookmark

                Author and article information

                Contributors
                naomi.wray@uq.edu.au
                j.mcgrath@uq.edu.au
                Journal
                Nat Commun
                Nat Commun
                Nature Communications
                Nature Publishing Group UK (London )
                2041-1723
                2 April 2020
                2 April 2020
                2020
                : 11
                : 1647
                Affiliations
                [1 ]ISNI 0000 0000 9320 7537, GRID grid.1003.2, Institute for Molecular Bioscience, , The University of Queensland, ; Brisbane, QLD Australia
                [2 ]ISNI 0000 0000 9320 7537, GRID grid.1003.2, Queensland Brain Institute, , The University of Queensland, ; Brisbane, QLD Australia
                [3 ]ISNI 0000 0004 0606 3563, GRID grid.417162.7, Queensland Centre for Mental Health Research, , The Park Centre for Mental Health, ; Wacol, QLD Australia
                [4 ]ISNI 0000 0001 2294 1395, GRID grid.1049.c, QIMR Berghofer Medical Research Institute, ; Brisbane, QLD Australia
                [5 ]ISNI 0000000089150953, GRID grid.1024.7, School of Biomedical Sciences, Faculty of Health, and Institute of Health and Biomedical Innovation, , Queensland University of Technology, ; Brisbane, QLD Australia
                [6 ]ISNI 0000 0001 0348 3990, GRID grid.268099.c, Institute for Advanced Research, , Wenzhou Medical University, ; Wenzhou, Zhejiang 325027 China
                [7 ]ISNI 0000 0001 1956 2722, GRID grid.7048.b, National Centre for Register-based Research, , Aarhus University, ; Aarhus, Denmark
                Author information
                http://orcid.org/0000-0003-3204-5396
                http://orcid.org/0000-0002-5981-1911
                http://orcid.org/0000-0002-0285-0426
                http://orcid.org/0000-0001-8801-5220
                http://orcid.org/0000-0002-6137-3391
                http://orcid.org/0000-0003-1494-6772
                http://orcid.org/0000-0003-3502-9789
                http://orcid.org/0000-0002-9050-1516
                http://orcid.org/0000-0003-4069-8020
                http://orcid.org/0000-0002-2143-8760
                http://orcid.org/0000-0003-2001-2474
                http://orcid.org/0000-0001-7421-3357
                http://orcid.org/0000-0002-4792-6068
                Article
                15421
                10.1038/s41467-020-15421-7
                7118120
                32242144
                b3e98e17-28f2-435b-ae92-a68fab09a88f
                © The Author(s) 2020

                Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

                History
                : 27 November 2019
                : 3 March 2020
                Categories
                Article
                Custom metadata
                © The Author(s) 2020

                Uncategorized
                genome-wide association studies,calcium and vitamin d,psychiatric disorders,risk factors

                Comments

                Comment on this article