An atlas connecting shared genetic architecture of human diseases and molecular phenotypes provides insight into COVID-19 susceptibility

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

While genome-wide associations studies (GWAS) have successfully elucidated the genetic architecture of complex human traits and diseases, understanding mechanisms that lead from genetic variation to pathophysiology remains an important challenge. Methods are needed to systematically bridge this crucial gap to facilitate experimental testing of hypotheses and translation to clinical utility. Here, we leveraged cross-phenotype associations to identify traits with shared genetic architecture, using linkage disequilibrium (LD) information to accurately capture shared SNPs by proxy, and calculate significance of enrichment. This shared genetic architecture was examined across differing biological scales through incorporating data from catalogs of clinical, cellular, and molecular GWAS. We have created an interactive web database (interactive Cross-Phenotype Analysis of GWAS database (iCPAGdb); http://cpag.oit.duke.edu) to facilitate exploration and allow rapid analysis of user-uploaded GWAS summary statistics. This database revealed well-known relationships among phenotypes, as well as the generation of novel hypotheses to explain the pathophysiology of common diseases. Application of iCPAGdb to a recent GWAS of severe COVID-19 demonstrated unexpected overlap of GWAS signals between COVID-19 and human diseases, including with idiopathic pulmonary fibrosis driven by the DPP9 locus. Transcriptomics from peripheral blood of COVID-19 patients demonstrated that DPP9 was induced in SARS-CoV-2 compared to healthy controls or those with bacterial infection. Further investigation of cross-phenotype SNPs with severe COVID-19 demonstrated colocalization of the GWAS signal of the ABO locus with plasma protein levels of a reported receptor of SARS-CoV-2, CD209 (DC-SIGN), pointing to a possible mechanism whereby glycosylation of CD209 by ABO may regulate COVID-19 disease severity. Thus, connecting genetically related traits across phenotypic scales links human diseases to molecular and cellular measurements that can reveal mechanisms and lead to novel biomarkers and therapeutic approaches.

Related collections

Most cited references 101

Record: found
Abstract: not found
Article: not found

Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing

Yoav Benjamini, Yosef Hochberg (1995)

0 comments Cited 24363 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

STAR: ultrafast universal RNA-seq aligner.

Alexander Dobin, Carrie A. Davis, Felix Schlesinger … (2013)

Accurate alignment of high-throughput RNA-seq data is a challenging and yet unsolved problem because of the non-contiguous transcript structure, relatively short read lengths and constantly increasing throughput of the sequencing technologies. Currently available RNA-seq aligners suffer from high mapping error rates, low mapping speed, read length limitation and mapping biases. To align our large (>80 billon reads) ENCODE Transcriptome RNA-seq dataset, we developed the Spliced Transcripts Alignment to a Reference (STAR) software based on a previously undescribed RNA-seq alignment algorithm that uses sequential maximum mappable seed search in uncompressed suffix arrays followed by seed clustering and stitching procedure. STAR outperforms other aligners by a factor of >50 in mapping speed, aligning to the human genome 550 million 2 × 76 bp paired-end reads per hour on a modest 12-core server, while at the same time improving alignment sensitivity and precision. In addition to unbiased de novo detection of canonical junctions, STAR can discover non-canonical splices and chimeric (fusion) transcripts, and is also capable of mapping full-length RNA sequences. Using Roche 454 sequencing of reverse transcription polymerase chain reaction amplicons, we experimentally validated 1960 novel intergenic splice junctions with an 80-90% success rate, corroborating the high precision of the STAR mapping strategy. STAR is implemented as a standalone C++ code. STAR is free open source software distributed under GPLv3 license and can be downloaded from http://code.google.com/p/rna-star/.

0 comments Cited 14041 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: found

Is Open Access

limma powers differential expression analyses for RNA-sequencing and microarray studies

Matthew E. Ritchie, Belinda Phipson, Di Wu … (2015)

limma is an R/Bioconductor software package that provides an integrated solution for analysing data from gene expression experiments. It contains rich features for handling complex experimental designs and for information borrowing to overcome the problem of small sample sizes. Over the past decade, limma has been a popular choice for gene discovery through differential expression analyses of microarray and high-throughput PCR data. The package contains particularly strong facilities for reading, normalizing and exploring such data. Recently, the capabilities of limma have been significantly expanded in two important directions. First, the package can now perform both differential expression and differential splicing analyses of RNA sequencing (RNA-seq) data. All the downstream analysis tools previously restricted to microarray data are now available for RNA-seq as well. These capabilities allow users to analyse both RNA-seq and microarray data with very similar pipelines. Second, the package is now able to go past the traditional gene-wise expression analyses in a variety of ways, analysing expression profiles in terms of co-regulated sets of genes or in terms of higher-order expression signatures. This provides enhanced possibilities for biological interpretation of gene expression differences. This article reviews the philosophy and design of the limma package, summarizing both new and historical features, with an emphasis on recent enhancements and features that have not been previously described.

0 comments Cited 11398 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Journal

Journal ID (nlm-ta): medRxiv

Journal ID (publisher-id): MEDRXIV

Title: medRxiv

Publisher: Cold Spring Harbor Laboratory

Publication date (Electronic): 22 December 2020

Electronic Location Identifier: 2020.12.20.20248572

Affiliations

[1 ]Department of Molecular Genetics and Microbiology, School of Medicine, Duke University, Durham, NC 27710, USA

[2 ]Duke Research Computing, Duke University, Durham, NC 27710, USA

[3 ]Center for Applied Genomics and Precision Medicine, Department of Medicine, Duke University, Durham, NC 27710, USA.

[4 ]Durham Veterans Affairs Health Care System, Durham, NC 27705, USA.

[5 ]Division of Infectious Diseases, Department of Medicine, Duke University Medical Center, Durham, NC 27710, USA.

[6 ]Department of Hospital Medicine, Duke Regional Hospital, Durham, NC, 27705, USA.

[7 ]Department of Biomedical Engineering, Woo Center for Big Data and Precision Health, Duke University, Durham, NC 27710, USA.

[8 ]Duke Molecular Physiology Institute and Department of Biostatistics and Bioinformatics, Duke University Medical Center Durham, NC 27710, USA

[9 ]Cooperative Studies Program Epidemiology Center-Durham, Durham VA Health Care System, Durham, NC 27705, USA

[10 ]Lead contact

Author notes

Author Contributions

LW and DCK conceived of the study. LW, TJB, ERH, AI, MRD, ERH, and DCK developed iCPAGdb. LW, TJB, FJC and RH carried out computational analysis. LW, ALA, and DCK analyzed iCPAGdb results. MTM, FJC, RH, TWB, XS, GSG, ELT, ERK, and CWW carried out the COVID-19 transcriptomics study and helped design subsequent analysis carried out by LW. All authors contributed to the manuscript.

[* ]To whom correspondence should be addressed: Dennis C. Ko, 0049 CARL Building Box 3053, 213 Research Drive, Durham, NC 27710. 919-684-5834. dennis.ko@ 123456duke.edu . @denniskoHiHOST

Article

DOI: 10.1101/2020.12.20.20248572

PMC ID: 7781346

PubMed ID: 33398303

SO-VID: 314530c8-13f3-47f6-8c20-39b427b7dda7

License:

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.

An atlas connecting shared genetic architecture of human diseases and molecular phenotypes provides insight into COVID-19 susceptibility

Read this article at

Abstract

Related collections

Novel Coronavirus Disease COVID-19

Most cited references 101

Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing

STAR: ultrafast universal RNA-seq aligner.

limma powers differential expression analyses for RNA-sequencing and microarray studies

Author and article information

Journal

Affiliations

Author notes

Article

History

Categories

Comments

Comment on this article

Similar content 26

Cited by 6

Most referenced authors 4,613