55
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: not found
      • Article: not found

      Efficiently controlling for case-control imbalance and sample relatedness in large-scale genetic association studies

      Read this article at

      ScienceOpenPublisher
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          In genome-wide association studies (GWAS) for thousands of phenotypes in large biobanks, most binary traits have substantially fewer cases than controls. Both of the widely used approaches, linear mixed model and the recently proposed logistic mixed model, perform poorly - producing large type I error rates - in the analysis of unbalanced case-control phenotypes. Here we propose a scalable and accurate generalized mixed model association test that uses the saddlepoint approximation to calibrate the distribution of score test statistics. This method, SAIGE, provides accurate p-values even when case-control ratios are extremely unbalanced. It utilizes state-of-art optimization strategies to reduce computational cost, and hence is applicable to GWAS for thousands of phenotypes by large biobanks. Through the analysis of UK Biobank data of 408,961 white British European-ancestry samples for >1400 binary phenotypes, we show that SAIGE can efficiently analyze large sample data, controlling for unbalanced case-control ratios and sample relatedness.

          Related collections

          Most cited references16

          • Record: found
          • Abstract: not found
          • Article: not found

          Methods of conjugate gradients for solving linear systems

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Approximate Inference in Generalized Linear Mixed Models

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Approximate Inference in Generalized Linear Mixed Models

                Bookmark

                Author and article information

                Journal
                Nature Genetics
                Nat Genet
                Springer Nature America, Inc
                1061-4036
                1546-1718
                August 13 2018
                Article
                10.1038/s41588-018-0184-y
                56cd22a9-fe2c-4beb-ac6f-5f4171371c0a
                © 2018

                http://www.springer.com/tdm

                History

                Comments

                Comment on this article