27
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Inferring the molecular and phenotypic impact of amino acid variants with MutPred2

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Identifying pathogenic variants and underlying functional alterations is challenging. To this end, we introduce MutPred2, a tool that improves the prioritization of pathogenic amino acid substitutions over existing methods, generates molecular mechanisms potentially causative of disease, and returns interpretable pathogenicity score distributions on individual genomes. Whilst its prioritization performance is state-of-the-art, a distinguishing feature of MutPred2 is the probabilistic modeling of variant impact on specific aspects of protein structure and function that can serve to guide experimental studies of phenotype-altering variants. We demonstrate the utility of MutPred2 in the identification of the structural and functional mutational signatures relevant to Mendelian disorders and the prioritization of de novo mutations associated with complex neurodevelopmental disorders. We then experimentally validate the functional impact of several variants identified in patients with such disorders. We argue that mechanism-driven studies of human inherited disease have the potential to significantly accelerate the discovery of clinically actionable variants.

          Abstract

          Identifying variants capable of causing genetic disease is challenging. The authors use semisupervised learning to predict pathogenic missense variants and their impacts on protein structure and function, enabling a molecular mechanism-driven approach to studying different types of human disease.

          Related collections

          Most cited references98

          • Record: found
          • Abstract: not found
          • Article: not found

          Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing

            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Standards and Guidelines for the Interpretation of Sequence Variants: A Joint Consensus Recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology

            The American College of Medical Genetics and Genomics (ACMG) previously developed guidance for the interpretation of sequence variants. 1 In the past decade, sequencing technology has evolved rapidly with the advent of high-throughput next generation sequencing. By adopting and leveraging next generation sequencing, clinical laboratories are now performing an ever increasing catalogue of genetic testing spanning genotyping, single genes, gene panels, exomes, genomes, transcriptomes and epigenetic assays for genetic disorders. By virtue of increased complexity, this paradigm shift in genetic testing has been accompanied by new challenges in sequence interpretation. In this context, the ACMG convened a workgroup in 2013 comprised of representatives from the ACMG, the Association for Molecular Pathology (AMP) and the College of American Pathologists (CAP) to revisit and revise the standards and guidelines for the interpretation of sequence variants. The group consisted of clinical laboratory directors and clinicians. This report represents expert opinion of the workgroup with input from ACMG, AMP and CAP stakeholders. These recommendations primarily apply to the breadth of genetic tests used in clinical laboratories including genotyping, single genes, panels, exomes and genomes. This report recommends the use of specific standard terminology: ‘pathogenic’, ‘likely pathogenic’, ‘uncertain significance’, ‘likely benign’, and ‘benign’ to describe variants identified in Mendelian disorders. Moreover, this recommendation describes a process for classification of variants into these five categories based on criteria using typical types of variant evidence (e.g. population data, computational data, functional data, segregation data, etc.). Because of the increased complexity of analysis and interpretation of clinical genetic testing described in this report, the ACMG strongly recommends that clinical molecular genetic testing should be performed in a CLIA-approved laboratory with results interpreted by a board-certified clinical molecular geneticist or molecular genetic pathologist or equivalent.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              A global reference for human genetic variation

              The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.
                Bookmark

                Author and article information

                Contributors
                lilyak@ucsd.edu
                sdmooney@uw.edu
                predrag@northeastern.edu
                Journal
                Nat Commun
                Nat Commun
                Nature Communications
                Nature Publishing Group UK (London )
                2041-1723
                20 November 2020
                20 November 2020
                2020
                : 11
                : 5918
                Affiliations
                [1 ]GRID grid.411377.7, ISNI 0000 0001 0790 959X, Department of Computer Science, , Indiana University, ; Bloomington, IN USA
                [2 ]GRID grid.34477.33, ISNI 0000000122986657, Department of Biomedical Informatics and Medical Education, , University of Washington, ; Seattle, WA USA
                [3 ]GRID grid.266100.3, ISNI 0000 0001 2107 4242, Department of Psychiatry, , University of California San Diego, ; La Jolla, CA USA
                [4 ]GRID grid.5600.3, ISNI 0000 0001 0807 5670, Institute of Medical Genetics, School of Medicine, , Cardiff University, ; Cardiff, UK
                [5 ]GRID grid.266100.3, ISNI 0000 0001 2107 4242, Beyster Center for Genomics of Psychiatric Diseases, , University of California San Diego, ; La Jolla, CA USA
                [6 ]GRID grid.266100.3, ISNI 0000 0001 2107 4242, Department of Cellular and Molecular Medicine, , University of California San Diego, ; La Jolla, CA USA
                [7 ]GRID grid.261112.7, ISNI 0000 0001 2173 3359, Khoury College of Computer Sciences, , Northeastern University, ; Boston, MA USA
                [8 ]GRID grid.147455.6, ISNI 0000 0001 2097 0344, Present Address: Computational Biology Department, School of Computer Science, , Carnegie Mellon University, ; 5000 Forbes Avenue, Pittsburgh, PA 15213 USA
                [9 ]GRID grid.21107.35, ISNI 0000 0001 2171 9311, Present Address: Institute for Computational Medicine, Whiting School of Engineering, , Johns Hopkins University, ; 220 Hackerman Hall, 3400 N Charles St, Baltimore, MD 21218 USA
                [10 ]GRID grid.16821.3c, ISNI 0000 0004 0368 8293, Present Address: School of Biomedical Engineering, , Shanghai Jiao Tong University, ; Shanghai, 200030 People’s Republic of China
                Author information
                http://orcid.org/0000-0002-6312-1577
                http://orcid.org/0000-0001-9496-0149
                http://orcid.org/0000-0002-8943-8484
                http://orcid.org/0000-0002-9087-526X
                http://orcid.org/0000-0002-4542-5219
                http://orcid.org/0000-0002-6769-0793
                Article
                19669
                10.1038/s41467-020-19669-x
                7680112
                33219223
                485b28c6-302b-44f4-a4a9-d47e0c4ada52
                © The Author(s) 2020

                Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

                History
                : 23 July 2019
                : 23 October 2020
                Funding
                Funded by: FundRef https://doi.org/10.13039/100001906, Washington Research Foundation (WRF);
                Award ID: Fund for Innovation in Data-Intensive Discovery
                Award Recipient :
                Funded by: FundRef https://doi.org/10.13039/100000936, Gordon and Betty Moore Foundation (Gordon E. and Betty I. Moore Foundation);
                Award ID: Moore/Sloan Data Science Environments Project
                Award Recipient :
                Funded by: FundRef https://doi.org/10.13039/100000879, Alfred P. Sloan Foundation;
                Award ID: Moore/Sloan Data Science Environments Project
                Award Recipient :
                Funded by: FundRef https://doi.org/10.13039/100000092, U.S. Department of Health & Human Services | NIH | U.S. National Library of Medicine (NLM);
                Award ID: K99 LM012992
                Award ID: R01 LM009722
                Award Recipient :
                Funded by: FundRef https://doi.org/10.13039/100000025, U.S. Department of Health & Human Services | NIH | National Institute of Mental Health (NIMH);
                Award ID: R01 MH076431
                Award ID: R01 MH104766
                Award ID: R01 MH109885
                Award ID: R01 MH105524
                Award Recipient :
                Funded by: FundRef https://doi.org/10.13039/100000893, Simons Foundation;
                Award ID: 345469
                Award Recipient :
                Funded by: U.S. Department of Health & Human Services | NIH | National Institute of Mental Health (NIMH)
                Funded by: U.S. Department of Health & Human Services | NIH | National Institute of Mental Health (NIMH)
                Funded by: U.S. Department of Health & Human Services | NIH | U.S. National Library of Medicine (NLM)
                Funded by: U.S. Department of Health & Human Services | NIH | National Institute of Mental Health (NIMH)
                Categories
                Article
                Custom metadata
                © The Author(s) 2020

                Uncategorized
                computational biology and bioinformatics,machine learning,protein analysis,protein function predictions,protein structure predictions

                Comments

                Comment on this article