8
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      MMSplice: modular modeling improves the predictions of genetic variant effects on splicing

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Predicting the effects of genetic variants on splicing is highly relevant for human genetics. We describe the framework MMSplice (modular modeling of splicing) with which we built the winning model of the CAGI5 exon skipping prediction challenge. The MMSplice modules are neural networks scoring exon, intron, and splice sites, trained on distinct large-scale genomics datasets. These modules are combined to predict effects of variants on exon skipping, splice site choice, splicing efficiency, and pathogenicity, with matched or higher performance than state-of-the-art. Our models, available in the repository Kipoi, apply to variants including indels directly from VCF files.

          Electronic supplementary material

          The online version of this article (10.1186/s13059-019-1653-z) contains supplementary material, which is available to authorized users.

          Related collections

          Most cited references11

          • Record: found
          • Abstract: found
          • Article: not found

          Splicing regulation: from a parts list of regulatory elements to an integrated splicing code.

          Alternative splicing of pre-mRNAs is a major contributor to both proteomic diversity and control of gene expression levels. Splicing is tightly regulated in different tissues and developmental stages, and its disruption can lead to a wide range of human diseases. An important long-term goal in the splicing field is to determine a set of rules or "code" for splicing that will enable prediction of the splicing pattern of any primary transcript from its sequence. Outside of the core splice site motifs, the bulk of the information required for splicing is thought to be contained in exonic and intronic cis-regulatory elements that function by recruitment of sequence-specific RNA-binding protein factors that either activate or repress the use of adjacent splice sites. Here, we summarize the current state of knowledge of splicing cis-regulatory elements and their context-dependent effects on splicing, emphasizing recent global/genome-wide studies and open questions.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Predictive identification of exonic splicing enhancers in human genes.

            Specific short oligonucleotide sequences that enhance pre-mRNA splicing when present in exons, termed exonic splicing enhancers (ESEs), play important roles in constitutive and alternative splicing. A computational method, RESCUE-ESE, was developed that predicts which sequences have ESE activity by statistical analysis of exon-intron and splice site composition. When large data sets of human gene sequences were used, this method identified 10 predicted ESE motifs. Representatives of all 10 motifs were found to display enhancer activity in vivo, whereas point mutants of these sequences exhibited sharply reduced activity. The motifs identified enable prediction of the splicing phenotypes of exonic mutations in human genes.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              Hyperopt: a Python library for model selection and hyperparameter optimization

                Bookmark

                Author and article information

                Contributors
                jane.e.doe@cambridge.co.uk
                gagneur@in.tum.de
                Journal
                Genome Biol
                Genome Biol
                Genome Biology
                BioMed Central (London )
                1474-7596
                1474-760X
                1 March 2019
                1 March 2019
                2019
                : 20
                : 48
                Affiliations
                [1 ]ISNI 0000000123222966, GRID grid.6936.a, Department of Informatics, , Technical University of Munich, Boltzmannstraße, ; Garching, 85748 Germany
                [2 ]ISNI 0000 0004 1936 973X, GRID grid.5252.0, Graduate School of Quantitative Biosciences (QBM), Ludwig-Maximilians-Universität München, ; München, Germany
                [3 ]ISNI 0000 0004 1936 9094, GRID grid.40263.33, Center for Computational Molecular Biology, Brown University, ; Providence, Rhode Island USA
                [4 ]ISNI 0000 0004 1936 9094, GRID grid.40263.33, Department of Molecular Biology, Cell Biology and Biochemistry, , Brown University, ; Providence, Rhode Island USA
                Author information
                http://orcid.org/0000-0002-8924-8365
                Article
                1653
                10.1186/s13059-019-1653-z
                6396468
                30823901
                12671ff7-e308-424d-8190-52ef9230faed
                © The Author(s) 2019

                Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License ( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

                History
                : 2 October 2018
                : 12 February 2019
                Categories
                Method
                Custom metadata
                © The Author(s) 2019

                Genetics
                splicing,variant effect,variant pathogenicity,deep learning,modular modeling
                Genetics
                splicing, variant effect, variant pathogenicity, deep learning, modular modeling

                Comments

                Comment on this article