26
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Comprehensive genetic diagnosis of tandem repeat expansion disorders with programmable targeted nanopore sequencing

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          More than 50 neurological and neuromuscular diseases are caused by short tandem repeat (STR) expansions, with 37 different genes implicated to date. We describe the use of programmable targeted long-read sequencing with Oxford Nanopore’s ReadUntil function for parallel genotyping of all known neuropathogenic STRs in a single assay. Our approach enables accurate, haplotype-resolved assembly and DNA methylation profiling of STR sites, from a list of predetermined candidates. This correctly diagnoses all individuals in a small cohort ( n = 37) including patients with various neurogenetic diseases ( n = 25). Targeted long-read sequencing solves large and complex STR expansions that confound established molecular tests and short-read sequencing and identifies noncanonical STR motif conformations and internal sequence interruptions. We observe a diversity of STR alleles of known and unknown pathogenicity, suggesting that long-read sequencing will redefine the genetic landscape of repeat disorders. Last, we show how the inclusion of pharmacogenomic genes as secondary ReadUntil targets can further inform patient care.

          Abstract

          Abstract

          Programmable targeted nanopore sequencing profiles all known pathogenic short tandem repeats (STRs) in a single test.

          Related collections

          Most cited references87

          • Record: found
          • Abstract: found
          • Article: not found

          Minimap2: pairwise alignment for nucleotide sequences

          Heng Li (2018)
          Recent advances in sequencing technologies promise ultra-long reads of ∼100 kb in average, full-length mRNA or cDNA reads in high throughput and genomic contigs over 100 Mb in length. Existing alignment programs are unable or inefficient to process such data at scale, which presses for the development of new alignment algorithms.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Assembly of long, error-prone reads using repeat graphs

            Accurate genome assembly is hampered by repetitive regions. Although long single molecule sequencing reads are better able to resolve genomic repeats than short-read data, most long-read assembly algorithms do not provide the repeat characterization necessary for producing optimal assemblies. Here, we present Flye, a long-read assembly algorithm that generates arbitrary paths in an unknown repeat graph, called disjointigs, and constructs an accurate repeat graph from these error-riddled disjointigs. We benchmark Flye against five state-of-the-art assemblers and show that it generates better or comparable assemblies, while being an order of magnitude faster. Flye nearly doubled the contiguity of the human genome assembly (as measured by the NGA50 assembly quality metric) compared with existing assemblers.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Tandem repeats finder: a program to analyze DNA sequences.

              G. Benson (1999)
              A tandem repeat in DNA is two or more contiguous, approximate copies of a pattern of nucleotides. Tandem repeats have been shown to cause human disease, may play a variety of regulatory and evolutionary roles and are important laboratory and analytic tools. Extensive knowledge about pattern size, copy number, mutational history, etc. for tandem repeats has been limited by the inability to easily detect them in genomic sequence data. In this paper, we present a new algorithm for finding tandem repeats which works without the need to specify either the pattern or pattern size. We model tandem repeats by percent identity and frequency of indels between adjacent pattern copies and use statistically based recognition criteria. We demonstrate the algorithm's speed and its ability to detect tandem repeats that have undergone extensive mutational change by analyzing four sequences: the human frataxin gene, the human beta T cellreceptor locus sequence and two yeast chromosomes. These sequences range in size from 3 kb up to 700 kb. A World Wide Web server interface atc3.biomath.mssm.edu/trf.html has been established for automated use of the program.
                Bookmark

                Author and article information

                Contributors
                Role: Data curationRole: Formal analysisRole: InvestigationRole: MethodologyRole: ResourcesRole: SoftwareRole: Validation
                Role: ConceptualizationRole: Data curationRole: Formal analysisRole: InvestigationRole: MethodologyRole: SoftwareRole: ValidationRole: VisualizationRole: Writing - original draftRole: Writing - review & editing
                Role: Formal analysisRole: InvestigationRole: MethodologyRole: SoftwareRole: ValidationRole: Writing - original draft
                Role: Formal analysisRole: ResourcesRole: Software
                Role: ConceptualizationRole: InvestigationRole: MethodologyRole: SupervisionRole: VisualizationRole: Writing - review & editing
                Role: InvestigationRole: Resources
                Role: ResourcesRole: Writing - review & editing
                Role: InvestigationRole: ResourcesRole: Writing - review & editing
                Role: ResourcesRole: ValidationRole: Writing - review & editing
                Role: ConceptualizationRole: InvestigationRole: ResourcesRole: ValidationRole: Writing - review & editing
                Role: ConceptualizationRole: Data curationRole: Formal analysisRole: Funding acquisitionRole: InvestigationRole: MethodologyRole: Project administrationRole: ResourcesRole: SoftwareRole: SupervisionRole: ValidationRole: VisualizationRole: Writing - original draftRole: Writing - review & editing
                Role: InvestigationRole: ValidationRole: Writing - review & editing
                Role: InvestigationRole: Resources
                Role: Funding acquisitionRole: InvestigationRole: ResourcesRole: SupervisionRole: ValidationRole: Writing - review & editing
                Role: Funding acquisitionRole: ResourcesRole: Writing - review & editing
                Role: Resources
                Role: ConceptualizationRole: Funding acquisitionRole: InvestigationRole: Project administrationRole: ResourcesRole: SupervisionRole: Writing - review & editing
                Role: Writing - review & editing
                Role: ConceptualizationRole: ResourcesRole: ValidationRole: Writing - review & editing
                Role: ConceptualizationRole: Data curationRole: Formal analysisRole: InvestigationRole: MethodologyRole: Project administrationRole: ResourcesRole: SupervisionRole: ValidationRole: VisualizationRole: Writing - review & editing
                Role: ConceptualizationRole: Data curationRole: Formal analysisRole: Funding acquisitionRole: MethodologyRole: Project administrationRole: SupervisionRole: ValidationRole: VisualizationRole: Writing - original draftRole: Writing - review & editing
                Journal
                Sci Adv
                Sci Adv
                sciadv
                advances
                Science Advances
                American Association for the Advancement of Science
                2375-2548
                March 2022
                04 March 2022
                : 8
                : 9
                : eabm5386
                Affiliations
                [1 ]Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Sydney, NSW, Australia.
                [2 ]School of Medicine, University of New South Wales, Sydney, NSW, Australia.
                [3 ]St Vincent’s Clinical School, Faculty of Medicine, University of New South Wales, Sydney, NSW, Australia.
                [4 ]School of Computer Science and Engineering, University of New South Wales, Sydney, NSW, Australia.
                [5 ]Garvan-Weizmann Centre for Cellular Genomics, Garvan Institute of Medical Research, Sydney, NSW, Australia.
                [6 ]The University of Sydney, Brain and Mind Centre and School of Medical Sciences, Faculty of Medicine and Health, Camperdown, NSW, Australia.
                [7 ]Harry Perkins Institute of Medical Research, University of Western Australia, Nedlands, WA, Australia.
                [8 ]Diagnostic Genomics, PathWest Laboratory Medicine WA, Nedlands, WA, Australia.
                [9 ]Westmead Hospital, Westmead, NSW, Australia and Sydney Medical School, The University of Sydney, NSW, Australia.
                [10 ]Department of Neurology, Royal North Shore Hospital and The University of Sydney, Sydney, NSW, Australia.
                [11 ]Neuromuscular Diseases, UCL Queen Square Institute of Neurology, London, UK.
                [12 ]The National Hospital for Neurology and Neurosurgery, London, UK.
                [13 ]Raphael Recanati Genetics Institute, Rabin Medical Center, Beilinson Hospital, Petah Tikva, Israel.
                [14 ]The Neurology Department, Rabin Medical Center, Beilinson Hospital, Petah Tikva, Israel.
                [15 ]Northcott Neuroscience Laboratory, ANZAC Research Institute, Sydney, NSW, Australia.
                [16 ]Faculty of Health and Medicine, University of Sydney, Camperdown, NSW, Australia.
                [17 ]Molecular Medicine Laboratory, Concord Hospital, Concord, NSW, Australia.
                [18 ]Neurology Department, Central Clinical School, Concord Repatriation General Hospital, University of Sydney, Concord, NSW, Australia.
                Author notes
                [* ]Corresponding author. Email: i.deveson@ 123456garvan.org.au
                [†]

                These authors contributed equally to this work.

                Author information
                https://orcid.org/0000-0002-7713-1979
                https://orcid.org/0000-0002-8106-5011
                https://orcid.org/0000-0002-9034-9905
                https://orcid.org/0000-0002-6192-6937
                https://orcid.org/0000-0002-9003-0101
                https://orcid.org/0000-0001-9700-7898
                https://orcid.org/0000-0003-3085-2282
                https://orcid.org/0000-0001-7973-3773
                https://orcid.org/0000-0002-2866-7777
                https://orcid.org/0000-0002-2211-0257
                https://orcid.org/0000-0003-0422-8398
                https://orcid.org/0000-0002-0626-9030
                https://orcid.org/0000-0001-5111-3732
                https://orcid.org/0000-0001-5354-9134
                https://orcid.org/0000-0003-3332-5074
                https://orcid.org/0000-0003-3482-6962
                https://orcid.org/0000-0003-3861-0472
                Article
                abm5386
                10.1126/sciadv.abm5386
                8896783
                35245110
                ffeb3cab-2721-485d-91d1-56d30af7c4bb
                Copyright © 2022 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works. Distributed under a Creative Commons Attribution NonCommercial License 4.0 (CC BY-NC).

                This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial license, which permits use, distribution, and reproduction in any medium, so long as the resultant use is not for commercial advantage and provided the original work is properly cited.

                History
                : 23 September 2021
                : 11 January 2022
                Funding
                Funded by: Medical Research Futures Fund;
                Award ID: MRF1173594
                Categories
                Research Article
                Biomedicine and Life Sciences
                SciAdv r-articles
                Genetics
                Life Sciences
                Genetics
                Custom metadata
                Lou Notario

                Comments

                Comment on this article