27
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Comprehensive genetic diagnosis of tandem repeat expansion disorders with programmable targeted nanopore sequencing

      Read this article at

          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          More than 50 neurological and neuromuscular diseases are caused by short tandem repeat (STR) expansions, with 37 different genes implicated to date. We describe the use of programmable targeted long-read sequencing with Oxford Nanopore’s ReadUntil function for parallel genotyping of all known neuropathogenic STRs in a single assay. Our approach enables accurate, haplotype-resolved assembly and DNA methylation profiling of STR sites, from a list of predetermined candidates. This correctly diagnoses all individuals in a small cohort ( n = 37) including patients with various neurogenetic diseases ( n = 25). Targeted long-read sequencing solves large and complex STR expansions that confound established molecular tests and short-read sequencing and identifies noncanonical STR motif conformations and internal sequence interruptions. We observe a diversity of STR alleles of known and unknown pathogenicity, suggesting that long-read sequencing will redefine the genetic landscape of repeat disorders. Last, we show how the inclusion of pharmacogenomic genes as secondary ReadUntil targets can further inform patient care.

          Abstract

          Programmable targeted nanopore sequencing profiles all known pathogenic short tandem repeats (STRs) in a single test.

          Related collections

          Most cited references87

          • Record: found
          • Abstract: found
          • Article: not found

          Minimap2: pairwise alignment for nucleotide sequences

          Heng Li (2018)
          Recent advances in sequencing technologies promise ultra-long reads of ∼100 kb in average, full-length mRNA or cDNA reads in high throughput and genomic contigs over 100 Mb in length. Existing alignment programs are unable or inefficient to process such data at scale, which presses for the development of new alignment algorithms.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Assembly of long, error-prone reads using repeat graphs

            Accurate genome assembly is hampered by repetitive regions. Although long single molecule sequencing reads are better able to resolve genomic repeats than short-read data, most long-read assembly algorithms do not provide the repeat characterization necessary for producing optimal assemblies. Here, we present Flye, a long-read assembly algorithm that generates arbitrary paths in an unknown repeat graph, called disjointigs, and constructs an accurate repeat graph from these error-riddled disjointigs. We benchmark Flye against five state-of-the-art assemblers and show that it generates better or comparable assemblies, while being an order of magnitude faster. Flye nearly doubled the contiguity of the human genome assembly (as measured by the NGA50 assembly quality metric) compared with existing assemblers.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Tandem repeats finder: a program to analyze DNA sequences.

              G. Benson (1999)
              A tandem repeat in DNA is two or more contiguous, approximate copies of a pattern of nucleotides. Tandem repeats have been shown to cause human disease, may play a variety of regulatory and evolutionary roles and are important laboratory and analytic tools. Extensive knowledge about pattern size, copy number, mutational history, etc. for tandem repeats has been limited by the inability to easily detect them in genomic sequence data. In this paper, we present a new algorithm for finding tandem repeats which works without the need to specify either the pattern or pattern size. We model tandem repeats by percent identity and frequency of indels between adjacent pattern copies and use statistically based recognition criteria. We demonstrate the algorithm's speed and its ability to detect tandem repeats that have undergone extensive mutational change by analyzing four sequences: the human frataxin gene, the human beta T cellreceptor locus sequence and two yeast chromosomes. These sequences range in size from 3 kb up to 700 kb. A World Wide Web server interface atc3.biomath.mssm.edu/trf.html has been established for automated use of the program.
                Bookmark

                Author and article information

                Contributors
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                Journal
                Science Advances
                Sci. Adv.
                American Association for the Advancement of Science (AAAS)
                2375-2548
                March 04 2022
                March 04 2022
                : 8
                : 9
                Affiliations
                [1 ]Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Sydney, NSW, Australia.
                [2 ]School of Medicine, University of New South Wales, Sydney, NSW, Australia.
                [3 ]St Vincent’s Clinical School, Faculty of Medicine, University of New South Wales, Sydney, NSW, Australia.
                [4 ]School of Computer Science and Engineering, University of New South Wales, Sydney, NSW, Australia.
                [5 ]Garvan-Weizmann Centre for Cellular Genomics, Garvan Institute of Medical Research, Sydney, NSW, Australia.
                [6 ]The University of Sydney, Brain and Mind Centre and School of Medical Sciences, Faculty of Medicine and Health, Camperdown, NSW, Australia.
                [7 ]Harry Perkins Institute of Medical Research, University of Western Australia, Nedlands, WA, Australia.
                [8 ]Diagnostic Genomics, PathWest Laboratory Medicine WA, Nedlands, WA, Australia.
                [9 ]Westmead Hospital, Westmead, NSW, Australia and Sydney Medical School, The University of Sydney, NSW, Australia.
                [10 ]Department of Neurology, Royal North Shore Hospital and The University of Sydney, Sydney, NSW, Australia.
                [11 ]Neuromuscular Diseases, UCL Queen Square Institute of Neurology, London, UK.
                [12 ]The National Hospital for Neurology and Neurosurgery, London, UK.
                [13 ]Raphael Recanati Genetics Institute, Rabin Medical Center, Beilinson Hospital, Petah Tikva, Israel.
                [14 ]The Neurology Department, Rabin Medical Center, Beilinson Hospital, Petah Tikva, Israel.
                [15 ]Northcott Neuroscience Laboratory, ANZAC Research Institute, Sydney, NSW, Australia.
                [16 ]Faculty of Health and Medicine, University of Sydney, Camperdown, NSW, Australia.
                [17 ]Molecular Medicine Laboratory, Concord Hospital, Concord, NSW, Australia.
                [18 ]Neurology Department, Central Clinical School, Concord Repatriation General Hospital, University of Sydney, Concord, NSW, Australia.
                Article
                10.1126/sciadv.abm5386
                ffeb3cab-2721-485d-91d1-56d30af7c4bb
                © 2022
                History

                Comments

                Comment on this article