14
views
0
recommends
+1 Recommend
1 collections
    0
    shares

      Why publish your research Open Access with G3: Genes|Genomes|Genetics?

      Learn more and submit today!

      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Application of an optimized annotation pipeline to the Cryptococcus deuterogattii genome reveals dynamic primary metabolic gene clusters and genomic impact of RNAi loss

      research-article

      Read this article at

      ScienceOpenPublisherPMC
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Evaluating the quality of a de novo annotation of a complex fungal genome based on RNA-seq data remains a challenge. In this study, we sequentially optimized a Cufflinks-CodingQuary-based bioinformatics pipeline fed with RNA-seq data using the manually annotated model pathogenic yeasts Cryptococcus neoformans and Cryptococcus deneoformans as test cases. Our results show that the quality of the annotation is sensitive to the quantity of RNA-seq data used and that the best quality is obtained with 5–10 million reads per RNA-seq replicate. We also showed that the number of introns predicted is an excellent a priori indicator of the quality of the final de novo annotation. We then used this pipeline to annotate the genome of the RNAi-deficient species Cryptococcus deuterogattii strain R265 using RNA-seq data. Dynamic transcriptome analysis revealed that intron retention is more prominent in C. deuterogattii than in the other RNAi-proficient species C. neoformans and C. deneoformans. In contrast, we observed that antisense transcription was not higher in C. deuterogattii than in the two other Cryptococcus species. Comparative gene content analysis identified 21 clusters enriched in transcription factors and transporters that have been lost. Interestingly, analysis of the subtelomeric regions in these three annotated species identified a similar gene enrichment, reminiscent of the structure of primary metabolic clusters. Our data suggest that there is active exchange between subtelomeric regions, and that other chromosomal regions might participate in adaptive diversification of Cryptococcus metabolite assimilation potential.

          Most cited references75

          • Record: found
          • Abstract: found
          • Article: not found

          Fast gapped-read alignment with Bowtie 2.

          As the rate of sequencing increases, greater throughput is demanded from read aligners. The full-text minute index is often used to make alignment very fast and memory-efficient, but the approach is ill-suited to finding longer, gapped alignments. Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Cutadapt removes adapter sequences from high-throughput sequencing reads

              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              HTSeq—a Python framework to work with high-throughput sequencing data

              Motivation: A large choice of tools exists for many standard tasks in the analysis of high-throughput sequencing (HTS) data. However, once a project deviates from standard workflows, custom scripts are needed. Results: We present HTSeq, a Python library to facilitate the rapid development of such scripts. HTSeq offers parsers for many common data formats in HTS projects, as well as classes to represent data, such as genomic coordinates, sequences, sequencing reads, alignments, gene model information and variant calls, and provides data structures that allow for querying via genomic coordinates. We also present htseq-count, a tool developed with HTSeq that preprocesses RNA-Seq data for differential expression analysis by counting the overlap of reads with genes. Availability and implementation: HTSeq is released as an open-source software under the GNU General Public Licence and available from http://www-huber.embl.de/HTSeq or from the Python Package Index at https://pypi.python.org/pypi/HTSeq. Contact: sanders@fs.tum.de
                Bookmark

                Author and article information

                Contributors
                Role: Editor
                Journal
                G3 (Bethesda)
                Genetics
                g3journal
                G3: Genes|Genomes|Genetics
                Oxford University Press
                2160-1836
                February 2021
                11 January 2021
                11 January 2021
                : 11
                : 2
                : jkaa070
                Affiliations
                [1 ] Département de Mycologie, Institut Pasteur, Unité Biologie des ARN des Pathogènes Fongiques , F-75015 Paris, France
                [2 ] Programa de Pós-Graduação em Biologia Celular e Molecular, Centro de Biotecnologia, Universidade Federal do Rio Grande do Sul , Porto Alegre 15005, Brazil
                [3 ] Département Biologie Computationnelle, Institut Pasteur, HUB Bioinformatique et Biostatistique, C3BI , USR 3756 IP CNRS, F-75015 Paris, France
                [4 ] Department of Molecular Genetics and Microbiology, Duke University Medical Center , Durham, NC 27710, USA
                [5 ] Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard , Cambridge, MA 02142, USA
                Author notes

                Both authors should be considered as senior authors.

                Corresponding author: Institut Pasteur, Unité Biologie des ARN des Pathogènes Fongiques, Département de Mycologie, 25 rue du Dr Roux, 75015 Paris, France. janbon@ 123456pasteur.fr
                Author information
                https://orcid.org/0000-0002-5778-960X
                https://orcid.org/0000-0002-2433-6903
                https://orcid.org/0000-0002-4788-1154
                Article
                jkaa070
                10.1093/g3journal/jkaa070
                8022950
                33585873
                dc07e74d-494c-464f-8700-7f155546400b
                © The Author(s) 2021. Published by Oxford University Press on behalf of Genetics Society of America.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 01 September 2020
                : 24 December 2020
                : 14 December 2020
                Page count
                Pages: 18
                Categories
                Fungal Genetics and Genomics
                AcademicSubjects/SCI01180
                AcademicSubjects/SCI01140
                AcademicSubjects/SCI00010
                AcademicSubjects/SCI00960

                Genetics
                cryptococus deuterogattii,genome annotation pipeline,rnai,metabolic gene cluster
                Genetics
                cryptococus deuterogattii, genome annotation pipeline, rnai, metabolic gene cluster

                Comments

                Comment on this article

                scite_
                0
                0
                0
                0
                Smart Citations
                0
                0
                0
                0
                Citing PublicationsSupportingMentioningContrasting
                View Citations

                See how this article has been cited at scite.ai

                scite shows how a scientific paper has been cited by providing the context of the citation, a classification describing whether it supports, mentions, or contrasts the cited claim, and a label indicating in which section the citation was made.

                Similar content185

                Cited by7

                Most referenced authors1,697